NYSBC course: nextPYP practical (day 2)¶
This session demonstrates how explicitly optimizing for fast runtime and giving users flexibility in pre-processing steps can aid in achieving high-quality and high-throughput data acquisition in `nextPYP
.
Starting from raw data obtained at the microscope, we’ll build an automatic pipeline that can perform all pre-processing tasks up to and including particle picking.
Dataset¶
For this session we will use the EMPIAR-10164 dataset of HIV-1 purified VLPs.
On-the-fly pre-processing¶
Creating/starting new sessions¶
Sessions allow pre-processing of tilt-series on-the-fly
On your Dashboard, select the Go to Sessions button.
Click the + Start Tomography button.
Give your session a user-readable name by typing in the Name box.
The Parent folder box will be auto-populated with a default location to store the data.
Pick a unique Folder name for your session. There can only be one folder name per session, regardless of the user-readable name!
Select the
Workshop
group.
Click on the Raw data tab.
Set
Path to raw data
to/nfs/bartesaghilab/nextpyp/workshop/10164/TS_*.tif
Click on the Microscope parameters tab.
Set
Pixel size (A)
to 1.35Set
Acceleration voltage (kV)
to 300Set
Tilt-axis angle (degrees)
to 85.3
Click on the Session settings tab.
Set
Number of tilts
to 41Set
Raw data transfer
to linkLink
: Create a symlink between the data on the microscope and the Session folder. The data still only exists at the microscope.Move
: Transfer the data from the microscope to the Session folder, removing the data at the microscope. The data will now only exist on the Sessions folder.Copy
: Make a copy of the data in the microscope to your Session folder. The data will now exist at both the microscope and your Session folder.
Click on the CTF Determination tab.
Set
Max resolution
to 5
Click on the Virion detection tab.
Set
Virion radius
to 500Set
Virion detection method
to autoSet
Spike detection method
to uniformSet
Size of equatorial band to restrict spike picking
to 800
Click on the Particle detection tab.
Set
Detection method
to noneSet
Detection radius
to 50
Click on the Resources tab.
Set
Split, Threads
to 41
General advice for setting resource limits:
The
Split, Threads
should match the number of tilts in your tilt series, if you have the computational resources to do so.In general, the more threads you use, the more tilts that can be processed at the same time, and the faster you see pre-processing results.
Click Save, which will automatically take you to the Operations page.
Click Start to launch the session.
Restarting sessions¶
Use the Restart
option to make changes to ongoing Sessions
Restart is a “smart” method of re-running only what is necessary after changing pre-processing parameters.
Workflow: Change a parameter → Save settings changes → Restart pre-processing daemon.
Example: Changing the minimum distance between spikes
Go to the Virion detection tab
Increase Minimum distance between spikes (voxels) to 50
Click Save
Navigate to Operations tab
Click Restart on pre-processing daemon
Open Logs to check that the restart flag has been detected and new pre-processing jobs will be launched in response to this change.
Check the Tilt Series tab to see that fewer particles have been picked.
Using the Clear
option
Clear will start pre-processing procedure from scratch
This is helpful if you want to start fresh making sure any previous pre-processing results are ignored.
Copying/deleting sessions¶
Quickly create a session by copying an existing one
Sessions can be copied or deleted.
Click the icon to find the session’s file storage location.
Warning
Deleting a session whose mode of file transfer was set to Move
will delete all files (including the raw data)!.
Importing/exporting sessions¶
Exporting a session in .star
format
Sessions can be exported in .star
format for downstream processing and refinement in other software.
Navigate to the Table tab.
In the Filters box, type a name for your exported session.
Click Export to launch the export job. The job’s log will indicate the location of the exported
.star
file.
Importing a session into a project
Since Sessions also perform pre-processing, we can import a finished Session into a project to kick-start the process of structure determination.
Click the Dashboard link to go back to nextPYP’s homepage.
Click the Create New Project button and give your project a name.
Click the Import Data button, and select the option Tomography (from Session).
Search for the name of the session you wish to import.
Click the Save button, and then launch the job.
Day 2 summary¶
What we learned today
In this session we learned how to run on-the-fly pre-processing in nextPYP
:
Continuously monitor raw data folder for incoming tilt-series
Raw data transfer (copy, link, move)
Pre-processing (frame alignment, tilt-series alignment, CTF estimation, and tomogram reconstruction)
Particle picking (geometry-based, size-based, etc.)
Restart, clear, copy or delete sessions
Import and export sessions
Feel free to revisit day 1 or explore other options available in nextPYP
as described in the User Guide.