NYSBC course: nextPYP practical (day 2)¶
This session demonstrates how explicitly optimizing for fast runtime and giving users flexibility in pre-processing steps can aid in achieving high-quality and high-throughput data acquisition in `nextPYP.
Starting from raw data obtained at the microscope, we’ll build an automatic pipeline that can perform all pre-processing tasks up to and including particle picking.
Dataset¶
For this session we will use the EMPIAR-10164 dataset of HIV-1 purified VLPs.
On-the-fly pre-processing¶
Creating/starting new sessions¶
Sessions allow pre-processing of tilt-series on-the-fly
On your Dashboard, select the Go to Sessions button.
Click the + Start Tomography button.
Give your session a user-readable name by typing in the Name box.
The Parent folder box will be auto-populated with a default location to store the data.
Pick a unique Folder name for your session. There can only be one folder name per session, regardless of the user-readable name!
Select the
Workshopgroup.
Click on the Raw data tab.
Set
Path to raw datato/nfs/bartesaghilab/nextpyp/workshop/10164/TS_*.tif
Click on the Microscope parameters tab.
Set
Pixel size (A)to 1.35Set
Acceleration voltage (kV)to 300Set
Tilt-axis angle (degrees)to 85.3
Click on the Session settings tab.
Set
Number of tiltsto 41Set
Raw data transferto linkLink: Create a symlink between the data on the microscope and the Session folder. The data still only exists at the microscope.Move: Transfer the data from the microscope to the Session folder, removing the data at the microscope. The data will now only exist on the Sessions folder.Copy: Make a copy of the data in the microscope to your Session folder. The data will now exist at both the microscope and your Session folder.
Click on the CTF Determination tab.
Set
Max resolutionto 5
Click on the Virion detection tab.
Set
Virion radiusto 500Set
Virion detection methodto autoSet
Spike detection methodto uniformSet
Size of equatorial band to restrict spike pickingto 800
Click on the Particle detection tab.
Set
Detection methodto noneSet
Detection radiusto 50
Click on the Resources tab.
Set
Split, Threadsto 41
General advice for setting resource limits:
The
Split, Threadsshould match the number of tilts in your tilt series, if you have the computational resources to do so.In general, the more threads you use, the more tilts that can be processed at the same time, and the faster you see pre-processing results.
Click Save, which will automatically take you to the Operations page.
Click Start to launch the session.
Restarting sessions¶
Use the Restart option to make changes to ongoing Sessions
Restart is a “smart” method of re-running only what is necessary after changing pre-processing parameters.
Workflow: Change a parameter → Save settings changes → Restart pre-processing daemon.
Example: Changing the minimum distance between spikes
Go to the Virion detection tab
Increase Minimum distance between spikes (voxels) to 50
Click Save
Navigate to Operations tab
Click Restart on pre-processing daemon
Open Logs to check that the restart flag has been detected and new pre-processing jobs will be launched in response to this change.
Check the Tilt Series tab to see that fewer particles have been picked.
Using the Clear option
Clear will start pre-processing procedure from scratch
This is helpful if you want to start fresh making sure any previous pre-processing results are ignored.
Copying/deleting sessions¶
Quickly create a session by copying an existing one
Sessions can be copied or deleted.
Click the icon to find the session’s file storage location.
Warning
Deleting a session whose mode of file transfer was set to Move will delete all files (including the raw data)!.
Importing/exporting sessions¶
Exporting a session in .star format
Sessions can be exported in .star format for downstream processing and refinement in other software.
Navigate to the Table tab.
In the Filters box, type a name for your exported session.
Click Export to launch the export job. The job’s log will indicate the location of the exported
.starfile.
Importing a session into a project
Since Sessions also perform pre-processing, we can import a finished Session into a project to kick-start the process of structure determination.
Click the Dashboard link to go back to nextPYP’s homepage.
Click the Create New Project button and give your project a name.
Click the Import Data button, and select the option Tomography (from Session).
Search for the name of the session you wish to import.
Click the Save button, and then launch the job.
Day 2 summary¶
What we learned today
In this session we learned how to run on-the-fly pre-processing in nextPYP:
Continuously monitor raw data folder for incoming tilt-series
Raw data transfer (copy, link, move)
Pre-processing (frame alignment, tilt-series alignment, CTF estimation, and tomogram reconstruction)
Particle picking (geometry-based, size-based, etc.)
Restart, clear, copy or delete sessions
Import and export sessions
Feel free to revisit day 1 or explore other options available in nextPYP as described in the User Guide.