Classification tutorial (EMPIAR-10304)

This tutorial shows how to convert raw tilt-series from EMPIAR-10304 (E. coli. ribosomes) into a ~4.9A resolution structure and resolve 8 different conformations.

We first use the command line to download and decompress a tbz file containing: 1) a script to download the raw tilt-series from EMPIAR, 2) corresponding metadata with tilt angles and acquisition order, and 3) an initial model:

# cd to a location in the shared file system and run:

wget https://nextpyp.app/files/data/nextpyp_class_tutorial.tbz
tar xfz nextpyp_class_tutorial.tbz
source download_10304.sh

Note

Depending on the speed of your network connection, downloading the raw data from EMPIAR can take several minutes.

Open your browser and navigate to the url of your nextPYP instance (e.g., https://nextpyp.myorganization.org).

Step 1: Create a new project

Data processing runs are organized into projects. We will create a new project for this tutorial
  • The first time you login into nextPYP, you should see an empty Dashboard:

    Create new project
  • Click on Create new project, give the project a name, and select Create

  • Select the new project from the Dashboard and click Open

  • The newly created project will be empty and a Jobs panel will appear on the right

Step 2: Import raw tilt-series

Import the raw tilt-series downloaded above ( <1 min)
  • Go to Import Data and select Tomography (from Raw Data)

    Import dialog
  • A form to enter parameters will appear:

    File browser
  • Go to the Raw data tab:

    • Set the Location of the raw data by clicking on the icon and browsing to the directory where the you downloaded the raw movie frames

    • Type tilt*.mrc in the filter box (lower right) and click on the icon to verify your selection. 12 matches should be displayed

    • Click Choose File Pattern to save your selection

    • Click on the Microscope parameters tab

    • Set Pixel size (A) to 2.1

    • Set Acceleration voltage (kV) to 300

  • Click Save and the new block will appear on the project page

  • The block is in the modified state (indicated by the sign) and is ready to be executed

  • Clicking the button Run will show another dialog where you can select which blocks to run:

  • Since there is only one block available, simply click on Start Run for 1 block. This will launch a process that reads one tilt image, applies the gain reference (if applicable) and displays the resulting image inside the block

  • Click inside the block to see a larger version of the image

Step 3: Pre-processing

4 min - Movie frame alignment, CTF estimation and tomogram reconstruction
  • Click on Tilt-series (output of the Tomography (from Raw Data) block) and select Pre-processing

  • Go to the Frame alignment tab:

    • Check Single-file tilt-series

    • Click on the CTF determination tab

    • Set Max resolution to 5.0

    • Click on the Tilt-series alignment tab

    • Uncheck Resize squares to closest multiple of 512

    • Click on the Tomogram reconstruction tab

    • Set Thickness of reconstruction (unbinned voxels) to 3072

    • Set Binning factor for reconstruction to 12

    • Check Erase fiducials

    • Click on the Resources tab

  • Click Save, Run, and Start Run for 1 block. Follow the status of the run in the Jobs panel

Step 4: Particle picking

Particle detection from tomograms ( 2 min)
  • Click on Tomograms (output of the Pre-processing block) and select Particle picking

  • Go to the Particle detection tab:

    • Set Detection method to size-based

    • Set Particle radius (A) to 80

    • Set Threshold for contamination detection to 2.0

    • Set Minimum contamination size (voxels) to 60

    • Set Minimum distance between particles to 2

    • Check Local refinement

    • Set Z-axis detection range (binned slices) to 40

    • Set Particle detection threshold to 2

  • Click Save, Run, and Start Run for 1 block

  • Navigate to the Particle picking block to inspect the coordinates

Note

In this tutorial, we use the size-based method for particle detection. Other methods are available, including manual, geometry-based, and neural network-based picking, and molecular pattern mining.

Step 5: Reference-based refinement

Reference-based particle alignment ( 14 hr)
  • Click on Particles (output of the Particle picking block) and select Reference-based refinement

  • Go to the Sample tab:

    • Set Molecular weight (kDa) to 2000

    • Set Particle radius (A) to 150

    • Click on the Particle extraction tab

    • Set Box size (pixels) to 64

    • Set Image binning to 4

    • Uncheck Skip gold fiducials

    • Check Invert CTF handedness

    • Click on the Particle scoring function tab

    • Set First tilt for refinement to 15

    • Set Last tilt for refinement to 25

    • Set Max resolution (A) to 22.0

    • Click on the Reference-based refinement tab

    • Specify the location of the Initial model (*.mrc) by clicking on the icon , navigating to the folder where you downloaded the data for the tutorial, and selecting the file EMPIAR-10304_init_ref.mrc

    • Set Particle rotation Phi range (degrees), Particle rotation Psi range (degrees) and Particle rotation Theta range (degrees)` to 180

    • Set Rotation step (degrees) to 6.0

    • Set Particle translation range (A) to 50

    • Click on the Reconstruction tab

    • Check Show advanced options

    • Set Max tilt-angle (degrees) to 50

    • Set Min tilt-angle (degrees) to -50

    • Click on the Resources tab

    • Set Split, Threads to the maximum allowable by your system

  • Save your changes, click Run and Start Run for 1 block

  • One round of refinement and reconstruction will be executed. Click inside the block to see the results

Step 6. Filter particles

Identify duplicates and particles with low alignment scores ( 3 min)
  • Click on Particles (output of the Reference-based refinement block) and select Particle filtering

  • Go to the Particle filtering tab:

    • Specify the location of Input parameter file (*.bz2) by clicking on the icon and selecting the file tomo-reference-refinement-*_r01_02.bz2

    • Set Score threshold to 15

    • Set Min distance between particles (unbinned pixels/voxels) to 20

    • Set Lowest tilt-angle (degrees) to -7

    • Set Highest tilt-angle (degrees) to 7

    • Check Generate reconstruction after filtering

    • Check Permanently remove particles

  • Click Save, Run, and Start Run for 1 block. You can see how many particles were left after filtering by looking at the job logs.

Step 7. Fully constrained refinement

Tilt-geometry parameters and particle poses are refined in this step ( 10 min)
  • Click on Particles (output of the Particle filtering block) and select 3D refinement

  • Go to the Particle extraction tab:

    • Set Box size (pixels) to 256

    • Set Image binning to 1

    • Click on the Particle scoring function tab

    • Set Max resolution (A) to 18:14

    • Click on the Refinement tab

    • Select the location of the Initial parameter file (*.bz2) by clicking on the icon and selecting the file tomo-fine-refinement-*_r01_clean.bz2

    • Set Last iteration to 3

    • Check Refine tilt-geometry

    • Check Refine particle alignments

    • Set Particle translation range (A) to 30.0

    • Click on the Reconstruction tab

    • Check Apply dose weighting

    • Check Global weights

  • Click Save, Run, and Start Run for 1 block to execute three rounds of refinement and reconstruction

  • Click inside the 3D refinement block to inspect the results

Step 8: Create shape mask

Use most recent reconstruction to create a shape mask ( <1 min)
  • Click on Particles (output of 3D refinement block) and select Masking

  • Go to the Masking tab:

    • Select the Input map (*.mrc) by click on the icon and selecting the file tomo-new-coarse-refinement-*_r01_03.mrc

    • Set Threshold for binarization to 0.4

    • Set Width of cosine edge (pixels) to 8

  • Click Save, Run, and Start Run for 1 block to run the job

  • Click on the menu icon of the Masking block, select the Show Filesystem Location option, and Copy the location of the block in the filesystem (we will use this in the next step))

  • Click inside the Masking block to inspect the results of masking

Step 9. Region-based local refinement

Constraints of the tilt-geometry are applied over local regions ( 25 min)
  • Click on Particles (output of 3D refinement block) and select 3D refinement

  • Go to the Particle scoring function tab:

    • Set First tilt for refinement to 18

    • Set Last tilt for refinement to 22

    • Set Max resolution (A) to 18:14:12:10:8:6:5

    • Set Masking strategy to from file

    • Specify the location of the Shape mask (*.mrc) produced in Step 9 by clicking on the icon , navigating to the location of the Masking block by copying the path we saved above, and selecting the file frealign/maps/mask.mrc

    • Click on the Refinement tab

    • Select the location of the Initial parameter file (*.bz2) by clicking on the icon and selecting the file tomo-new-coarse-refinement-*_r01_03.bz2

    • Set Last iteration to 5

    • Set Number of regions to 8,8,2

  • Click Save, Run, and Start Run for 1 block to run the job

  • Click inside the 3D refinement block to inspect the results

Step 10: Particle-based CTF refinement

Per-particle CTF refinement using most recent reconstruction ( 2 hr)
  • Click on Particles (output of 3D refinement block) and select 3D refinement

  • Go to the Particle scoring function tab:

    • Set First tilt for refinement to 15

    • Set Last tilt for refinement to 25

    • Set Max resolution (A) to 4.5

    • Click on the Refinement tab

    • Select the location of the Initial parameter file (*.bz2) by clicking on the icon and selecting the file tomo-new-coarse-refinement-*_r01_05.bz2

    • Uncheck Refine tilt-geometry

    • Uncheck Refine particle alignments

    • Check Refine CTF per-particle

    • Set Defocus 1 range (A) and Defocus 2 range (A) to 2000.0

  • Click Save, Run, and Start Run for 1 block

  • Click inside the 3D refinement block to inspect the results

Step 11: Region-based refinement after CTF refinement

Constraints of the tilt-geometry are applied over local regions ( 20 min)
  • Click on Particles (output of 3D refinement block) and select 3D refinement

  • Go to the Particle scoring function tab:

    • Set First tilt for refinement to 18

    • Set Last tilt for refinement to 22

    • Set Max resolution (A) to 18:14:12:10:8:6:5:4.5:6:5:4.5

    • Click on the Refinement tab

    • Select the location of the Initial parameter file (*.bz2) by clicking on the icon and selecting the file tomo-new-coarse-refinement-*_r01_02.bz2

    • Set Last iteration to 12

    • Check Refine tilt-geometry

    • Check Refine particle alignments

    • Uncheck Refine CTF per-particle

    • Set Number of regions to 16,16,4

    • Set Optimizer - Max step length to 20.0

    • Click on the Reconstruction tab

    • Set Frame weight fraction to 2

  • Click Save, Run, and Start Run for 1 block to run the job

  • Click inside the 3D refinement block to inspect the results

Step 12: 3D classification

Constrained classification ( 3 hr)
  • Click on Particles (output of the Particle refinement block) and select 3D classification

  • Go to the Classification tab:

    • Select the location of the Initial parameter file (*.bz2) by clicking on the icon and selecting the file tomo-new-coarse-refinement-*_r01_02.bz2

    • Set Last iteration to 20

    • Set Number of classes to 8

    • Uncheck Refine particle alignments

    • Click on the Reconstruction tab

    • Specify the location of the External weights by clicking on the icon and selecting the file frealign/weights/global_weight.txt from the file location of the previous block

  • Click Save, Run, and Start Run for 1 block

  • Click inside the 3D classification block to inspect the results

    Tip

    Click on the round blue markers (top right of the page) to inspect different classes or go to the Class view or Classes Movie tabs to show all classes simultaneously

Info

Running times were measured running all tilt-series in parallel on nodes with 124 vCPUs, 720GB RAM, and 3TB of local SSDs