MitoTNT is a Python-based pipeline for the tracking, visualization, and analysis of 4D mitochondrial network data.
It is built upon mitochondria segmentation provided by MitoGraph, and visualization engine provided by ChimeraX.
MitoTNT is written by Zichen (Zachary) Wang ([email protected]), with the help from people in the Johannes Schöneberg lab at UCSD.
On github, download this repository. Alternative clone the repository on terminal:
git clone https://github.com/pylattice/mitoTNT.git
-
Jupyter Notebook or get it from Anaconda.
-
MitoGraph for mitochondria segmentation.
Note that MitoGraph is currently only available on MacOS and Windows, but you can build it from source on Linux. Please contact [email protected] if you need the Linux build. -
ChimeraX for tracking visualization.
We will create a conda environment that automatically installs all the required dependencies.
- Open anaconda prompt on Windows or just terminal on MacOS/Linux
- Go to the root directory of MitoTNT repository
- Create the enviroment using the provided .yml file:
conda env create --name mitotnt --file=mitotnt_env.yml
To use MitoTNT, first activate the environmnet we created with conda activate mitotnt
, and then open notebook with jupyter notebook
.
To perform tracking, mitochondria 3D image stacks needs to be segmented using MitoGraph for all time points.
Example dataset has been provided under test_data/mitograph
, which are MitoGraph-segmented. You can skip this section if using the example dataset.
If your data is a 4D hyperstack, you will need to save it into 3D image stacks for all timepoints.
We have provided a utility script convert_to_tiff.ipynb
under helper_scripts
directory for this purpose.
MitoGraph and MitoTNT usually work with mitochondria in a single cell. This helps to avoid tracking mitochondria across different cells and reduces the computational time significantly. Also, in order to run MitoGraph for each 3D stack, we want to place it under a separate folder at the given timepoint.
Directory structure:
-
frame_0/frame_0.tif
-
frame_1/frame_1.tif
......
We have included a sample cell segmentation script crop_image_using_ImageJ_ROI.ipynb
under helper_scripts
directory, where it can read ImageJ ROI files to extract single cell patches for each timepoint. The users need to draw rectangular boxes in ImageJ and save as .roi
selection files under a ROI folder for the movie.
To run MitoGraph on command line for one snapshot:
MitoGraph -xy lateral_pixel_size -z axial_pixel_size -path tif_dir
Note: you need to replace lateral_pixel_size
and axial_pixel_size
with microscope parameters, otherwise the segmentation is empty
To segment multiple z-stacks for a number of timepoints simultaneously:
for frame in frame*; do MitoGraph -xy lateral_pixel_size -z axial_pixel_size -path "$frame"; done
Segmentation outputs will be saved under each timepoint folder.
Please find the description for the output files here.
We recommend using ChimeraX or ParaView to open .tif
and .vtk
files for visual inspection of the segmentation results.
More information for using MitoGraph and parameters can be found in the publication here.
Open the terminal and type jupyter notebook
to start Jupyter Notebook.
Find the root directory of MitoTNT and open mitotnt_tracking_pipeline.ipynb
for mitochondrial network tracking.
In this section we will process the raw data into a format that is used for the subsequent tracking.
First specify the directories we will use:
-
work_dir
: the directory where data will be processed and stored. For test data, you can use the directory oftest_data
on your machine -
data_dir
: the directory where MitoGraph segmented mitochondria is stored. For test data, this istest_data/mitograph
. -
input_dir
: the directory where the processed inputs used for tracking will be stored. For test data, you can usetest_data/tracking_inputs
. This is an empty folder that will be created.
After specifying the folders, we need to set a few parameters:
-
start_frame
,end_frame
: the range of frames to process. -
node_gap_size
: the number of nodes to skip when creating full-resolution graphs from mitograph.gnet
files. Default to 0 (use all nodes).
All processed inputs will be saved as a single compressed .npz
file in input_dir
.
To start making inputs:
generate_tracking_inputs.generate()
In this section we will perform node assignments for each consecutive frames.
In addition to the directories declared above, we will create output_dir
to store the tracking outputs. For test data, you can use test_data/tracking_outputs
.
Additional parameters needed for frame-to-frame tracking:
tracking_interval
: the frame interval between the two frames to be tracked. Default to 1 (every consecutive frame).
Distance threshold is computed as the minimum of two values:
- the distance to the N-th closest neighbor where N is given by
cutoff_num_neighbor
, default to 10; - the frame interval times the maximum allowed speed given by
cutoff_speed
, default toNone
and estimated from the distance matrix.
graph_matching_depth
: the maximum level used for graph comparison. Default to 2 (usually sufficient).dist_exponent
,top_exponent
: the final cost term is given by Ddist_exponent x Ttop_exponent, where D, T are the distance and topology costs respectively. Default both to 1 (equal weighting).
To run tracking:
network_tracking.frametoframe_tracking()
In this section we attempt to merge tracks that are mistakenly terminated during frame-to-frame tracking.
Additional parameters need to be set:
-
min_track_size
: the minimum number of frames for the tracks to be kept. Default to 5. -
max_gap_size
: the maximum number of frames for which gap closing is allowed. Default to 3. Value of 1 indicates no gap closing. -
block_size_factor
: values less than 1 allows the sliding block implementation of gap closing to prevent memory overflow due to large cost matrix. The size of the block is given byblock_size_factor
* total number of tracks to be closed. Default to 1 (close all tracks at once).
The final node trajectories are saved in final_node_tracks.csv
file.
Each row is one node at one time point.
Each column is an attribute of the given node, described below.
To use gap closing:
network_tracking.gap_closing()
The final node trajectories are saved in final_node_tracks.csv file.
Each row is one node at one time point.
Each column is an attribute of the given node, described below.
-
frame_id
: frame number of the node. -
frame_node_id
: node id at the given frame. Each frame has its own indexing. -
unique_node_id
: node id shared by all the nodes in the same track at different frames. Each track is uniquely indexed throughout the whole trajectory. This is essetially the tracking information. -
frame_seg_id
: segment id for all the nodes in the same segment. Each frame has its own indexing. The branching points are not assigned. -
frame_frag_id
: fragment id for all the nodes in the same connected component. Each frame has its own indexing. -
connected_unique_node_id
: unique_node_id for neigboring nodes in the graph. This has all the topology information. -
x
,y
,z
: coordinates for the node. -
intensity
,width
: pixel intensity and tubular width for the node given by MitoGraph.
In this section we will visualize the tracked mitochondrial networks in ChimeraX
Please first download ChimeraX
We need to specify the directory to save visualization files
-
vis_dir
: store.cxc
commands to load in ChimeraX. You can usework_dir/chimerax_visualization/
for example. -
vis_data_dir
: store.cmap
and.bild
files created for each frame and used for visualization. You can usevis_dir/data/
for example.
We can use MitoGraph-generated *skeleton.vtk
files for visualizing skeleton, but this is not ideal because it has fixed width and color.
Alternatively here, we can render the skeleton using BILD format in ChimeraX. This allows us to set the skeleton sizes, node sizes and color. However, it also takes much longer to load in ChimeraX.
skeleton_colors
: a list of two colors to render for current and next frames. We use blue for current frame and red for next frame. See more colors.skeleton_size
: diameter of the skeleton that connects nodes.node_size
: diameter of the spheres that make up the nodes. Ifnode_size
=skeleton_size
, the nodes are not visible (but still required to fill the gaps along the skeletons).
tracking_visualization.generate_chimerax_skeleton()
We will use the frame-to-frame node assignments to draw the tracking vectors for two frames.
-
arrow_color
: color of the tracking arrows. Default to black. -
arrow_size
: diameter of the arrow stem.
tracking_visualization.generate_tracking_arrows()
Now we can combine the visualization files created above to visualize the tracking of timelapse data.
-
show_tif
: if true include fluorescence cloud in background. -
voxel_size
: voxel_size same as that used for MitoGraph segmentation, in the order of x,y,z.
For example,voxel_size='0.2,0.2,0.4'
refers to lateral pixel size 0.2 μm and axial pixel size 0.4 μm. -
tif_colors
: color of fluorescence cloud. See colors. -
threshold_level
: if you want to change the contrast/thickness of the fluorescence cloud, use the level argument in ChimeraX. -
use_chimerax_skeleton
:
if true use BILD format skeleton which is more flexible but slower to load;
if false use mitograph-generated .vtk files of fixed color and size (not recommended but quicker). -
skeleton_colors
: same as in step 2.
tracking_visualization.visualize_tracking()
Open chimerax_visualization/visualize_tracking.cxc in ChimeraX. This may take some time. Click Home -> Backgound -> White to see it better.
In this section we will detect nodes that undergo fusion or fission events based on the tracking results.
This is done using a sliding-window approach to identify nodes that undergo persistent structural changes as opposed to transient segmentation differences.
First, the fragment indices for each node are recorded for the half_win_size frames
before and after the current frame, to form the fragment list.
Second, for each network edge, the fragment lists for the connected nodes are compared.
Finally, Fission will be declared if the fragment lists before the current frame are strictly identical, as well as the fragment lists after the current frame are strictly non-overlapping.
Since fusion events can be considered as fission events reversed in time, the opposite criterion is used for fusion detection.
Note because of the sliding window approach:
start_frame
must be >= half_win_size
and end_frame
must be <= total number of frames - half_win_size
Please specify:
-
stride_size
: step size for sliding the window in number of frames. Default to 1 (to detect events happening in every frame). -
half_win_size
: size of the half sliding window in number of frames. The higher the value the stricter the requirement for calling fusion/fission. Default to 4. -
min_tracked_frames
: minimum number of frames that are tracked in both half window in order to declare an event. Default to 2. To detect remodeling events:
detect_fusion_fission.detect()
The remodeling events are saved under tracking_output/remodeling_events.csv
Multiple remodeling nodes located in proximity (less than 5 edges away) are grouped into a single fission/fusion site.
Columns in the output:
-
type
: fusion or fission -
frame_id
: a single frame number for describing when the event happens -
frame_id_before
: the frame numbers before the event for each detected node (may be different due to gap closing) -
frame_id_after
: the frame numbers after the event for each detected node (may be different due to gap closing) -
node_id_before
: theframe_node_id
at corresponding frame before the event for each detected node -
node_id_after
: theframe_node_id
at corresponding frame after the event for each detected node -
frag_id_before
: theframe_frag_id
at corresponding frame before the event for each detected node -
frag_id_after
: theframe_frag_id
at corresponding frame after the event for each detected node -
unique_node_id
: theunique_node_id
for each detected node
frame_node_id
, frame_frag_id
, unique_node_id
are as defined in the node tracking outputs
In this section we will use the tracking results to compute diffusivity at three levels of description and visualize motility in space.
The diffusision coeffients can be computed for 1) nodes, 2) segments, 3) fragments, in the order of higher level of coarse graining.
We need to specify the directory for save post analysis results:
-
analysis_dir
: umbrella directory for post analysis. A good choice iswork_dir+'post_analysis/'
. -
analy_motility_dir
: directory for saving motility measurements. This is usually a subfolder inanalysis_dir
, for exampleanalysis_dir+'motility/'
.
Information needed for computing MSD vs. time delay curve:
max_tau
: maximum number of frames/datapoints used for linear regression. Default to 5.
Additional information for computing segment and fragment level diffusivity:
-
selected_frames
: because segment and fragments undergo constant remodeling, we need to select the frames at which the segment and fragments are evaluated. This is recommended to be separated by 2x half window size (see below). Each frame will be visualized separately. -
half_win_size
: the time window size (frames) before and after the selected center frames for collecting track coordinates. Default to 10. -
tracked_ratio
: the minimum ratio of tracked nodes in each segment/fragment to be qualified for calculating diffusivity. Default to 0.5.
To compute diffusivity, use:
post_analysis.compute_node_diffusivity()
post_analysis.compute_segment_diffusivity()
post_analysis.compute_fragment_diffusivity()
The data is saved in analy_motility_dir/*diffusivity.csv
files. Each row is a node/segment/fragment. The columns are explained below.
-
center_frame_id
: selected frame for determining the segment/fragment diffusivity. N/A for node diffusivity. -
unique_node_id
as infinal_node_tracks.csv
.seg_id
,frag_id
are specific tocenter_frame_id
. -
diffusivity
: slope of MSD vs. time delay curve divided by 6 to account for 3D random walk. -
msd
: MSD per frame, euqal to 6 x diffusivity x frame interval -
r_squared
: coefficient of determination for the linear regression. -
num_points
: number of points in MSD vs. time delay curve used for linear regression.
Now we can visualize the computed motility measurements by mapping diffusivity values onto corresponding mitochondrial skeleton nodes with different coloring.
node_size
: the size scale of the nodesselected_frames
: the frames for which the skeleton is to be used for visualization. Each frame will produce its own visualization files. All tracks that include this frame will be shown.
To color diffusivity for the tracks that include the selected frames, use:
post_analysis.map_node_motility_onto_surface()
post_analysis.map_segment_motility_onto_surface()
post_analysis.map_fragment_motility_onto_surface()