A unified framework for packing deformable and non-deformable subcellular structures in crowded cryo-electron tomogram simulation

Background Cryo-electron tomography is an important and powerful technique to explore the structure, abundance, and location of ultrastructure in a near-native state. It contains detailed information of all macromolecular complexes in a sample cell. However, due to the compact and crowded status, the missing edge effect, and low signal to noise ratio (SNR), it is extremely challenging to recover such information with existing image processing methods. Cryo-electron tomogram simulation is an effective solution to test and optimize the performance of the above image processing methods. The simulated images could be regarded as the labeled data which covers a wide range of macromolecular complexes and ultrastructure. To approximate the crowded cellular environment, it is very important to pack these heterogeneous structures as tightly as possible. Besides, simulating non-deformable and deformable components under a unified framework also need to be achieved. Result In this paper, we proposed a unified framework for simulating crowded cryo-electron tomogram images including non-deformable macromolecular complexes and deformable ultrastructures. A macromolecule was approximated using multiple balls with fixed relative positions to reduce the vacuum volume. A ultrastructure, such as membrane and filament, was approximated using multiple balls with flexible relative positions so that this structure could deform under force field. In the experiment, 400 macromolecules of 20 representative types were packed into simulated cytoplasm by our framework, and numerical verification proved that our method has a smaller volume and higher compression ratio than the baseline single-ball model. We also packed filaments, membranes and macromolecules together, to obtain a simulated cryo-electron tomogram image with deformable structures. The simulated results are closer to the real Cryo-ET, making the analysis more difficult. The DOG particle picking method and the image segmentation method are tested on our simulation data, and the experimental results show that these methods still have much room for improvement. Conclusion The proposed multi-ball model can achieve more crowded packaging results and contains richer elements with different properties to obtain more realistic cryo-electron tomogram simulation. This enables users to simulate cryo-electron tomogram images with non-deformable macromolecular complexes and deformable ultrastructures under a unified framework. To illustrate the advantages of our framework in improving the compression ratio, we calculated the volume of simulated macromolecular under our multi-ball method and traditional single-ball method. We also performed the packing experiment of filaments and membranes to demonstrate the simulation ability of deformable structures. Our method can be used to do a benchmark by generating large labeled cryo-ET dataset and evaluating existing image processing methods. Since the content of the simulated cryo-ET is more complex and crowded compared with previous ones, it will pose a greater challenge to existing image processing methods.

[1]  Laxmikant V. Kalé,et al.  Scalable molecular dynamics with NAMD , 2005, J. Comput. Chem..

[2]  Eric P. Xing,et al.  Deep Learning Based Supervised Semantic Segmentation of Electron Cryo-Subtomograms , 2018, 2018 25th IEEE International Conference on Image Processing (ICIP).

[3]  Ying Wah Teh,et al.  A study of density-grid based clustering algorithms on data streams , 2011, 2011 Eighth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD).

[4]  Damien Larivière,et al.  Deep learning improves macromolecule identification in 3D cellular cryo-electron tomograms , 2020, Nature Methods.

[5]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[6]  Daniel Baum,et al.  Exploring cavity dynamics in biomolecular systems , 2013, BMC Bioinformatics.

[7]  Valentina Tozzini,et al.  Coarse-grained models for proteins. , 2005, Current opinion in structural biology.

[8]  O. Medalia,et al.  Cellular structural biology as revealed by cryo-electron tomography , 2016, Journal of Cell Science.

[9]  Min Xu,et al.  A convolutional autoencoder approach for mining features in cellular electron cryo-tomograms and weakly supervised coarse segmentation , 2017, Journal of structural biology.

[10]  Hans-Peter Kriegel,et al.  Density‐based clustering , 2011, WIREs Data Mining Knowl. Discov..

[11]  K Schulten,et al.  VMD: visual molecular dynamics. , 1996, Journal of molecular graphics.

[12]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[13]  R. Ellis,et al.  Macromolecular crowding: an important but neglected aspect of the intracellular environment. , 2001 .

[14]  Yuxiang Chen,et al.  PyTom: a python-based toolbox for localization of macromolecules in cryo-electron tomograms and subtomogram analysis. , 2012, Journal of structural biology.

[15]  Lin Song,et al.  Random generalized linear model: a highly accurate and interpretable ensemble predictor , 2013, BMC Bioinformatics.

[16]  Victor C. M. Leung,et al.  Balanced Iterative Reducing and Clustering Using Hierarchies with Principal Component Analysis (PBirch) for Intrusion Detection over Big Data in Mobile Cloud Environment , 2018, SpaCCS.

[17]  Paul D. McNicholas,et al.  Model-Based Clustering , 2016, Journal of Classification.

[18]  Desh Ranjan,et al.  Building the initial chain of the proteins through de novo modeling of the cryo-electron microscopy volume data at the medium resolutions , 2012, BCB.

[19]  Martin Ester,et al.  Density‐based clustering , 2019, WIREs Data Mining Knowl. Discov..

[20]  R. Leapman,et al.  Transfer Learning for Efficient Segmentation of Subcellular Structures in 3-D Electron Microscopy , 2019, Biophysical Journal.

[21]  Min Xu,et al.  Simulating cryo electron tomograms of crowded cell cytoplasm for assessment of automated particle picking , 2016, BMC Bioinformatics.

[22]  Joachim Frank,et al.  Advances in the field of single-particle cryo-electron microscopy over the last decade , 2017, Nature Protocols.

[23]  Conrad C. Huang,et al.  UCSF Chimera—A visualization system for exploratory research and analysis , 2004, J. Comput. Chem..

[24]  Suresh Chandra Satapathy,et al.  Partition Based Clustering Using Genetic Algorithm and Teaching Learning Based Optimization: Performance Analysis , 2015 .

[25]  Chang Liu,et al.  Automatic localization and identification of mitochondria in cellular electron cryo-tomography using faster-RCNN , 2019, BMC Bioinformatics.

[26]  Jing Zhou,et al.  Recovering Turbulence Details using Velocity Correction for SPH Fluids , 2019, SIGGRAPH Asia Technical Briefs.

[27]  Min Xu,et al.  AITom: Open-source AI platform for cryo-electron Tomography data analysis , 2019, ArXiv.

[28]  Ben Wang,et al.  Multiple-scale Simulation Method for Liquid with Trapped Air under Particle-based Framework , 2020, 2020 IEEE Conference on Virtual Reality and 3D User Interfaces (VR).

[29]  Zhiping Weng,et al.  Accelerating Protein Docking in ZDOCK Using an Advanced 3D Convolution Library , 2011, PloS one.

[30]  A. Kolinski,et al.  Coarse-Grained Protein Models and Their Applications. , 2016, Chemical reviews.

[31]  Xiangrui Zeng,et al.  Gum-Net: Unsupervised Geometric Matching for Fast and Accurate 3D Subtomogram Image Alignment and Averaging , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[32]  W F Drew Bennett,et al.  Improved Parameters for the Martini Coarse-Grained Protein Force Field. , 2013, Journal of chemical theory and computation.

[33]  Syma Khalid,et al.  Coarse-grained molecular dynamics simulations of membrane proteins and peptides. , 2007, Journal of structural biology.

[34]  Ying Li,et al.  WGDdetector: a pipeline for detecting whole genome duplication events using the genome or transcriptome annotations , 2019, BMC Bioinformatics.

[35]  Jing Zhou,et al.  Viscosity-based Vorticity Correction for Turbulent SPH Fluids , 2019, 2019 IEEE Conference on Virtual Reality and 3D User Interfaces (VR).

[36]  Sjors H. W. Scheres,et al.  Unravelling biological macromolecules with cryo-electron microscopy , 2016, Nature.

[37]  Min Xu,et al.  De Novo Structural Pattern Mining in Cellular Electron Cryotomograms. , 2015, Structure.

[38]  Ameet Talwalkar,et al.  Foundations of Machine Learning , 2012, Adaptive computation and machine learning.

[39]  Catherine M. Oikonomou,et al.  A new view into prokaryotic cell biology from electron cryotomography , 2016, Nature Reviews Microbiology.

[40]  Tian Xia,et al.  DeepPicker: a Deep Learning Approach for Fully Automated Particle Picking in Cryo-EM , 2016, Journal of structural biology.

[41]  P. Penczek,et al.  A Primer to Single-Particle Cryo-Electron Microscopy , 2015, Cell.