Image-derived generative modeling of pseudo-macromolecular structures - towards the statistical assessment of Electron CryoTomography template matching

Cellular Electron CryoTomography (CECT) is a 3D imaging technique that captures information about the structure and spatial organization of macromolecular complexes within single cells, in near-native state and at sub-molecular resolution. Although template matching is often used to locate macromolecules in a CECT image, it is insufficient as it only measures the relative structural similarity. Therefore, it is preferable to assess the statistical credibility of the decision through hypothesis testing, requiring many templates derived from a diverse population of macromolecular structures. Due to the very limited number of known structures, we need a generative model to efficiently and reliably sample pseudo-structures from the complex distribution of macromolecular structures. To address this challenge, we propose a novel image-derived approach for performing hypothesis testing for template matching by constructing generative models using the generative adversarial network. Finally, we conducted hypothesis testing experiments for template matching on both simulated and experimental subtomograms, allowing us to conclude the identity of subtomograms with high statistical credibility and significantly reducing false positives.

[1]  A. Hope A Simplified Monte Carlo Significance Test Procedure , 1968 .

[2]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[3]  J. Mccammon,et al.  Situs: A package for docking crystal structures into low-resolution maps from electron microscopy. , 1999, Journal of structural biology.

[4]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[5]  A S Frangakis,et al.  Toward detecting and identifying macromolecules in a cellular context: template matching applied to electron tomograms. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Friedrich Förster,et al.  TOM software toolbox: acquisition and analysis for electron tomography. , 2005, Journal of structural biology.

[7]  J. Frank Three-Dimensional Electron Microscopy of Macromolecular Assemblies , 2006 .

[8]  Friedrich Förster,et al.  Classification of cryo-electron sub-tomograms using constrained correlation. , 2008, Journal of structural biology.

[9]  R. Aebersold,et al.  Visual proteomics of the human pathogen Leptospira interrogans , 2009, Nature Methods.

[10]  M. Shatsky,et al.  Survey of large protein complexes in D. vulgaris reveals great structural diversity , 2009, Proceedings of the National Academy of Sciences.

[11]  R. Henderson,et al.  Detective quantum efficiency of electron area detectors in electron microscopy , 2009, Ultramicroscopy.

[12]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[13]  Frank Alber,et al.  High-throughput subtomogram alignment and classification by Fourier space constrained fast volumetric matching. , 2012, Journal of structural biology.

[14]  Min Xu,et al.  Automated target segmentation and real space fast alignment methods for high-throughput classification and averaging of crowded cryo-electron subtomograms , 2013, Bioinform..

[15]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[16]  Andrea Vedaldi,et al.  MatConvNet: Convolutional Neural Networks for MATLAB , 2014, ACM Multimedia.

[17]  Keegan Kang,et al.  Feature Representation in Convolutional Neural Networks , 2015, ArXiv.

[18]  Sebastian Scherer,et al.  VoxNet: A 3D Convolutional Neural Network for real-time object recognition , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[19]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[20]  Steven J Ludtke,et al.  Single Particle Tomography in EMAN2 , 2012, Microscopy and Microanalysis.

[21]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[22]  M. Topf,et al.  Two distinct trimeric conformations of natively membrane-anchored full-length herpes simplex virus 1 glycoprotein B , 2016, Proceedings of the National Academy of Sciences.

[23]  Wojciech Zaremba,et al.  Improved Techniques for Training GANs , 2016, NIPS.

[24]  R. Murphy Building cell models and simulations from microscope images. , 2016, Methods.

[25]  Jiajun Wu,et al.  Learning a Probabilistic Latent Space of Object Shapes via 3D Generative-Adversarial Modeling , 2016, NIPS.

[26]  Soumith Chintala,et al.  Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[27]  Ardan Patwardhan,et al.  EMPIAR: a public archive for raw electron microscopy image data , 2016, Nature Methods.

[28]  Neelam Sharma,et al.  An Overview of Various Template Matching Methodologies in Image Processing , 2016 .

[29]  Shoh M. Asano,et al.  In Situ Cryo-Electron Tomography: A Post-Reductionist Approach to Structural Biology. , 2016, Journal of molecular biology.

[30]  Vincent Dumoulin,et al.  Deconvolution and Checkerboard Artifacts , 2016 .

[31]  Jiancheng Lv,et al.  Learning Inverse Mapping by AutoEncoder Based Generative Adversarial Nets , 2017, ICONIP.

[32]  Frank Alber,et al.  TomoMiner and TomoMinerCloud: A Software Platform for Large-Scale Subtomogram Structural Analysis. , 2017, Structure.

[33]  Aaron C. Courville,et al.  Improved Training of Wasserstein GANs , 2017, NIPS.

[34]  Eric P. Xing,et al.  Deep learning-based subdivision approach for large scale macromolecules structure recovery from electron cryo tomograms , 2017, Bioinform..

[35]  Jaakko Lehtinen,et al.  Progressive Growing of GANs for Improved Quality, Stability, and Variation , 2017, ICLR.