Automatic particle picking and multi-class classification in cryo-electron tomograms

Macromolecular structure determination using cryo-electron tomography requires large amount of subtomograms depicting the same molecule, which are averaged. In this paper, we propose a novel automatic particle picking and classification method for cryo-electron tomograms. The workflow comprises two stages: detection and classification. The detection method consists of a template-free picking procedure based on anisotropic diffusion filtering and connected component analysis. For classification, a novel 3D rotation invariant feature descriptor named Sphere Ring Haar and a hierarchical classification algorithm consisting of two machine learning models (DBSCAN and random forest) are proposed. The performance of our method is superior compared to template matching based methods and we achieved over 90% true positive rates for detection of proteasomes and ribosomes in experimental data.

[1]  R. Glaeser,et al.  Limitations to significant information in biological electron microscopy as a result of radiation damage. , 1971, Journal of ultrastructure research.

[2]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[3]  F. Förster,et al.  Identification of macromolecular complexes in cryoelectron tomograms of phantom cells , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[4]  V. Lučić,et al.  Structural studies by electron tomography: from cells to molecules. , 2005, Annual review of biochemistry.

[5]  D. Kriegman,et al.  Automatic particle selection: results of a comparative study. , 2004, Journal of structural biology.

[6]  Yuxiang Chen,et al.  Fast and accurate reference-free alignment of subtomograms. , 2013, Journal of structural biology.

[7]  Nassir Navab,et al.  Detection and identification of macromolecular complexes in cryo-electron tomograms using support vector machines , 2012, 2012 9th IEEE International Symposium on Biomedical Imaging (ISBI).

[8]  Daniel A. Keim,et al.  Optimal Grid-Clustering: Towards Breaking the Curse of Dimensionality in High-Dimensional Clustering , 1999, VLDB.

[9]  Jitendra Malik,et al.  Scale-Space and Edge Detection Using Anisotropic Diffusion , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[11]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[12]  Rolf Adams,et al.  Seeded Region Growing , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Wolfgang Baumeister,et al.  A visual approach to proteomics , 2006, Nature Reviews Molecular Cell Biology.