A novel neural network technique for analysis and classification of EM single-particle images.

We propose a novel self-organizing neural network for the unsupervised classification of electron microscopy (EM) images of biological macromolecules. The radical novelty of the algorithm lies in its rigorous mathematical formulation that, starting from a large set of possibly very noisy input data, finds a set of "representative" data items, organized onto an ordered output map, such that the probability density of this set of representative items resembles at its possible best the probability density of the input data. In a way, it summarizes large amounts of information into a concise description that rigorously keeps the basic pattern of the input data distribution. In this application to the field of three-dimensional EM of single particles, two different data sets have been used; one comprised 2458 rotational power spectra of individual negative stain images of the G40P helicase of Bacillus subtilis bacteriophage SPP1, and the other contained 2822 cryoelectron images of SV40 large T-antigen. Our experimental results prove that this technique is indeed very successful, providing the user with the capability of exploring complex patterns in a succinct, informative, and objective manner. The above facts, together with the consideration that the integration of this new algorithm with commonly used software packages is immediate, prompt us to propose it as a valuable new tool in the analysis of large collections of noisy data.

[1]  J M Carazo,et al.  Polymorphic quaternary organization of the Bacillus subtilis bacteriophage SPP1 replicative helicase (G40 P). , 1998, Journal of molecular biology.

[2]  J Frank,et al.  Classification of images of biomolecular assemblies: a study of ribosomes and ribosomal subunits of Escherichia coli , 1988, Journal of microscopy.

[3]  J Frank,et al.  Fuzzy sets‐based classification of electron microscopy images of biological macromolecules with an application to ribosomal particles , 1990, Journal of microscopy.

[4]  P. Tegtmeyer,et al.  Functional interactions of the simian virus 40 core origin of replication with flanking regulatory sequences , 1986, Journal of virology.

[5]  Noël Bonnet,et al.  Artificial intelligence and pattern recognition techniques in microscope image processing and analysis , 2000 .

[6]  E. Parzen On Estimation of a Probability Density Function and Mode , 1962 .

[7]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[8]  J M Carazo,et al.  Mapping and fuzzy classification of macromolecular images using self-organizing neural networks. , 2000, Ultramicroscopy.

[9]  M. Valle,et al.  Large T-Antigen Double Hexamers Imaged at the Simian Virus 40 Origin of Replication , 2000, Molecular and Cellular Biology.

[10]  J Frank,et al.  Three-dimensional reconstruction of single particles embedded in ice. , 1992, Ultramicroscopy.

[11]  Edward H. Egelman,et al.  The hexameric E. coli DnaB helicase can exist in different Quaternary states. , 1996, Journal of molecular biology.

[12]  R A Crowther,et al.  Harmonic analysis of electron microscope images with rotational symmetry. , 1971, Journal of molecular biology.

[13]  José María Carazo,et al.  A variant to the “random approximation” of the reference-free alignment algorithm , 1996 .

[14]  J. Frank,et al.  Correspondence analysis of aligned images of biological particles. , 1982, Journal of molecular biology.

[15]  Marin van Heel,et al.  Multivariate statistical classification of noisy images (randomly oriented biological macromolecules) , 1984 .

[16]  Marco,et al.  Xmipp: An Image Processing Package for Electron Microscopy , 1996, Journal of structural biology.

[17]  N. Bonnet,et al.  Multivariate statistical methods for the analysis of microscope image series: applications in materials science , 1998 .

[18]  J M Carazo,et al.  Six molecules of SV40 large T antigen assemble in a propeller-shaped particle around a channel. , 1997, Journal of molecular biology.

[19]  B. Silverman Density estimation for statistics and data analysis , 1986 .

[20]  J M Carazo,et al.  Pattern recognition and classification of images of biological macromolecules using artificial neural networks. , 1994, Biophysical journal.

[21]  Gilles Pagès,et al.  Theoretical aspects of the SOM algorithm , 1998, Neurocomputing.

[22]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[23]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[24]  J. Frank,et al.  Use of multivariate statistics in analysing the images of biological macromolecules. , 1981, Ultramicroscopy.

[25]  P. Tegtmeyer,et al.  Domain structure of the simian virus 40 core origin of replication , 1986, Molecular and cellular biology.

[26]  E. Fanning,et al.  Structure and function of simian virus 40 large tumor antigen. , 1992, Annual review of biochemistry.

[27]  Michel Herbin,et al.  A clustering method based on the estimation of the probability density function and on the skeleton by influence zones. Application to image processing , 1996, Pattern Recognit. Lett..