Classification of image data in conjugate representation spaces

Over the past few years, automatic multivariate statistical classification techniques have successfully been used for analyzing large and noisy electron-microscopic data sets. After the raw data are compressed with eigenvector–eigenvalue procedures, classes of images are formed, using unsupervised clustering procedures. The classes elucidate even subtle differences existing within the data set. Here we extend these methods to find classes of pixels or features in the images (or other n-dimensional signals) that exhibit a homogeneous statistical behavior throughout the data set. This feature extraction—itself a form of data compression—is mathematically entirely symmetric to the determination of the image classes and also serves the purpose of revealing the information present in the set of input images. The properties of simultaneous representations of the image-space and feature-space data onto the same two-dimensional map are discussed in relation to the metrics used in both spaces. Model data are used to illustrate the basic ideas.

[1]  Marin van Heel,et al.  IMAGIC - A FAST, FLEXIBLE AND FRIENDLY IMAGE-ANALYSIS SOFTWARE SYSTEM , 1981 .

[2]  W. T. WILLIAMS,et al.  Nodal Analysis of Associated Populations , 1961, Nature.

[3]  Joachim Frank,et al.  Use of multivariate statistics in analysing the images of biological macromolecules , 1981 .

[4]  M. Heel,et al.  Classification of very large electron microscopical image data sets , 1989 .

[5]  M. Heel,et al.  Characteristic views of E. coli and B. stearothermophilus 30S ribosomal subunits in the electron microscope. , 1985, The EMBO journal.

[6]  Marin van Heel,et al.  Multivariate statistical classification of noisy images (randomly oriented biological macromolecules) , 1984 .

[7]  N. Bonnet,et al.  Application of multivariate statistical analysis to energetic image series , 1988 .

[8]  J Frank,et al.  Variations of the three-dimensional structure of the Escherichia coli ribosome in the range of overlap views. An application of the methods of multicone and local single-cone three-dimensional reconstruction. , 1989, Biophysical journal.

[9]  W. T. WILLIAMS,et al.  Concentration of Entries in Binary Arrays , 1966, Nature.

[10]  M van Heel,et al.  Statistical image analysis of electron micrographs of ribosomal subunits. , 1988, Methods in enzymology.

[11]  J Frank,et al.  Reconstitution of molecule images analysed by correspondence analysis: A tool for structural interpretation , 1986, Journal of Microscopy.

[12]  Harald Labischinski,et al.  The differentiation and identification of pathogenic bacteria using FT-IR and multivariate statistical analysis , 1988 .

[13]  J. H. Ward Hierarchical Grouping to Optimize an Objective Function , 1963 .