The exploration machine: a novel method for analyzing high-dimensional data in computer-aided diagnosis

Purpose: To develop, test, and evaluate a novel unsupervised machine learning method for computer-aided diagnosis and analysis of multidimensional data, such as biomedical imaging data. Methods: We introduce the Exploration Machine (XOM) as a method for computing low-dimensional representations of high-dimensional observations. XOM systematically inverts functional and structural components of topology-preserving mappings. By this trick, it can contribute to both structure-preserving visualization and data clustering. We applied XOM to the analysis of whole-genome microarray imaging data, comprising 2467 79-dimensional gene expression profiles of Saccharomyces cerevisiae, and to model-free analysis of functional brain MRI data by unsupervised clustering. For both applications, we performed quantitative comparisons to results obtained by established algorithms. Results: Genome data: Absolute (relative) Sammon error values were 5.91·105 (1.00) for XOM, 6.50·105 (1.10) for Sammon's mapping, 6.56·105 (1.11) for PCA, and 7.24·105 (1.22) for Self-Organizing Map (SOM). Computation times were 72, 216, 2, and 881 seconds for XOM, Sammon, PCA, and SOM, respectively. - Functional MRI data: Areas under ROC curves for detection of task-related brain activation were 0.984 ± 0.03 for XOM, 0.983 ± 0.02 for Minimal-Free-Energy VQ, and 0.979 ± 0.02 for SOM. Conclusion: For both multidimensional imaging applications, i.e. gene expression visualization and functional MRI clustering, XOM yields competitive results when compared to established algorithms. Its surprising versatility to simultaneously contribute to dimensionality reduction and data clustering qualifies XOM to serve as a useful novel method for the analysis of multidimensional data, such as biomedical image data in computer-aided diagnosis.

[1]  Michel Verleysen,et al.  Nonlinear Projection with the Isotop Method , 2002, ICANN.

[2]  Arthur Flexer,et al.  On the use of self-organizing maps for clustering and visualization , 1999, Intell. Data Anal..

[3]  Anke Meyer-Bäse,et al.  Fully automated biomedical image segmentation by self-organized model adaptation , 2004, Neural Networks.

[4]  Jorma Laaksonen,et al.  SOM_PAK: The Self-Organizing Map Program Package , 1996 .

[5]  Klaus Obermayer,et al.  Self-organizing maps: Generalizations and new optimization techniques , 1998, Neurocomputing.

[6]  Barbara Hammer,et al.  Relational Topographic Maps , 2007, IDA.

[7]  Terrence J. Sejnowski,et al.  A Unifying Objective Function for Topographic Mappings , 1997, Neural Computation.

[8]  Michel Verleysen,et al.  Locally Linear Embedding versus Isotop , 2003, ESANN.

[9]  K. Obermayer,et al.  PHASE TRANSITIONS IN STOCHASTIC SELF-ORGANIZING MAPS , 1997 .

[10]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[11]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[12]  John W. Sammon,et al.  A Nonlinear Mapping for Data Structure Analysis , 1969, IEEE Transactions on Computers.

[13]  Juha Vesanto,et al.  SOM-based data visualization methods , 1999, Intell. Data Anal..

[14]  Terrence J. Sejnowski,et al.  Constrained Optimization for Neural Map Formation: A Unifying Framework for Weight Growth and Normalization , 1998, Neural Computation.

[15]  Helge J. Ritter,et al.  The deformable feature map - a novel neurocomputing algorithm for adaptive plasticity in pattern analysis , 2002, Neurocomputing.

[16]  Axel Wismüller,et al.  Cluster Analysis of Biomedical Image Time-Series , 2002, International Journal of Computer Vision.

[17]  Panu Somervuo,et al.  How to make large self-organizing maps for nonvectorial data , 2002, Neural Networks.

[18]  Axel Wismüller Exploration-Organized Morphogenesis (XOM): A general framework for learning by self-organization , 2001 .

[19]  Michel Verleysen,et al.  Rank-based quality assessment of nonlinear dimensionality reduction , 2008, ESANN.

[20]  Samuel Kaski,et al.  Self organization of a massive document collection , 2000, IEEE Trans. Neural Networks Learn. Syst..

[21]  Jarkko Venna,et al.  Dimensionality reduction for visual exploration of similarity structures , 2007 .