Hierarchical PCA Using Tree-SOM for the Identification of Bacteria

In this paper we present an extended version of Evolving Trees using Oja's rule. Evolving Trees are extensions of Self-Organizing Maps developed for hierarchical classification systems. Therefore they are well suited for taxonomic problems like the identification of bacteria. The paper focus on clustering and visualization of bacteria measurements. A modified variant of the Evolving Tree is developed and applied to obtain a hierarchical clustering. The method provides an inherent PCA analysis which is analyzed in combination with the tree based visualization. The obtained loadings support insights in the classification decision and can be used to identify features which are relevant for the cluster separation.

[1]  Thomas Martinetz,et al.  Learning Data Representations with Sparse Coding Neural Gas , 2008, ESANN.

[2]  Angelo Facchiano,et al.  Matrix-Assisted Laser Desorption Ionization-Time of Flight Mass Spectrometry for the Discrimination of Food-Borne Microorganisms , 2006, Applied and Environmental Microbiology.

[3]  Thomas Villmann,et al.  Topology preservation in self-organizing feature maps: exact definition and measurement , 1997, IEEE Trans. Neural Networks.

[4]  Erkki Oja,et al.  The Evolving Tree—A Novel Self-Organizing Network for Data Analysis , 2004, Neural Processing Letters.

[5]  T. Maier,et al.  Fast and reliable MALDI-TOF MS–based microorganism identification , 2006 .

[6]  Thomas Villmann,et al.  Growing a hypercubical output space in a self-organizing feature map , 1997, IEEE Trans. Neural Networks.

[7]  Daniel C. Liebler,et al.  Introduction to Proteomics , 2002, Humana Press.

[8]  Trinad Chakraborty,et al.  Rapid Identification and Typing of Listeria Species by Matrix-Assisted Laser Desorption Ionization-Time of Flight Mass Spectrometry , 2008, Applied and Environmental Microbiology.

[9]  A. Mellmann,et al.  Evaluation of Matrix-Assisted Laser Desorption Ionization-Time-of-Flight Mass Spectrometry in Comparison to 16S rRNA Gene Sequencing for Species Identification of Nonfermenting Bacteria , 2008, Journal of Clinical Microbiology.

[10]  Gabriel Cristóbal,et al.  Identification of tuberculosis bacteria based on shape and color , 2004, Real Time Imaging.

[11]  E. Oja Simplified neuron model as a principal component analyzer , 1982, Journal of mathematical biology.

[12]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .