Prototype based fuzzy classification in clinical proteomics

Proteomic profiling based on mass spectrometry is an important tool for studies at the protein and peptide level in medicine and health care. Thereby, the identification of relevant masses, which are characteristic for specific sample states e.g. a disease state is complicated. Further, the classification accuracy and safety is especially important in medicine. The determination of classification models for such high dimensional clinical data is a complex task. Specific methods, which are robust with respect to the large number of dimensions and fit to clinical needs, are required. In this contribution two such methods for the construction of nearest prototype classifiers are compared in the context of clinical proteomic studies, which are specifically suited to deal with such high-dimensional functional data. Both methods are suitable to the adaptation of the underling metric, which is useful in proteomic research to get a problem adequate representation of the clinical data. In addition they allow fuzzy classification and for one of them allows fuzzy classified training data. Both algorithms are investigated in detail with respect to their specific properties. A performance analyses is taken on real clinical proteomic cancer data in a comparative manner.

[1]  Koby Crammer,et al.  Margin Analysis of the LVQ Algorithm , 2002, NIPS.

[2]  Thomas Martinetz,et al.  'Neural-gas' network for vector quantization and its application to time-series prediction , 1993, IEEE Trans. Neural Networks.

[3]  Thomas Villmann,et al.  Generalized relevance learning vector quantization , 2002, Neural Networks.

[4]  Thomas Villmann,et al.  Fuzzy Labeled Soft Nearest Neighbor Classification with Relevance Learning , 2005, Fourth International Conference on Machine Learning and Applications (ICMLA'05).

[5]  Thomas Villmann,et al.  Mathematical Aspects of Neural Networks , 2003, ESANN.

[6]  Frank-Michael Schleif,et al.  On the Generalization Ability of Prototype-Based Classifiers with Local Relevance Determination , 2005 .

[7]  Thomas Villmann,et al.  Local Metric Adaptation for Soft Nearest Prototype Classification to Classify Proteomic Data , 2005, WILF.

[8]  Thomas Villmann,et al.  Prototype-based fuzzy classification with local relevance for proteomics , 2006, Neurocomputing.

[9]  Thomas Villmann,et al.  Margin based Active Learning for LVQ Networks , 2007, ESANN.

[10]  Samuel Kaski,et al.  Self-Organized Formation of Various Invariant-Feature Filters in the Adaptive-Subspace SOM , 1997, Neural Computation.

[11]  D. Hochstrasser,et al.  Mass Spectrometry-Based Proteomics: Current Status and Potential Use in Clinical Chemistry , 2003, Clinical chemistry and laboratory medicine.

[12]  P. Schellhammer,et al.  Serum protein fingerprinting coupled with a pattern-matching algorithm distinguishes prostate cancer from benign prostate hyperplasia and healthy men. , 2002, Cancer research.

[13]  Klaus Obermayer,et al.  Soft nearest prototype classification , 2003, IEEE Trans. Neural Networks.

[14]  Thomas Villmann,et al.  Supervised Neural Gas with General Similarity Measure , 2005, Neural Processing Letters.

[15]  Frank-Michael Schleif,et al.  Fuzzy Labeled Neural GAS for Fuzzy Classification , 2005 .

[16]  Atsushi Sato,et al.  Generalized Learning Vector Quantization , 1995, NIPS.

[17]  Thomas Villmann,et al.  Fuzzy classification by fuzzy labeled neural gas , 2006, Neural Networks.

[18]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[19]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .