Prototype-Based Classification of Dissimilarity Data

Unlike many black-box algorithms in machine learning, prototype-based models offer an intuitive interface to given data sets, since prototypes can directly be inspected by experts in the field. Most techniques rely on Euclidean vectors such that their suitability for complex scenarios is limited. Recently, several unsupervised approaches have successfully been extended to general, possibly non-Euclidean data characterized by pairwise dissimilarities. In this paper, we shortly review a general approach to extend unsupervised prototype-based techniques to dissimilarities, and we transfer this approach to supervised prototypebased classification for general dissimilarity data. In particular, a new supervised prototype-based classification technique for dissimilarity data is proposed.

[1]  Horst Bunke,et al.  Edit distance-based kernel functions for structural pattern classification , 2006, Pattern Recognit..

[2]  Barbara Hammer,et al.  Relational Topographic Maps , 2007, IDA.

[3]  Panu Somervuo,et al.  How to make large self-organizing maps for nonvectorial data , 2002, Neural Networks.

[4]  Martin A. Riedmiller,et al.  Incremental GRLVQ: Learning relevant features for 3D object recognition , 2008, Neurocomputing.

[5]  Barbara Hammer,et al.  Patch clustering for massive data sets , 2009, Neurocomputing.

[6]  Barbara Hammer,et al.  Relational Generative Topographic Map , 2010, ESANN.

[7]  Heiko Wersing,et al.  Online figure-ground segmentation with adaptive metrics in generalized LVQ , 2009, Neurocomputing.

[8]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[9]  Trinad Chakraborty,et al.  Rapid Identification and Typing of Listeria Species by Matrix-Assisted Laser Desorption Ionization-Time of Flight Mass Spectrometry , 2008, Applied and Environmental Microbiology.

[10]  Michael Biehl,et al.  Adaptive Relevance Matrices in Learning Vector Quantization , 2009, Neural Computation.

[11]  Atsushi Sato,et al.  Generalized Learning Vector Quantization , 1995, NIPS.

[12]  Thomas Villmann,et al.  Generalized relevance learning vector quantization , 2002, Neural Networks.

[13]  Michael Biehl,et al.  Distance Learning in Discriminative Vector Quantization , 2009, Neural Computation.

[14]  Francisco Escolano,et al.  Graph-Based Representations in Pattern Recognition, 6th IAPR-TC-15 International Workshop, GbRPR 2007, Alicante, Spain, June 11-13, 2007, Proceedings , 2007, GbRPR.

[15]  Ming Li,et al.  Clustering by compression , 2003, IEEE International Symposium on Information Theory, 2003. Proceedings..

[16]  Frank-Michael Schleif,et al.  Accelerating kernel clustering for biomedical data analysis , 2011, 2011 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB).

[17]  Robert P. W. Duin,et al.  The Dissimilarity Representation for Pattern Recognition - Foundations and Applications , 2005, Series in Machine Perception and Artificial Intelligence.

[18]  A. Kai Qin,et al.  A novel kernel prototype-based learning algorithm , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[19]  Barbara Hammer,et al.  Graph-Based Representation of Symbolic Musical Data , 2009, GbRPR.

[20]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.

[21]  Frank-Michael Schleif,et al.  Accelerating dissimilarity clustering for biomedical data analysis , 2011, IEEE CIBCB 2011.

[22]  T. Maier,et al.  Fast and reliable MALDI-TOF MS–based microorganism identification , 2006 .

[23]  Christopher M. Bishop,et al.  GTM: The Generative Topographic Mapping , 1998, Neural Computation.

[24]  Barbara Hammer,et al.  Topographic Processing of Very Large Text Datasets , 2008 .

[25]  John Shawe-Taylor,et al.  Advances in Intelligent Data Analysis VII, 7th International Symposium on Intelligent Data Analysis, IDA 2007, Ljubljana, Slovenia, September 6-8, 2007, Proceedings , 2007, IDA.

[26]  E. Granum,et al.  Quantitative analysis of 6985 digitized trypsin G ‐banded human metaphase chromosomes , 1980, Clinical genetics.

[27]  Thomas Martinetz,et al.  'Neural-gas' network for vector quantization and its application to time-series prediction , 1993, IEEE Trans. Neural Networks.

[28]  R. Duin,et al.  The dissimilarity representation for pattern recognition , a tutorial , 2009 .

[29]  Barbara Hammer,et al.  Topographic Mapping of Large Dissimilarity Data Sets , 2010, Neural Computation.

[30]  Klaus Obermayer,et al.  Soft Learning Vector Quantization , 2003, Neural Computation.

[31]  George Eastman House,et al.  Sparse Bayesian Learning and the Relevan e Ve tor Ma hine , 2001 .

[32]  Thomas Martinetz,et al.  Topology representing networks , 1994, Neural Networks.

[33]  Thomas Villmann,et al.  Batch and median neural gas , 2006, Neural Networks.