Adaptive tangent distances in generalized learning vector quantization for transformation and distortion invariant classification learning

We propose a learning vector quantization algorithm variant for prototype-based classification learning with adaptive tangent distance learning. Tangent distances were developed to achieve dissimilarity measures invariant with respect to transformations and distortions like rotation, noise, etc.. Usually, these tangent distances are predefined in applications or are estimated in preprocessing. We introduce in this paper a generalized learning vector quantizer (GLVQ) with an online adaptation scheme for tangent distances. The adaptation takes place as a stochastic gradient descent learning accompanying the usual online prototype learning. In this way, class discriminative tangents are learned contributing to a better classification performance. Further, the resulting update schemes can be seen as a special type of local matrix learning in GLVQ. In this paper, we provide the full mathematical theory behind the derived tangent distance adaptation rule and demonstrate the classification ability of the resulting GLVQ model in comparison to state-of-the-art tangent distance based classifiers in the field.

[1]  Bernhard Schölkopf,et al.  Incorporating Invariances in Non-Linear Support Vector Machines , 2001, NIPS.

[2]  Koby Crammer,et al.  Margin Analysis of the LVQ Algorithm , 2002, NIPS.

[3]  Shuen-Lin Jeng,et al.  Adaptive tangent distance classifier on recognition of handwritten digits , 2011 .

[4]  Hermann Ney,et al.  Adaptation in statistical pattern recognition using tangent vectors , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[6]  Christian Walder,et al.  Learning with Transformation Invariant Kernels , 2007, NIPS.

[7]  Thomas Villmann,et al.  Adaptive dissimilarity weighting for prototype-based classification optimizing mixtures of dissimilarities , 2016, ESANN.

[8]  Robert P. W. Duin,et al.  The Dissimilarity Representation for Pattern Recognition - Foundations and Applications , 2005, Series in Machine Perception and Artificial Intelligence.

[9]  Thomas Villmann,et al.  Aspects in Classification Learning - Review of Recent Developments in Learning Vector Quantization , 2014 .

[10]  Holger Schwenk,et al.  Learning Discriminant Tangent Models for Handwritten Character Recognition , 1995 .

[11]  Thomas Villmann,et al.  Generalized relevance learning vector quantization , 2002, Neural Networks.

[12]  Bernhard Schölkopf,et al.  Training Invariant Support Vector Machines , 2002, Machine Learning.

[13]  Bernard Haasdonk,et al.  Tangent distance kernels for support vector machines , 2002, Object recognition supported by user interaction for service robots.

[14]  Alessandro Sperduti,et al.  Discriminant Pattern Recognition Using Transformation-Invariant Neurons , 2000, Neural Computation.

[15]  Georges Voronoi Nouvelles applications des paramètres continus à la théorie des formes quadratiques. Deuxième mémoire. Recherches sur les parallélloèdres primitifs. , 1908 .

[16]  Michael Biehl,et al.  Adaptive Relevance Matrices in Learning Vector Quantization , 2009, Neural Computation.

[17]  Dan Ciresan,et al.  Multi-Column Deep Neural Networks for offline handwritten Chinese character classification , 2013, 2015 International Joint Conference on Neural Networks (IJCNN).

[18]  Terence D. Sanger,et al.  Optimal unsupervised learning in a single-layer linear feedforward neural network , 1989, Neural Networks.

[19]  Deniz Erdogmus,et al.  Information Theoretic Learning , 2005, Encyclopedia of Artificial Intelligence.

[20]  Marco Wiering,et al.  2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) , 2011, IJCNN 2011.

[21]  Daniel Keysers Approaches to Invariant Image Object Recognition , 2000 .

[22]  Kilian Q. Weinberger,et al.  Distance Metric Learning for Large Margin Nearest Neighbor Classification , 2005, NIPS.

[23]  Shie Mannor,et al.  Robustness and Regularization of Support Vector Machines , 2008, J. Mach. Learn. Res..

[24]  Bernhard Schölkopf,et al.  Incorporating invariances in nonlinear Support Vector Machines , 2001, NIPS 2001.

[25]  Teuvo Kohonen,et al.  Improved versions of learning vector quantization , 1990, 1990 IJCNN International Joint Conference on Neural Networks.

[26]  Robert Tibshirani,et al.  Discriminant Adaptive Nearest Neighbor Classification , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[27]  Thomas Villmann,et al.  Border-sensitive learning in generalized learning vector quantization: an alternative to support vector machines , 2015, Soft Comput..

[28]  Yann LeCun,et al.  Efficient Pattern Recognition Using a New Transformation Distance , 1992, NIPS.

[29]  Atsushi Sato,et al.  Generalized Learning Vector Quantization , 1995, NIPS.

[30]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[31]  Thomas Villmann,et al.  Prototype-based models in machine learning. , 2016, Wiley interdisciplinary reviews. Cognitive science.

[32]  Alexander J. Smola,et al.  Learning with kernels , 1998 .

[33]  Klaus Obermayer,et al.  Soft Learning Vector Quantization , 2003, Neural Computation.

[34]  Thomas Villmann,et al.  Stationarity of Matrix Relevance LVQ , 2015, 2015 International Joint Conference on Neural Networks (IJCNN).