Exploratory Observation Machine (XOM) with Kullback-Leibler Divergence for Dimensionality Reduction and Visualization

We present an extension of the Exploratory Observation Ma- chine (XOM) for structure-preserving dimensionality reduction. Based on minimizing the Kullback-Leibler divergence of neighborhood functions in data and image spaces, this Neighbor Embedding XOM (NE-XOM) cre- ates a link between fast sequential online learning known from topology- preserving mappings and principled direct divergence optimization ap- proaches. We quantitatively evaluate our method on real world data using multiple embedding quality measures. In this comparison, NE-XOM per- forms as a competitive trade-off between high embedding quality and low computational expense, which motivates its further use in real-world set- tings throughout science and engineering.

[1]  Simon Rogers,et al.  Multi-class Semi-supervised Learning with the e-truncated Multinomial Probit Gaussian Process , 2007, Gaussian Processes in Practice.

[2]  Michel Verleysen,et al.  Locally Linear Embedding versus Isotop , 2003, ESANN.

[3]  James C. Bezdek,et al.  An index of topological preservation for feature extraction , 1995, Pattern Recognit..

[4]  Axel Wismüller,et al.  Adaptive local dissimilarity measures for discriminative dimension reduction of labeled data , 2010, Neurocomputing.

[5]  Deniz Erdogmus,et al.  Vector quantization using information theoretic concepts , 2005, Natural Computing.

[6]  Axel Wismüller Exploration-Organized Morphogenesis (XOM): A general framework for learning by self-organization , 2001 .

[7]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[8]  Geoffrey E. Hinton,et al.  Stochastic Neighbor Embedding , 2002, NIPS.

[9]  Eric O. Postma,et al.  Dimensionality Reduction: A Comparative Review , 2008 .

[10]  T. Villmann,et al.  Mathematical Aspects of Divergence Based Vector Quantization Using Fr´ echet-Derivatives , 2009 .

[11]  Inderjit S. Dhillon,et al.  Clustering with Bregman Divergences , 2005, J. Mach. Learn. Res..

[12]  Michel Verleysen,et al.  Nonlinear Dimensionality Reduction , 2021, Computer Vision.

[13]  T. Heskes Energy functions for self-organizing maps , 1999 .

[14]  John W. Sammon,et al.  A Nonlinear Mapping for Data Structure Analysis , 1969, IEEE Transactions on Computers.

[15]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[16]  Christopher J. Merz,et al.  UCI Repository of Machine Learning Databases , 1996 .

[17]  Axel Wismüller A Computational Framework for Nonlinear Dimensionality Reduction and Clustering , 2009, WSOM.

[18]  Thomas Villmann,et al.  Funtional vector quantization by neural maps , 2009, 2009 First Workshop on Hyperspectral Image and Signal Processing: Evolution in Remote Sensing.

[19]  Jarkko Venna,et al.  Dimensionality reduction for visual exploration of similarity structures , 2007 .