A journey into low-dimensional spaces with autoassociative neural networks.

The compression and the visualization of the data have been always a subject of a great deal of excitement. Since multidimensional data sets are difficult to interpret and visualize, much of the attention is drawn how to compress them efficiently. Usually, the compression of dimensionality is considered as the first step of exploratory data analysis. Here, we focus our attention on autoassociative neural networks (ANNs), which in a very elegant manner provide data compression and visualization. ANNs can deal with linear and nonlinear correlation among variables, what makes them a very powerful tool in exploratory data analysis. In the literature, ANNs are often referred as nonlinear principal component analysis (PCA), and due to their specific structure they are also known as bottleneck neural networks. In this paper, ANNs are discussed in details. Different training modes are described and illustrated on real example. The usefulness of ANNs for nonlinear data compression and visualization purposes is proven with the aid of chemical data sets, being the subject of analysis. The comparison of ANNs with well-known PCA is also presented.

[1]  Brian D. Ripley,et al.  Pattern Recognition and Neural Networks , 1996 .

[2]  Gerrit Kateman,et al.  USING ARTIFICIAL NEURAL NETWORKS FOR SOLVING CHEMICAL PROBLEMS .2. KOHONEN SELF-ORGANIZING FEATURE MAPS AND HOPFIELD NETWORKS , 1994 .

[3]  F Despagne,et al.  Neural networks in multivariate calibration. , 1998, The Analyst.

[4]  Johanna Smeyers-Verbeke,et al.  Handbook of Chemometrics and Qualimetrics: Part A , 1997 .

[5]  Johann Gasteiger,et al.  Neural networks in chemistry and drug design , 1999 .

[6]  Mark A. Kramer,et al.  Autoassociative neural networks , 1992 .

[7]  Albert Bos,et al.  Tutorial review—Data processing by neural networks in quantitative chemical analysis , 1993 .

[8]  B. Kowalski,et al.  A combined linear and nonlinear factor analysis program package for chemical data evaluation , 1981 .

[9]  Christopher M. Bishop,et al.  GTM: The Generative Topographic Mapping , 1998, Neural Computation.

[10]  Jure Zupan,et al.  Kohonen and counterpropagation artificial neural networks in analytical chemistry , 1997 .

[11]  R. Barnes,et al.  Standard Normal Variate Transformation and De-Trending of Near-Infrared Diffuse Reflectance Spectra , 1989 .

[12]  Yi-Zeng Liang,et al.  A non‐linear mapping‐based generalized backpropagation network for unsupervised learning , 1996 .

[13]  Jorge J. Moré,et al.  The Levenberg-Marquardt algo-rithm: Implementation and theory , 1977 .

[14]  Douglas B. Kell,et al.  Plant seed classification using pyrolysis mass spectrometry with unsupervised learning: The application of auto-associative and Kohonen artificial neural networks , 1996 .

[15]  Barry J. Wythoff,et al.  Backpropagation neural networks , 1993 .

[16]  Teuvo Kohonen,et al.  Self-Organization and Associative Memory , 1988 .

[17]  C. Aldrich,et al.  CLUSTER ANALYSIS OF MINERAL PROCESS DATA WITH AUTOASSOCIATIVE NEURAL NETWORKS , 2000 .

[18]  M. Kramer Nonlinear principal component analysis using autoassociative neural networks , 1991 .

[19]  Chris Aldrich,et al.  Visualization of transformed multivariate data sets with autoassociative neural networks , 1998, Pattern Recognit. Lett..

[20]  Ru-Qin Yu,et al.  Neural network learning to non-linear principal component analysis , 1996 .

[21]  George M. Whitesides,et al.  FEED-FORWARD NEURAL NETWORKS IN CHEMISTRY : MATHEMATICAL SYSTEMS FOR CLASSIFICATION AND PATTERN RECOGNITION , 1993 .

[22]  G. Kateman,et al.  Neural networks in analytical chemistry , 1993 .

[23]  John W. Sammon,et al.  A Nonlinear Mapping for Data Structure Analysis , 1969, IEEE Transactions on Computers.

[24]  D L Massart,et al.  Identification of pharmaceutical excipients using NIR spectroscopy and SIMCA. , 1999, Journal of pharmaceutical and biomedical analysis.

[25]  Carl G. Looney,et al.  Pattern recognition using neural networks , 1997 .

[26]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .