Optimal decisions in combining the SOM with nonlinear projection methods

Abstract Visual data mining is an efficient way to involve human in search for a optimal decision. This paper focuses on the optimization of the visual presentation of multidimensional data. A variety of methods for projection of multidimensional data on the plane have been developed. At present, a tendency of their joint use is observed. In this paper, two consequent combinations of the self-organizing map (SOM) with two other well-known nonlinear projection methods are examined theoretically and experimentally. These two methods are: Sammon’s mapping and multidimensional scaling (MDS). The investigations showed that the combinations (SOM_Sammon and SOM_MDS) have a similar efficiency. This grounds the possibility of application of the MDS with the SOM, because up to now in most researches SOM is applied together with Sammon’s mapping. The problems on the quality and accuracy of such combined visualization are discussed. Three criteria of different nature are selected for evaluation the efficiency of the combined mapping. The joint use of these criteria allows us to choose the best visualization result from some possible ones. Several different initialization ways for nonlinear mapping are examined, and a new one is suggested. A new approach to the SOM visualization is suggested. The obtained results allow us to make better decisions in optimizing the data visualization.

[1]  Patrick J. F. Groenen,et al.  Modern Multidimensional Scaling: Theory and Applications , 2003 .

[2]  J. Douglas Carroll,et al.  14 Multidimensional scaling and its applications , 1982, Classification, Pattern Recognition and Reduction of Dimensionality.

[3]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[4]  Gintautas Dzemyda,et al.  Comparative Analysis of the Graphical Result Presentation in the SOM Software , 2002, Informatica.

[5]  Helge J. Ritter,et al.  Neural computation and self-organizing maps - an introduction , 1992, Computation and neural systems series.

[6]  Lakhmi C. Jain,et al.  Self-Organizing Neural Networks , 2002 .

[7]  Richard F. Gunst,et al.  Applied Regression Analysis , 1999, Technometrics.

[8]  P. Pardalos,et al.  Gauss-seidel method for least-distance problems , 1992 .

[9]  佐藤 保,et al.  Principal Components , 2021, Encyclopedic Dictionary of Archaeology.

[10]  James C. Bezdek,et al.  An index of topological preservation for feature extraction , 1995, Pattern Recognit..

[11]  J. Rubner,et al.  A Self-Organizing Network for Principal-Component Analysis , 1989 .

[12]  Erkki Oja,et al.  Principal components, minor components, and linear neural networks , 1992, Neural Networks.

[13]  T. Hastie,et al.  Principal Curves , 2007 .

[14]  P. Pezzullo OVERTURE , 2008, First Peoples in a New World.

[15]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[16]  John W. Sammon,et al.  A Nonlinear Mapping for Data Structure Analysis , 1969, IEEE Transactions on Computers.

[17]  Gintautas Dzemyda,et al.  Visualization of a set of parameters characterized by their correlation matrix , 2001 .

[18]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[19]  Gintautas Dzemyda,et al.  Visualisation of Multidimensional Objects and the Socio-Economical Impact to Activity in EC RTD Databases , 2001, Informatica.

[20]  Willem J. Heiser,et al.  13 Theory of multidimensional scaling , 1982, Classification, Pattern Recognition and Reduction of Dimensionality.

[21]  John W. Tukey,et al.  A Projection Pursuit Algorithm for Exploratory Data Analysis , 1974, IEEE Transactions on Computers.

[22]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[23]  L. Guttman A general nonmetric technique for finding the smallest coordinate space for a configuration of points , 1968 .

[24]  Gautam Biswas,et al.  Evaluation of Projection Algorithms , 1981, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  A. Karbowski Direct method of hierarchical nonlinear optimization - reassessment after 30 years , 2004 .

[26]  L. N. Kanal,et al.  Theory of Multidimensional Scaling , 2005 .

[27]  G. V. Kass,et al.  Location of Several Outliers in Multiple-Regression Data Using Elemental Sets , 1984 .

[28]  M. Davison Introduction to Multidimensional Scaling and Its Applications , 1983 .

[29]  A. Basilevsky,et al.  Factor Analysis as a Statistical Method. , 1964 .

[30]  Åke Björck,et al.  Numerical Methods , 2021, Markov Renewal and Piecewise Deterministic Processes.

[31]  Anil K. Jain,et al.  Artificial neural networks for feature extraction and multivariate data projection , 1995, IEEE Trans. Neural Networks.

[32]  John A. Hartigan,et al.  Clustering Algorithms , 1975 .

[33]  Arthur Flexer,et al.  Limitations of Self-organizing Maps for Vector Quantization and Multidimensional Scaling , 1996, NIPS.

[34]  Jorma Laaksonen,et al.  SOM_PAK: The Self-Organizing Map Program Package , 1996 .

[35]  T. Sejnowski,et al.  Quantifying neighbourhood preservation in topographic mappings , 1996 .

[36]  Gintautas Dzemyda,et al.  Visualization of Multidimensional Data Taking into Account the Learning Flow of the Self-Organizing Neural Network , 2003, WSCG.