Exploiting Data Topology in Visualization and Clustering of Self-Organizing Maps

The self-organizing map (SOM) is a powerful method for visualization, cluster extraction, and data mining. It has been used successfully for data of high dimensionality and complexity where traditional methods may often be insufficient. In order to analyze data structure and capture cluster boundaries from the SOM, one common approach is to represent the SOM's knowledge by visualization methods. Different aspects of the information learned by the SOM are presented by existing methods, but data topology, which is present in the SOM's knowledge, is greatly underutilized. We show in this paper that data topology can be integrated into the visualization of the SOM and thereby provide a more elaborate view of the cluster structure than existing schemes. We achieve this by introducing a weighted Delaunay triangulation (a connectivity matrix) and draping it over the SOM. This new visualization, CONNvis, also shows both forward and backward topology violations along with the severity of forward ones, which indicate the quality of the SOM learning and the data complexity. CONNvis greatly assists in detailed identification of cluster boundaries. We demonstrate the capabilities on synthetic data sets and on a real 8D remote sensing spectral image.

[1]  Bernd Fritzke,et al.  A Growing Neural Gas Network Learns Topologies , 1994, NIPS.

[2]  Juha Vesanto,et al.  SOM-based data visualization methods , 1999, Intell. Data Anal..

[3]  Jarkko Venna,et al.  Neighborhood Preservation in Nonlinear Projection Methods: An Experimental Study , 2001, ICANN.

[4]  Marie Cottrell,et al.  A Kohonen map representation to avoid misleading interpretations , 1996, The European Symposium on Artificial Neural Networks.

[5]  Ron Kohavi,et al.  Supervised and Unsupervised Discretization of Continuous Features , 1995, ICML.

[6]  Thomas Martinetz,et al.  Topology representing networks , 1994, Neural Networks.

[7]  Andreas Rauber,et al.  Advanced Visualization Techniques for Self-organizing Maps with Graph-Based Methods , 2005, ISNN.

[8]  T. Villmann,et al.  Extensions and modifications of the Kohenen-SOM and applications in remote sensing image analysis , 2001 .

[9]  Mu-Chun Su,et al.  A new model of self-organizing neural networks and its application in data projection , 2001, IEEE Trans. Neural Networks.

[10]  Johan Himberg,et al.  A SOM based cluster visualization and its application for false coloring , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[11]  A. Ultsch,et al.  Self-Organizing Neural Networks for Visualisation and Classification , 1993 .

[12]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[13]  Bea Csathó,et al.  A MULTISENSOR DATA SET OF AN URBAN AND COASTAL SCENE , 1998 .

[14]  Elias Pampalk,et al.  Using Smoothed Data Histograms for Cluster Visualization in Self-Organizing Maps , 2002, ICANN.

[15]  Risto Miikkulainen,et al.  Visualizing High-Dimensional Structure with the Incremental Grid Growing Neural Network , 1995, ICML.

[16]  Anil K. Jain,et al.  A nonlinear projection method based on Kohonen's topology preserving maps , 1992, IEEE Trans. Neural Networks.

[17]  Stephen R. Marsland,et al.  A self-organising network that grows when required , 2002, Neural Networks.

[18]  M. V. Velzen,et al.  Self-organizing maps , 2007 .

[19]  E. Mer,et al.  \Precision Mining" of High-Dimensional Patterns with Self-Organizing Maps: Interpretation of Hyperspectral Images Erzs , 2003 .

[20]  Thomas Villmann,et al.  Explicit Magnification Control of Self-Organizing Maps for “Forbidden” Data , 2007, IEEE Transactions on Neural Networks.

[21]  A. Ultsch Maps for the Visualization of high-dimensional Data Spaces , 2003 .

[22]  Habtom W. Ressom,et al.  Adaptive double self-organizing maps for clustering gene expression profiles , 2003, Neural Networks.

[23]  Hujun Yin,et al.  ViSOM - a novel method for multivariate data projection and structure visualization , 2002, IEEE Trans. Neural Networks.

[24]  Esa Alhoniemi,et al.  Clustering of the self-organizing map , 2000, IEEE Trans. Neural Networks Learn. Syst..

[25]  Jarkko Venna,et al.  Coloring that Reveals Cluster Structures in Multivariate Data , 2000 .

[26]  James C. Bezdek,et al.  Some new indexes of cluster validity , 1998, IEEE Trans. Syst. Man Cybern. Part B.

[27]  Michaël Aupetit,et al.  Learning Topology with the Generative Gaussian Graph and the EM Algorithm , 2005, NIPS.