Ranked centroid projection: a data visualization approach for self-organizing maps

The self-organizing map (SOM) is an efficient tool for visualizing high-dimensional data as it performs a topology-presenting projection of the input space on a low-dimensional grid. To utilize the information provided by the SOM and obtain an approximation of the data structure, a separate data projection method is usually needed. However, most of the SOM projection methods are computationally expensive when the size of the data set becomes large. In this paper, we present an intuitive and effective SOM projection method with comparatively low computational complexity for the purpose of cluster visualization. This method maps data vectors on the output space based on their responses to different prototype vectors. High-resolution maps can be obtained with a relatively small network size. The proposed method is demonstrated using both an artificial and a real world data set.

[1]  Gary G. Yen,et al.  A SOM mapping technique for visualizing documents in a database , 2001, IJCNN'01. International Joint Conference on Neural Networks. Proceedings (Cat. No.01CH37222).

[2]  Sinan Salman,et al.  DIVA: a visualization system for exploring document databases for technology forecasting , 2002 .

[3]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[4]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[5]  Vijay V. Raghavan,et al.  Document Clustering, Visualization, and Retrieval via Link Mining , 2004 .

[6]  Bruno Bienfait Applications of High-Resolution Self-Organizing Maps to Retrosynthetic and QSAR Analysis , 1994, J. Chem. Inf. Comput. Sci..

[7]  Hujun Yin,et al.  ViSOM - a novel method for multivariate data projection and structure visualization , 2002, IEEE Trans. Neural Networks.

[8]  Gary G. Yen,et al.  A SOM projection technique with the growing structure for visualizing high-dimensional data , 2003, Proceedings of the International Joint Conference on Neural Networks, 2003..

[9]  Vladimir Cherkassky,et al.  Self-Organization as an Iterative Kernel Smoothing Process , 1995, Neural Computation.

[10]  Christopher M. Bishop,et al.  GTM: The Generative Topographic Mapping , 1998, Neural Computation.

[11]  Bala Srinivasan,et al.  Dynamic self-organizing maps with controlled growth for knowledge discovery , 2000, IEEE Trans. Neural Networks Learn. Syst..

[12]  Olivier de Weck,et al.  Adaptive Weighted Sum Method for Bi-objective Optimization , 2004 .

[13]  Bernd Fritzke,et al.  A Growing Neural Gas Network Learns Topologies , 1994, NIPS.

[14]  Juha Vesanto,et al.  SOM-based data visualization methods , 1999, Intell. Data Anal..

[15]  Donald W. Bouldin,et al.  A Cluster Separation Measure , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Elias Pampalk,et al.  Using Smoothed Data Histograms for Cluster Visualization in Self-Organizing Maps , 2002, ICANN.

[17]  Esa Alhoniemi,et al.  Clustering of the self-organizing map , 2000, IEEE Trans. Neural Networks Learn. Syst..

[18]  Risto Miikkulainen,et al.  Incremental grid growing: encoding high-dimensional structure into a two-dimensional feature map , 1993, IEEE International Conference on Neural Networks.

[19]  Jeanny Hérault,et al.  Curvilinear component analysis: a self-organizing neural network for nonlinear mapping of data sets , 1997, IEEE Trans. Neural Networks.

[20]  John A. Hartigan,et al.  Clustering Algorithms , 1975 .

[21]  Huafeng Xu,et al.  A self-organizing principle for learning nonlinear manifolds , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[22]  Samuel Kaski,et al.  Self organization of a massive text document collection , 1999 .

[23]  H. Hotelling Analysis of a complex of statistical variables into principal components. , 1933 .

[24]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[25]  John W. Sammon,et al.  A Nonlinear Mapping for Data Structure Analysis , 1969, IEEE Transactions on Computers.

[26]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[27]  Bernd Fritzke Growing Grid — a self-organizing network with constant neighborhood range and adaptation strength , 1995, Neural Processing Letters.

[28]  Andreas Rauber,et al.  The growing hierarchical self-organizing map: exploratory analysis of high-dimensional data , 2002, IEEE Trans. Neural Networks.

[29]  M. M. Kessler Bibliographic coupling between scientific papers , 1963 .

[30]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[31]  Bernd Fritzke,et al.  Growing cell structures--A self-organizing network for unsupervised and supervised learning , 1994, Neural Networks.

[32]  J. H. Ward Hierarchical Grouping to Optimize an Objective Function , 1963 .

[33]  Teuvo Kohonen,et al.  Self-organized formation of topologically correct feature maps , 2004, Biological Cybernetics.

[34]  Miin-Shen Yang,et al.  A similarity-based robust clustering method , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.