GViSOM for Multivariate Mixed Data Projection and Structure Visualization

Data mining has become a popular technology in analyzing complex data. Clustering is one of the data mining core techniques. In this paper, we propose a new visualized clustering approach, namely generalized visualization-induced self-organizing map (GViSOM), to cluster mixed, numeric and categorical, data. Our model integrates the ideas from SOM, GSOM, and ViSOM, and overcomes their shortcomings, including projection distortion on the maps and the incapability of handling mixed data. GViSOM can directly handle mixed data and preserves the topological structure of the original data as faithfully as possible. Experimental results of a synthetic and a real datasets demonstrate that GViSOM is able to cluster mixed data and better reveals the cluster structure than SOM, GSOM and ViSOM.

[1]  Hujun Yin,et al.  ViSOM - a novel method for multivariate data projection and structure visualization , 2002, IEEE Trans. Neural Networks.

[2]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[3]  Fionn Murtagh,et al.  Interpreting the Kohonen self-organizing feature map using contiguity-constrained clustering , 1995, Pattern Recognit. Lett..

[4]  Hujun Yin,et al.  Data visualisation and manifold mapping using the ViSOM , 2002, Neural Networks.

[5]  Tommy W. S. Chow,et al.  Clustering of the self-organizing map using a clustering validity index based on inter-cluster and intra-cluster density , 2004, Pattern Recognit..

[6]  Jouko Lampinen,et al.  Clustering properties of hierarchical self-organizing maps , 1992, Journal of Mathematical Imaging and Vision.

[7]  Melody Y. Kiang,et al.  Extending the Kohonen self-organizing map networks for clustering analysis , 2002 .

[8]  Teuvo Kohonen,et al.  The self-organizing map , 1990, Neurocomputing.

[9]  Esa Alhoniemi,et al.  Clustering of the self-organizing map , 2000, IEEE Trans. Neural Networks Learn. Syst..

[10]  Jiawei Han,et al.  Data-Driven Discovery of Quantitative Rules in Relational Databases , 1993, IEEE Trans. Knowl. Data Eng..

[11]  Jorma Laaksonen,et al.  SOM_PAK: The Self-Organizing Map Program Package , 1996 .

[12]  Chung-Chian Hsu,et al.  Generalizing self-organizing map for categorical data , 2006, IEEE Transactions on Neural Networks.