Topology Representing Networks for the Visualization of Manifolds

In practical data mining tasks, high-dimensional data has to be analyzed. In most of the cases it is very informative to map and visualize the hidden structure of a complex data set in a low-dimensional space. In this paper a new class of mapping algorithms is defined. These algorithms combine topology representing networks and different nonlinear mapping algorithms. While the former methods aim to quantify the data and disclose the real structure of the objects, the nonlinear mapping algorithms are able to visualize the quantized data in the low-dimensional vector space. In this paper, techniques based on these methods are gathered and the results of a detailed analysis performed on them are shown. The primary aim of this analysis is to examine the preservation of distances and neighborhood relations of the objects. Preservation of neighborhood relations was analyzed both in local and global environments. To evaluate the main properties of the examined methods we show the outcome of the analysis based both on synthetic and real benchmark examples.

[1]  A. Householder,et al.  Discussion of a set of points in terms of their mutual distances , 1938 .

[2]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[3]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[4]  Pierre Comon Independent component analysis - a new concept? signal processing , 1994 .

[5]  E. Capaldi,et al.  The organization of behavior. , 1992, Journal of applied behavior analysis.

[6]  Jeanny Hérault,et al.  Curvilinear component analysis: a self-organizing neural network for nonlinear mapping of data sets , 1997, IEEE Trans. Neural Networks.

[7]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[8]  P. Groenen,et al.  Modern Multidimensional Scaling: Theory and Applications , 1999 .

[9]  J. Kruskal Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis , 1964 .

[10]  Pierre Comon,et al.  Independent component analysis, A new concept? , 1994, Signal Process..

[11]  Pragya Agarwal,et al.  Self-Organising Maps , 2008 .

[12]  Edsger W. Dijkstra,et al.  A note on two problems in connexion with graphs , 1959, Numerische Mathematik.

[13]  Teuvo Kohonen,et al.  In: Self-organising Maps , 1995 .

[14]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[15]  Pablo A. Estévez,et al.  Online data visualization using the neural gas network , 2006, Neural Networks.

[16]  Ferenc Szeifert,et al.  Data-driven generation of compact, accurate, and linguistically sound fuzzy classifiers based on a decision-tree initialization , 2003, Int. J. Approx. Reason..

[17]  Hamed Hamid Muhammed,et al.  Unsupervised fuzzy clustering using weighted incremental neural networks , 2004, Int. J. Neural Syst..

[18]  Claudio A. Perez,et al.  Nonlinear Projection Using Geodesic Distances and the Neural Gas Network , 2006, ICANN.

[19]  Jarkko Venna,et al.  Trustworthiness and metrics in visualizing similarity of gene expression , 2003, BMC Bioinformatics.

[20]  Thomas Martinetz,et al.  Topology representing networks , 1994, Neural Networks.

[21]  Amaury Lendasse,et al.  A robust nonlinear projection method , 2000 .

[22]  H. Hotelling Analysis of a complex of statistical variables into principal components. , 1933 .

[23]  Siming Liu,et al.  Dynamic topology representing networks , 1998, 1998 IEEE International Joint Conference on Neural Networks Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98CH36227).

[24]  Jarkko Venna,et al.  Local multidimensional scaling , 2006, Neural Networks.

[25]  Natasha K. Stout,et al.  Chapter 7: The Wisconsin Breast Cancer Epidemiology Simulation Model , 2006 .

[26]  John W. Sammon,et al.  A Nonlinear Mapping for Data Structure Analysis , 1969, IEEE Transactions on Computers.

[27]  János Abonyi,et al.  Topology Representing Network Map - A New Tool for Visualization of High-Dimensional Data , 2008, Trans. Comput. Sci..

[28]  Hujun Yin,et al.  ViSOM - a novel method for multivariate data projection and structure visualization , 2002, IEEE Trans. Neural Networks.

[29]  I. Hassan Embedded , 2005, The Cyber Security Handbook.

[30]  B. Yegnanarayana,et al.  Artificial Neural Networks , 2004 .

[31]  János Abonyi,et al.  Visualization of Topology Representing Networks , 2007, IDEAL.