Dynamic self-organizing maps with controlled growth for knowledge discovery

The growing self-organizing map (GSOM) has been presented as an extended version of the self-organizing map (SOM) with significant advantages for knowledge discovery applications. In this paper, the GSOM algorithm is presented in detail, and the effect of a spread factor, which can be used to measure and control the spread of the GSOM, is investigated. Because the spread factor is independent of the dimensionality of the data, it can be used as a controlling measure for generating maps with different dimensionality, which can then be compared and analyzed with better accuracy. The spread factor is also presented as a method of achieving hierarchical clustering of a data set with the GSOM. Such hierarchical clustering allows the data analyst to identify significant and interesting clusters at a higher level of the hierarchy and then continue with finer clustering of only those clusters. Only a small map, generated with a low spread factor, needs to be created at the beginning, which is feasible even for a very large data set. Further analysis is then conducted on selected sections of the data, which are of smaller volume. This method therefore facilitates the analysis of even very large data sets.
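The abstract does not state how the spread factor enters the algorithm. As an illustration only, the sketch below assumes the relation commonly given in the GSOM literature, GT = -D ln(SF), where GT is the growth threshold a node's accumulated error must exceed before new nodes are added, D is the data dimensionality, and SF is the spread factor in (0, 1). The function name `growth_threshold` and its argument names are hypothetical.

```python
import math


def growth_threshold(spread_factor: float, dimensions: int) -> float:
    """Growth threshold under the assumed relation GT = -D * ln(SF).

    A low spread factor gives a high threshold (little growth, small map);
    a high spread factor gives a low threshold (more growth, larger map).
    """
    if not 0.0 < spread_factor < 1.0:
        raise ValueError("spread_factor must lie strictly between 0 and 1")
    if dimensions < 1:
        raise ValueError("dimensions must be a positive integer")
    return -dimensions * math.log(spread_factor)


# Coarse first pass over the whole data set, then a finer pass on a
# selected cluster: only the spread factor changes between the two.
coarse = growth_threshold(spread_factor=0.1, dimensions=4)
fine = growth_threshold(spread_factor=0.9, dimensions=4)
print(coarse > fine)
```

Under this assumed relation, dividing the threshold by D gives the same value for any dimensionality, which is one way to read the claim that the spread factor is independent of the dimensionality of the data.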
