A directed batch growing approach to enhance the topology preservation of self-organizing map

Display Omitted A potent batch growing approach for GSOM was proposed.Neuron insertion rules were defined to manage the map growth in proper directions.DBGSOM performs much better than GSOM and SOM in term of topology preservation. The growing self-organizing map (GSOM) possesses effective capability to generate feature maps and visualizing high-dimensional data without pre-determining their size. Most of the proposed growing SOM algorithms use an incremental learning strategy. The conventional growing approach of GSOM is based on filling all available position around the candidate neuron which can decrease the topology preservation quality of the map due to the misconfiguration and twisting of the map which could be a consequence of unexpected network growth and improper neuron addition and weight initialization. To overcome this problem, in this paper we introduce a batch learning strategy for growing self-organizing maps called DBGSOM which direct the growing process based on the accumulative error around the candidate boundary neuron. In the proposed growing approach, just one new neuron is added around each candidate boundary neuron. The DBGSOM offers suitable mechanisms to find a proper growing positions and allocating initial weight vectors for the new neurons.The potential of the DBGSOM was investigated with one synthetic dataset and six real-world benchmark datasets in terms of topology preservation and mapping quality. Experimental results showed that the proposed growing strategy provides an enhanced topology preserved map and reduces the susceptibility of twisting compared to GSOM. Furthermore, the proposed method has a better clustering ability than GSOM and SOM. According to the lower number of neurons generated by DBGSOM, it needs less time to learn the manifold of the data points compared to GSOM.

[1]  Donald W. Bouldin,et al.  A Cluster Separation Measure , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Max A. Little,et al.  Exploiting Nonlinear Recurrence and Fractal Scaling Properties for Voice Disorder Detection , 2007 .

[3]  Joseph Rynkiewicz,et al.  Self-organizing map algorithm and distortion measure , 2006, Neural Networks.

[4]  Rowena Chau,et al.  Cluster identification and separation in the growing self-organizing map: application in protein sequence classification , 2009, Neural Computing and Applications.

[5]  Thouraya Ayadi,et al.  MIGSOM: Multilevel Interior Growing Self-Organizing Maps for High Dimensional Data Clustering , 2012, Neural Processing Letters.

[6]  José Manuel Amigo,et al.  A chemometric approach to the environmental problem of predicting toxicity in contaminated sediments , 2010 .

[7]  Thomas Villmann,et al.  Topology preservation in self-organizing feature maps: exact definition and measurement , 1997, IEEE Trans. Neural Networks.

[8]  Kadim Tasdemir,et al.  Topology-Based Hierarchical Clustering of Self-Organizing Maps , 2011, IEEE Transactions on Neural Networks.

[9]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[10]  Saman K. Halgamuge,et al.  Dynamic Self-Organising Maps: Theory, Methods and Applications , 2009, Foundations of Computational Intelligence.

[11]  Download Book,et al.  Information Visualization in Data Mining and Knowledge Discovery , 2001 .

[12]  C. Borland,et al.  Effect Size , 2019, SAGE Research Methods Foundations.

[13]  Hujun Yin,et al.  Learning Nonlinear Principal Manifolds by Self-Organising Maps , 2008 .

[14]  Saman K. Halgamuge,et al.  An unsupervised hierarchical dynamic self-organizing approach to cancer class discovery and marker gene identification in microarray data , 2003, Bioinform..

[15]  Bala Srinivasan,et al.  Dynamic self-organizing maps with controlled growth for knowledge discovery , 2000, IEEE Trans. Neural Networks Learn. Syst..

[16]  I. Cuthill,et al.  Effect size, confidence interval and statistical significance: a practical guide for biologists , 2007, Biological reviews of the Cambridge Philosophical Society.

[17]  Kate Smith-Miles,et al.  HDGSOM: a modified growing self-organizing map for high dimensional data clustering , 2004, Fourth International Conference on Hybrid Intelligent Systems (HIS'04).

[18]  O. Mangasarian,et al.  Multisurface method of pattern separation for medical diagnosis applied to breast cytology. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[19]  R. J. Kuo,et al.  Integration of growing self-organizing map and continuous genetic algorithm for grading lithium-ion battery cells , 2012, Appl. Soft Comput..

[20]  T. Kohonen,et al.  Bibliography of Self-Organizing Map SOM) Papers: 1998-2001 Addendum , 2003 .

[21]  Wei-Shen Tai,et al.  Growing Self-Organizing Map with cross insert for mixed-type data clustering , 2012, Appl. Soft Comput..

[22]  Ujjwal Maulik,et al.  Performance Evaluation of Some Clustering Algorithms and Validity Indices , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Saman K. Halgamuge,et al.  Enhancement of topology preservation and hierarchical dynamic self-organising maps for data visualisation , 2003, Int. J. Approx. Reason..

[24]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[25]  Bernd Fritzke Growing Grid — a self-organizing network with constant neighborhood range and adaptation strength , 1995, Neural Processing Letters.

[26]  Andreas Rauber,et al.  The growing hierarchical self-organizing map: exploratory analysis of high-dimensional data , 2002, IEEE Trans. Neural Networks.

[27]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[28]  Damminda Alahakoon,et al.  Batch implementation of Growing Self-Organizing Map , 2006, 2006 International Conference on Computational Inteligence for Modelling Control and Automation and International Conference on Intelligent Agents Web Technologies and International Commerce (CIMCA'06).

[29]  Erzsébet Merényi,et al.  Exploiting Data Topology in Visualization and Clustering of Self-Organizing Maps , 2009, IEEE Transactions on Neural Networks.

[30]  Damminda Alahakoon,et al.  Dynamic self organizing maps for discovery and sharing of knowledge in multi agent systems , 2005, Web Intell. Agent Syst..

[31]  Limin Fu,et al.  FLAME, a novel fuzzy clustering method for the analysis of DNA microarray data , 2007, BMC Bioinformatics.

[32]  Risto Miikkulainen,et al.  Incremental grid growing: encoding high-dimensional structure into a two-dimensional feature map , 1993, IEEE International Conference on Neural Networks.

[33]  Jonathan M. Garibaldi,et al.  Using Rule-Based Machine Learning for Candidate Disease Gene Prioritization and Sample Classification of Cancer Gene Expression Data , 2012, PloS one.

[34]  Aluizio F. R. Araújo,et al.  Growing Self-Organizing Maps for Surface Reconstruction from Unstructured Point Clouds , 2007, 2007 International Joint Conference on Neural Networks.

[35]  Tharam S. Dillon,et al.  Automated knowledge acquisition , 1994, Prentice Hall International series in computer science and engineering.

[36]  Samuel Kaski,et al.  Mining massive document collections by the WEBSOM method , 2004, Inf. Sci..

[37]  M. Forina,et al.  Multivariate data analysis as a discriminating method of the origin of wines , 2015 .

[38]  G. Ball,et al.  RERG (Ras-like, oestrogen-regulated, growth-inhibitor) expression in breast cancer: a marker of ER-positive luminal-like subtype , 2011, Breast Cancer Research and Treatment.

[39]  Se Won Kim,et al.  A self-growing and Self-Organizing Batch Map with automatic stopping condition , 2013, 2013 5th International Conference on Knowledge and Smart Technology (KST).

[40]  Juha Vesanto,et al.  On the Decomposition of the Self-Organizing Map Distortion Measure , 2003 .

[41]  Thouraya Ayadi,et al.  A new data topology matching technique with Multilevel Interior Growing Self-Organizing Maps , 2010, 2010 IEEE International Conference on Systems, Man and Cybernetics.

[42]  Mahdi Vasighi,et al.  Genetic Algorithms for architecture optimisation of Counter-Propagation Artificial Neural Networks , 2011 .

[43]  Thomas Martinetz,et al.  Topology representing networks , 1994, Neural Networks.

[44]  Chao Shao,et al.  Manifold Learning and Visualization Based on Dynamic Self-Organizing Map , 2015 .

[45]  Thouraya Ayadi,et al.  2IBGSOM: interior and irregular boundaries growing self-organizing maps , 2007, Sixth International Conference on Machine Learning and Applications (ICMLA 2007).

[46]  Andreas Rauber,et al.  The growing hierarchical self-organizing map , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[47]  Christopher A. Badurek,et al.  Review of Information visualization in data mining and knowledge discovery by Usama Fayyad, Georges G. Grinstein, and Andreas Wierse. Morgan Kaufmann 2002 , 2003 .