Self-Organizing Map Convergence

Self-organizing maps are artificial neural networks designed for unsupervised machine learning. Here in this article, the authors introduce a new quality measure called the convergence index. The convergence index is a linear combination of map embedding accuracy and estimated topographic accuracy and since it reports a single statistically meaningful number it is perhaps more intuitive to use than other quality measures. The convergence index in the context of clustering problems was proposed by Ultsch as part of his fundamental clustering problem suite as well as real world datasets. First demonstrated is that the convergence index captures the notion that a SOM has learned the multivariate distribution of a training data set by looking at the convergence of the marginals. The convergence index is then used to study the convergence of SOMs with respect to the different parameters that govern self-organizing map learning. One result is that the constant neighborhood function produces better self-organizing map models than the popular Gaussian neighborhood function.

[1]  Lutz Hamel,et al.  Improved Interpretability of the Unified Distance Matrix with Connected Components To Appear Proceedings of DMIN ’ 11 , 2011 .

[2]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[3]  Jan van den Berg,et al.  Credit Rating Classification Using Self-Organizing Maps , 2002 .

[4]  R. Wilcox Kolmogorov–Smirnov Test , 2005 .

[5]  Hsin-Chang Yang,et al.  Mining text documents for thematic hierarchies using self-organizing maps , 2003 .

[6]  Iren Valova,et al.  CQoCO: A measure for comparative quality of coverage and organization for self-organizing maps , 2010, Neurocomputing.

[7]  Daisuke Kihara,et al.  Characterization and Classification of Local Protein Surfaces Using Self-Organizing Map , 2010, Int. J. Knowl. Discov. Bioinform..

[8]  Minyar Sassi Hidri,et al.  Hybrid Segmentation Prototype for Arabic Text-Based Documents: Towards Plagiarism Detection , 2015, Int. J. Serv. Sci. Manag. Eng. Technol..

[9]  Lutz Hamel,et al.  Bayesian Probability Approach to Feature Significance for Infrared Spectra of Bacteria , 2012 .

[10]  D. Polani Measures for the organization of self-organizing maps , 2001 .

[11]  Lili Zhang,et al.  Learning Highly Structured Manifolds: Harnessing the Power of SOMs , 2009, Similarity-Based Clustering.

[12]  Alfred Ultsch,et al.  Self Organized Feature Maps for Monitoring and Knowledge Aquisition of a Chemical Process , 1993 .

[13]  Stefan Kühne,et al.  Classification of Technical Operation Services for Renewable Energy Power Plants , 2013, Int. J. Serv. Sci. Manag. Eng. Technol..

[14]  Thomas Villmann,et al.  Topology preservation in self-organizing feature maps: exact definition and measurement , 1997, IEEE Trans. Neural Networks.

[15]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[16]  Hujun Yin,et al.  On the Distribution and Convergence of Feature Space in Self-Organizing Maps , 1995, Neural Computation.

[17]  Fedja Hadzic,et al.  Outlier Detection Strategy Using the Self-Organizing Map , 2007 .

[18]  Lutz Hamel,et al.  A Population Based Convergence Criterion for Self-Organizing Maps , 2012 .

[19]  Robert Tibshirani,et al.  An Introduction to the Bootstrap , 1994 .

[20]  Georg Pölzlbauer Survey and Comparison of Quality Measures for Self-Organizing Maps , 2004 .

[21]  Andreas Rauber,et al.  Analytic Comparison of Self-Organising Maps , 2009, WSOM.

[22]  Antônio Miguel Vieira Monteiro,et al.  Using Self-Organizing Maps for Rural Territorial Typology , 2011 .

[23]  Lutz Hamel,et al.  SOM Quality Measures: An Efficient Statistical Approach , 2016, WSOM.

[24]  T. Villmann,et al.  Topology Preservation in Self-Organizing Maps , 1999 .

[25]  Jay Kandampully,et al.  Using Word Trees, Word Networks, and Tag Clouds to Provide Meaningful Insights: A Case Study of Electronic Word of Mouth for Hotel Service Delivery , 2013, Int. J. Serv. Sci. Manag. Eng. Technol..

[26]  P. Thall,et al.  Some covariance models for longitudinal count data with overdispersion. , 1990, Biometrics.