Performance evaluation of FMIG clustering using fuzzy validity indexes

The clustering of high-dimensional data presents a critical computational problem. Therefore, it is convenient to arrange the cluster centres on a grid with a small dimensional space that reduces computational cost and can be easily visualized. Moreover, in real applications there is no sharp boundary between classes, real datasets are naturally defined in a fuzzy context. Therefore, fuzzy clustering fits better for complex real datasets to determine the best distribution. Self-organizing map (SOM) technique is appropriate for clustering and vector quantization of high-dimensional data. In this paper we present a new fuzzy learning approach called FMIG (fuzzy multilevel interior growing self-organizing maps). The proposed algorithm is a fuzzy version of MIGSOM (multilevel interior growing self-organizing maps). The main contribution of FMIG is to define a fuzzy process of data mapping and to take into account the fuzzy aspect of high-dimensional real datasets. This new algorithm is able to auto-organize the map accordingly to the fuzzy training property of the nodes. In the second step, the introduced scheme for FMIG is clustered by means of fuzzy C-means (FCM) to discover the interior class distribution of the learned data. The validation of FCM partitions is directed by applying six validity indexes. Superiority of the new method is demonstrated by comparing it with crisp MIGSOM, GSOM (growing SOM) and FKCN (fuzzy Kohonen clustering network) techniques. Thus, our new method shows improvement in term of quantization error, topology preservation and clustering ability.

[1]  Miin-Shen Yang,et al.  A cluster validity index for fuzzy clustering , 2005, Pattern Recognit. Lett..

[2]  Teuvo Kohonen,et al.  The self-organizing map , 1990, Neurocomputing.

[3]  W. T. Tucker,et al.  Convergence theory for fuzzy c-means: Counterexamples and repairs , 1987, IEEE Transactions on Systems, Man, and Cybernetics.

[4]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[5]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[6]  T. Kohonen,et al.  Statistical pattern recognition with neural networks: benchmarking studies , 1988, IEEE 1988 International Conference on Neural Networks.

[7]  Vadlamani Ravi,et al.  Soft computing system for bank performance prediction , 2008, Appl. Soft Comput..

[8]  Bernd Fritzke,et al.  Growing cell structures--A self-organizing network for unsupervised and supervised learning , 1994, Neural Networks.

[9]  Doheon Lee,et al.  On cluster validity index for estimation of the optimal number of fuzzy clusters , 2004, Pattern Recognit..

[10]  T. Kohonen Self-organized formation of topographically correct feature maps , 1982 .

[11]  J. Dunn Well-Separated Clusters and Optimal Fuzzy Partitions , 1974 .

[12]  Adel M. Alimi,et al.  Beta fuzzy logic systems approximation properties in the mimo case , 2003 .

[13]  Glenn Fung,et al.  Reducing a Biomarkers List via Mathematical Programming: Application to Gene Signatures to Detect Time-Dependent Hypoxia in Cancer , 2007, ICMLA 2007.

[14]  Swagatam Das,et al.  Automatic Clustering Using an Improved Differential Evolution Algorithm , 2007 .

[15]  Tieniu Tan,et al.  Learning activity patterns using fuzzy self-organizing neural network , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[16]  Teuvo Kohonen,et al.  Self-organized formation of topologically correct feature maps , 2004, Biological Cybernetics.

[17]  Minho Kim,et al.  New indices for cluster validity assessment , 2005, Pattern Recognit. Lett..

[18]  Ajith Abraham,et al.  A neuro-fuzzy approach for modelling electricity demand in Victoria , 2001, Appl. Soft Comput..

[19]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[20]  J. Bezdek,et al.  Fuzzy partitions and relations; an axiomatic basis for clustering , 1978 .

[21]  A. Alimi Beta neuro-fuzzy systems , 2003 .

[22]  M. Emin Yüksel,et al.  A simple neuro-fuzzy impulse detector for efficient blur reduction of impulse noise removal operators for digital images , 2004, IEEE Transactions on Fuzzy Systems.

[23]  Kate Smith-Miles,et al.  HDGSOM: a modified growing self-organizing map for high dimensional data clustering , 2004, Fourth International Conference on Hybrid Intelligent Systems (HIS'04).

[24]  A. G. Frenich,et al.  Application of the Kohonen neural network in coastal water management: methodological development for the assessment and prediction of water quality. , 2001, Water research.

[25]  Thouraya Ayadi,et al.  Movie scenes detection with MIGSOM based on shots semi-supervised clustering , 2012, Neural Computing and Applications.

[26]  Ferenc Szeifert,et al.  Fuzzy Self-Organizing Map based on Regularized Fuzzy c-means Clustering , 2003 .

[27]  Thouraya Ayadi,et al.  2IBGSOM: interior and irregular boundaries growing self-organizing maps , 2007, ICMLA 2007.

[28]  Thouraya Ayadi,et al.  A new data topology matching technique with Multilevel Interior Growing Self-Organizing Maps , 2010, 2010 IEEE International Conference on Systems, Man and Cybernetics.

[29]  J. Bezdek Cluster Validity with Fuzzy Sets , 1973 .

[30]  Frank Pettersson,et al.  A genetic algorithms based multi-objective neural net applied to noisy blast furnace data , 2007, Appl. Soft Comput..

[31]  James C. Bezdek,et al.  On cluster validity for the fuzzy c-means model , 1995, IEEE Trans. Fuzzy Syst..

[32]  Thouraya Ayadi,et al.  FMIG: Fuzzy Multilevel Interior Growing Self-Organizing Maps , 2012, 2012 IEEE 24th International Conference on Tools with Artificial Intelligence.

[33]  Amit Konar,et al.  Automatic kernel clustering with a Multi-Elitist Particle Swarm Optimization Algorithm , 2008, Pattern Recognit. Lett..

[34]  J M Carazo,et al.  Mapping and fuzzy classification of macromolecular images using self-organizing neural networks. , 2000, Ultramicroscopy.

[35]  Wen-Jyi Hwang,et al.  Efficient Fuzzy C-Means Architecture for Image Segmentation , 2011, Sensors.

[36]  Gerardo Beni,et al.  A Validity Measure for Fuzzy Clustering , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[37]  Rached Tourki,et al.  Problems in pattern classification in high dimensional spaces: behavior of a class of combined neuro-fuzzy classifiers , 2002, Fuzzy Sets Syst..

[38]  Fakhri Karray,et al.  Enhancing the structure and parameters of the centers for BBF Fuzzy Neural Network classifier construction based on data structure , 2008, 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence).

[39]  Pei-Chann Chang,et al.  Combining SOM and fuzzy rule base for flow time prediction in semiconductor manufacturing factory , 2006, Appl. Soft Comput..

[40]  A. Alimi,et al.  On the use of cluster validity for evaluation of MIGSOM clustering , 2011, 2011 5th International Symposium on Computational Intelligence and Intelligent Informatics (ISCIII).

[41]  F. Palis,et al.  Modeling and control of non-linear systems using soft computing techniques , 2007, Appl. Soft Comput..

[42]  Teuvo Kohonen,et al.  Self-Organization and Associative Memory , 1988 .

[43]  Adel M. Alimi,et al.  An Iterative Method for Deciding SVM and Single Layer Neural Network Structures , 2011, Neural Processing Letters.

[44]  Adel M. Alimi,et al.  Hierarchical Learning Algorithm for the Beta Basis Function Neural Network , 2012, ArXiv.

[45]  Thouraya Ayadi,et al.  MIGSOM: Multilevel Interior Growing Self-Organizing Maps for High Dimensional Data Clustering , 2012, Neural Processing Letters.

[46]  G. Tsekouras,et al.  A new approach for measuring the validity of the fuzzy c -means algorithm , 2004 .

[47]  A. L. Hsu,et al.  Enhanced topology preservation of Dynamic Self-Organising Maps for data visualisation , 2001, Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569).

[48]  James C. Bezdek,et al.  Prototype classification and feature selection with fuzzy sets , 1977, IEEE Transactions on Systems, Man, and Cybernetics.

[49]  Fakhri Karray,et al.  Hierarchical genetic algorithm with new evaluation function and bi-coded representation for the selection of features considering their confidence rate , 2011, Appl. Soft Comput..