Two novel fuzzy clustering methods for solving data clustering problems

In recent years, various data analysis techniques have been developed for extracting meaningful information from real-world data clustering problems. The results, running time, and clustering validity of the techniques are very important. During few decades, fuzzy clustering algorithms and especially the fuzzy c-means FCM algorithm has been widely utilized for solving data clustering problems. The fuzzy c-means algorithm FCM can perform well when applied to noise-free datasets, but performs somewhat poorly when applied to data that have been corrupted with noise, mainly because of the use of the non-robust objective function of FCM and the typical Euclidean distance measure of similarity or dissimilarity. To overcome these shortcomings, this work establishes effective objective functions of fuzzy c-means with the center learning method-based quadratic mean distance, entropy methods, and regularization terms. The effective membership function is derived and center updating by optimizing the proposed effective methods. This work introduces a center learning method to reduce the computational complexity and running time. Also, the proposed methods are applied to artificial data, checkerboard, and real-world datasets to evaluate their performance. The silhouette method is used to find the clustering accuracy of the proposed methods with those of other clustering methods. The experimental results reveal the advantages of the proposed clustering for application to real datasets and random data. They also reveal that the proposed methods outperform the other methods.

[1]  Enrique H. Ruspini,et al.  A New Approach to Clustering , 1969, Inf. Control..

[2]  Yueh-Min Huang,et al.  Adapted Mean Variable Distance to Fuzzy-Cmeans for Effective Image Clustering , 2011, 2011 First International Conference on Robot, Vision and Signal Processing.

[3]  Yueh-Min Huang,et al.  Extended Gaussian kernel version of fuzzy c-means in the problem of data analyzing , 2011, Expert Syst. Appl..

[4]  David H. Wolpert,et al.  No free lunch theorems for optimization , 1997, IEEE Trans. Evol. Comput..

[5]  Stephen L. Chiu,et al.  Fuzzy Model Identification Based on Cluster Estimation , 1994, J. Intell. Fuzzy Syst..

[6]  Sadaaki Miyamoto,et al.  Possibilistic Approach to Kernel-Based Fuzzy c-Means Clustering with Entropy Regularization , 2005, MDAI.

[7]  Yueh-Min Huang,et al.  Standardized course generation process using Dynamic Fuzzy Petri Nets , 2008, Expert Syst. Appl..

[8]  J. C. Dunn,et al.  A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters , 1973 .

[9]  Enrique H. Ruspini,et al.  Numerical methods for fuzzy clustering , 1970, Inf. Sci..

[10]  Vasile Palade,et al.  Building interpretable fuzzy models for high dimensional data analysis in cancer diagnosis , 2011, BMC Genomics.

[11]  Ching-Hsue Cheng,et al.  Data spread-based entropy clustering method using adaptive learning , 2009, Expert Syst. Appl..

[12]  Yen-Ting Lin,et al.  Effectiveness of a Mobile Plant Learning System in a science curriculum in Taiwanese elementary education , 2010, Comput. Educ..

[13]  K. Jajuga L 1 -norm based fuzzy clustering , 1991 .

[14]  James C. Bezdek,et al.  Complexity reduction for "large image" processing , 2002, IEEE Trans. Syst. Man Cybern. Part B.

[15]  James C. Bezdek,et al.  Extending fuzzy and probabilistic clustering to very large data sets , 2006, Comput. Stat. Data Anal..

[16]  Wen-Jyi Hwang,et al.  Nonparametric classifier design using greedy tree-structured vector quantization technique , 1997, Pattern Recognit. Lett..

[17]  James C. Bezdek,et al.  A comparison of neural network and fuzzy clustering techniques in segmenting magnetic resonance images of the brain , 1992, IEEE Trans. Neural Networks.

[18]  Hidetomo Ichihashi,et al.  Fuzzy c-Means Classifier for Incomplete Data Sets with Outliers and Missing Values , 2005, International Conference on Computational Intelligence for Modelling, Control and Automation and International Conference on Intelligent Agents, Web Technologies and Internet Commerce (CIMCA-IAWTIC'06).

[19]  Yong Xu,et al.  Neuro-Fuzzy Ensemble Approach for Microarray Cancer Gene Expression Data Analysis , 2006, 2006 International Symposium on Evolving Fuzzy Systems.

[20]  Mohammad Ghorbani,et al.  Maximum Entropy-Based Fuzzy Clustering by Using L1-norm Space , 2005 .

[21]  Iris Hendrickx,et al.  Hybrid Algorithms with Instance-Based Classification , 2005, ECML.

[22]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[23]  Yueh-Min Huang,et al.  Multiprocessor Task Assignment with Fuzzy Hopfield Neural Network Clustering Technique , 2001, Neural Computing & Applications.

[24]  Shehroz S. Khan,et al.  Cluster center initialization algorithm for K-means clustering , 2004, Pattern Recognit. Lett..