Effective fuzzy c-means clustering algorithms for data clustering problems

Clustering is a well known technique in identifying intrinsic structures and find out useful information from large amount of data. One of the most extensively used clustering techniques is the fuzzy c-means algorithm. However, computational task becomes a problem in standard objective function of fuzzy c-means due to large amount of data, measurement uncertainty in data objects. Further, the fuzzy c-means suffer to set the optimal parameters for the clustering method. Hence the goal of this paper is to produce an alternative generalization of FCM clustering techniques in order to deal with the more complicated data; called quadratic entropy based fuzzy c-means. This paper is dealing with the effective quadratic entropy fuzzy c-means using the combination of regularization function, quadratic terms, mean distance functions, and kernel distance functions. It gives a complete framework of quadratic entropy approaching for constructing effective quadratic entropy based fuzzy clustering algorithms. This paper establishes an effective way of estimating memberships and updating centers by minimizing the proposed objective functions. In order to reduce the number iterations of proposed techniques this article proposes a new algorithm to initialize the cluster centers. In order to obtain the cluster validity and choosing the number of clusters in using proposed techniques, we use silhouette method. First time, this paper segments the synthetic control chart time series directly using our proposed methods for examining the performance of methods and it shows that the proposed clustering techniques have advantages over the existing standard FCM and very recent ClusterM-k-NN in segmenting synthetic control chart time series.

[1]  Sadaaki Miyamoto,et al.  Possibilistic Approach to Kernel-Based Fuzzy c-Means Clustering with Entropy Regularization , 2005, MDAI.

[2]  Lawrence O. Hall,et al.  Kernel Based Fuzzy Ant Clustering with Partition Validity , 2006, 2006 IEEE International Conference on Fuzzy Systems.

[3]  Samir Brahim Belhaouari Fast and Accuracy Control Chart Pattern Recognition using a New cluster-k-Nearest Neighbor , 2009 .

[4]  Shigeru Okuma,et al.  Statistical Mechanical Analysis of Fuzzy Clustering Based on Fuzzy Entropy , 2007, IEICE Trans. Inf. Syst..

[5]  Sadaaki Miyamoto,et al.  METHODS OF FUZZY C-MEANS AND POSSIBILISTIC CLUSTERING USING A QUADRATIC TERM , 2004 .

[6]  Hidetomo Ichihashi,et al.  Fuzzy c-Means Classifier for Incomplete Data Sets with Outliers and Missing Values , 2005, International Conference on Computational Intelligence for Modelling, Control and Automation and International Conference on Intelligent Agents, Web Technologies and Internet Commerce (CIMCA-IAWTIC'06).

[7]  John A. Hartigan,et al.  Clustering Algorithms , 1975 .

[8]  James C. Bezdek,et al.  Extending fuzzy and probabilistic clustering to very large data sets , 2006, Comput. Stat. Data Anal..

[9]  Ching-Hsue Cheng,et al.  Data spread-based entropy clustering method using adaptive learning , 2009, Expert Syst. Appl..

[10]  R. J. Alcock,et al.  Time-Series Similarity Queries Employing a Feature-Based Approach , 1999 .

[11]  R. Devi,et al.  Robust kernel FCM in segmentation of breast medical images , 2011, Expert Syst. Appl..

[12]  Lisa Singh,et al.  Privacy Preserving Burst Detection of Distributed Time Series Data Using Linear Transforms , 2007, 2007 IEEE Symposium on Computational Intelligence and Data Mining.

[13]  Qiang Wang,et al.  Time series analysis with multiple resolutions , 2010, Inf. Syst..

[14]  Padhraic Smyth,et al.  Model selection for probabilistic clustering using cross-validated likelihood , 2000, Stat. Comput..

[15]  J. Bezdek Cluster Validity with Fuzzy Sets , 1973 .

[16]  Mohammad Ghorbani,et al.  Maximum Entropy-Based Fuzzy Clustering by Using L1-norm Space , 2005 .

[17]  Magnus Lie Hetland A Survey of Recent Methods for Efficient Retrieval of Similar Time Sequences , 2001 .

[18]  Manoranjan Dash,et al.  Entropy-based fuzzy clustering and fuzzy modeling , 2000, Fuzzy Sets Syst..

[19]  Stephen L. Chiu,et al.  Fuzzy Model Identification Based on Cluster Estimation , 1994, J. Intell. Fuzzy Syst..

[20]  James C. Bezdek,et al.  Complexity reduction for "large image" processing , 2002, IEEE Trans. Syst. Man Cybern. Part B.

[21]  Daoqiang Zhang,et al.  Robust image segmentation using FCM with spatial constraints based on new kernel-induced distance measure , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[22]  Shehroz S. Khan,et al.  Cluster center initialization algorithm for K-means clustering , 2004, Pattern Recognit. Lett..

[23]  Qiang Wang,et al.  A dimensionality reduction technique for efficient time series similarity analysis , 2008, Inf. Syst..