A Fuzzy Clustering Approach Using Reward and Penalty Functions

In this study, the objective function of the most used fuzzy c-means algorithm is reformulated based on two reward and penalty functions. The reward function is defined as the original data in a given dataset, but the penalty function is characterized by a group of additional data in terms of the original data distributions. These additional data are distributed around each group of aggregating original data, and their effects are to enlarge the values of the objective function against the tendency that the determined clustering centers tend these data. Consequently, the fuzzy clustering based on this reformulated objective function achieves two merits: higher accuracy and less time cost. Moreover, after using the two reward and penalty functions, we found that the estimation of the real number of clusters based on a partitioning coefficient function are more accurate than its origin in most datasets. Four successful experiments are present to verify the usefulness of our approach.

[1]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[2]  Ping Li,et al.  Using Greedy algorithm: DBSCAN revisited II , 2004, Journal of Zhejiang University. Science.

[3]  Lawrence O. Hall,et al.  Fast Accurate Fuzzy Clustering through Data Reduction , 2003 .

[4]  Jian Yu,et al.  Cluster Validity and Stability of Clustering Algorithms , 2004, SSPR/SPR.

[5]  Jung-Hsien Chiang,et al.  A new fuzzy cover approach to clustering , 2004, IEEE Trans. Fuzzy Syst..

[6]  Mohamed S. Kamel,et al.  A thresholded fuzzy c-means algorithm for semi-fuzzy clustering , 1991, Pattern Recognit..

[7]  Uzay Kaymak,et al.  Fuzzy clustering with volume prototypes and adaptive cluster merging , 2002, IEEE Trans. Fuzzy Syst..

[8]  郭继东,et al.  A statistical information-based clustering approach in distance space , 2005 .

[9]  Gerardo Beni,et al.  A Validity Measure for Fuzzy Clustering , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Sankar K. Pal,et al.  Fuzzy models for pattern recognition , 1992 .

[11]  Lawrence O. Hall,et al.  Fast fuzzy clustering , 1998, Fuzzy Sets Syst..

[12]  Carl G. Looney A new approach to fuzzy clustering , 2000, Computers and Their Applications.

[13]  Witold Pedrycz,et al.  Linguistic models and linguistic modeling , 1999, IEEE Trans. Syst. Man Cybern. Part B.

[14]  Witold Pedrycz,et al.  P-FCM: a proximity -- based fuzzy clustering , 2004, Fuzzy Sets Syst..

[15]  James M. Keller,et al.  The possibilistic C-means algorithm: insights and recommendations , 1996, IEEE Trans. Fuzzy Syst..

[16]  Jian Yu,et al.  General C-Means Clustering Model , 2005, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Michael K. Ng,et al.  Automated variable weighting in k-means type clustering , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Andrzej Bargiela,et al.  Granular prototyping in fuzzy clustering , 2004, IEEE Transactions on Fuzzy Systems.

[19]  Witold Pedrycz,et al.  Conditional Fuzzy C-Means , 1996, Pattern Recognit. Lett..