Clustering using vector membership: An extension of the Fuzzy C-Means algorithm

Clustering is an important facet of explorative data mining and finds extensive use in several fields. In this paper, we propose an extension of the classical Fuzzy C-Means clustering algorithm. The proposed algorithm, abbreviated as VFC, adopts a multi-dimensional membership vector for each data point instead of the traditional, scalar membership value defined in the original algorithm. The membership vector for each point is obtained by considering each feature of that point separately and obtaining individual membership values for the same. We also propose an algorithm to efficiently allocate the initial cluster centers close to the actual centers, so as to facilitate rapid convergence. Further, we propose a scheme to achieve crisp clustering using the VFC algorithm. The proposed, novel clustering scheme has been tested on two standard data sets in order to analyze its performance. We also examine the efficacy of the proposed scheme by analyzing its performance on image segmentation examples and comparing it with the classical Fuzzy C-means clustering algorithm.

[1]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[2]  Manoranjan Dash,et al.  Entropy-based fuzzy clustering and fuzzy modeling , 2000, Fuzzy Sets Syst..

[3]  James M. Keller,et al.  A possibilistic fuzzy c-means clustering algorithm , 2005, IEEE Transactions on Fuzzy Systems.

[4]  M. Tabakov,et al.  A Fuzzy Clustering Technique for Medical Image Segmentation , 2006, 2006 International Symposium on Evolving Fuzzy Systems.

[5]  G.B. Coleman,et al.  Image segmentation by clustering , 1979, Proceedings of the IEEE.

[6]  Jianhong Wu,et al.  Data clustering - theory, algorithms, and applications , 2007 .

[7]  J. C. Dunn,et al.  A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters , 1973 .

[8]  Yong Yang,et al.  Image Segmentation by Fuzzy C-Means Clustering Algorithm with a Novel Penalty Term , 2007, Comput. Artif. Intell..

[9]  Stephen L. Chiu,et al.  Fuzzy Model Identification Based on Cluster Estimation , 1994, J. Intell. Fuzzy Syst..

[10]  Yuqing Song,et al.  Fuzzy C-Means Clustering for Image Segmentation Using the Adaptive Spatially Median Neighborhood Information , 2010, 2010 Chinese Conference on Pattern Recognition (CCPR).

[11]  James C. Bezdek,et al.  Fuzzy mathematics in pattern classification , 1973 .

[12]  Boudewijn P. F. Lelieveldt,et al.  A multiresolution image segmentation technique based on pyramidal segmentation and fuzzy clustering , 2000, IEEE Trans. Image Process..

[13]  R. Udiljak,et al.  Multipactor breakdown in waveguide irises , 2009, 2009 IEEE International Vacuum Electronics Conference.

[14]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[15]  W. N. Street,et al.  Computerized breast cancer diagnosis and prognosis from fine-needle aspirates. , 1995, Archives of surgery.

[16]  Miin-Shen Yang A survey of fuzzy clustering , 1993 .