A cluster validation index for GK cluster analysis based on relative degree of sharing

In this paper, the problem of traditional validity indices when applied to the Gustafson-Kessel (GK) clustering are reviewed. A new cluster validity index for the GK algorithm is proposed. This validity index is defined as the average value of the relative degrees of sharing of all possible pairs of fuzzy clusters in the system. It computes the overlap of each pair of fuzzy clusters by considering the degree of sharing of each data point in the overlap. The optimal number of clusters is obtained by minimizing the validity index. Experiments in which the proposed validity index and several traditional validity indices were applied to 6 data sets highlight the superior qualities of the proposed index. The results indicate that the proposed validity index is very reliable.

[1]  Donald Gustafson,et al.  Fuzzy clustering with a fuzzy covariance matrix , 1978, 1978 IEEE Conference on Decision and Control including the 17th Symposium on Adaptive Processes.

[2]  Keon-Myung Lee,et al.  Hierarchical partition of nonstructured concurrent systems , 1997, IEEE Trans. Syst. Man Cybern. Part B.

[3]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[4]  James M. Keller,et al.  Fuzzy Models and Algorithms for Pattern Recognition and Image Processing , 1999 .

[5]  Keon-Myung Lee,et al.  Fuzzy hypergraph and fuzzy partition , 1995, IEEE Trans. Syst. Man Cybern..

[6]  Robert Babuska,et al.  Fuzzy Modeling for Control , 1998 .

[7]  Isak Gath,et al.  Unsupervised Optimal Fuzzy Clustering , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Raghu Krishnapuram,et al.  Fitting an unknown number of lines and planes to image data through compatible cluster merging , 1992, Pattern Recognit..

[9]  Doheon Lee,et al.  Fuzzy cluster validation index based on inter-cluster proximity , 2003, Pattern Recognit. Lett..

[10]  L. Jain,et al.  Fuzzy sets and their application to clustering and training , 2000 .

[11]  Gerardo Beni,et al.  A Validity Measure for Fuzzy Clustering , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Boudewijn P. F. Lelieveldt,et al.  A new cluster validity index for the fuzzy c-mean , 1998, Pattern Recognit. Lett..

[13]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[14]  James C. Bezdek,et al.  On cluster validity for the fuzzy c-means model , 1995, IEEE Trans. Fuzzy Syst..

[15]  J. Bezdek Numerical taxonomy with fuzzy sets , 1974 .

[16]  J. Bezdek Cluster Validity with Fuzzy Sets , 1973 .

[17]  Soon-H. Kwon Cluster validity index for fuzzy clustering , 1998 .

[18]  Sim Heng Ong,et al.  On post-clustering evaluation and modification , 2000, Pattern Recognit. Lett..

[19]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .