Validity-guided (re)clustering with applications to image segmentation

When clustering algorithms are applied to image segmentation, the goal is to solve a classification problem. However, these algorithms do not directly optimize classification duality. As a result, they are susceptible to two problems: 1) the criterion they optimize may not be a good estimator of "true" classification quality, and 2) they often admit many (suboptimal) solutions. This paper introduces an algorithm that uses cluster validity to mitigate problems 1 and 2. The validity-guided (re)clustering (VGC) algorithm uses cluster-validity information to guide a fuzzy (re)clustering process toward better solutions. It starts with a partition generated by a soft or fuzzy clustering algorithm. Then it iteratively alters the partition by applying (novel) split-and-merge operations to the clusters. Partition modifications that result in improved partition validity are retained. VGC is tested on both synthetic and real-world data. For magnetic resonance image (MRI) segmentation, evaluations by radiologists show that VGC outperforms the (unsupervised) fuzzy c-means algorithm, and VGC's performance approaches that of the (supervised) k-nearest-neighbors algorithm.

[1]  J. Bezdek Cluster Validity with Fuzzy Sets , 1973 .

[2]  Josef Kittler,et al.  Pattern recognition : a statistical approach , 1982 .

[3]  P. Sopp Cluster analysis. , 1996, Veterinary immunology and immunopathology.

[4]  Michael Spann,et al.  A new approach to clustering , 1990, Pattern Recognit..

[5]  James C. Bezdek,et al.  A comparison of neural network and fuzzy clustering techniques in segmenting magnetic resonance images of the brain , 1992, IEEE Trans. Neural Networks.

[6]  Michio Sugeno,et al.  A fuzzy-logic-based approach to qualitative modeling , 1993, IEEE Trans. Fuzzy Syst..

[7]  E. Backer,et al.  Cluster analysis by optimal decomposition of induced fuzzy sets , 1978 .

[8]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[9]  Mark A. Gluck,et al.  Information, Uncertainty and the Utility of Categories , 1985 .

[10]  Michael P. Windham,et al.  Cluster Validity for the Fuzzy c-Means Clustering Algorithrm , 1982, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  James C. Bezdek,et al.  Fuzzy cluster validity in magnetic resonance images , 1994, Medical Imaging.

[12]  Settimo Termini,et al.  A Definition of a Nonprobabilistic Entropy in the Setting of Fuzzy Sets Theory , 1972, Inf. Control..

[13]  J. Dunn Well-Separated Clusters and Optimal Fuzzy Partitions , 1974 .

[14]  Vincent Kanade,et al.  Clustering Algorithms , 2021, Wireless RF Energy Transfer in the Massive IoT Era.

[15]  Michael J. Sabin,et al.  Convergence and Consistency of Fuzzy c-means/ISODATA Algorithms , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Brian Everitt,et al.  Cluster analysis , 1974 .

[17]  D. Signorini,et al.  Neural networks , 1995, The Lancet.

[18]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[19]  Gerardo Beni,et al.  A Validity Measure for Fuzzy Clustering , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  M. P. Windham Cluster validity for fuzzy clustering algorithms , 1981 .

[21]  Anil K. Jain,et al.  A Clustering Performance Measure Based on Fuzzy Set Decomposition , 1981, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  D. Matula Graph Theoretic Techniques for Cluster Analysis Algorithms , 1977 .

[23]  Donald W. Bouldin,et al.  A Cluster Separation Measure , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  André Hardy,et al.  An examination of procedures for determining the number of clusters in a data set , 1994 .

[25]  Isak Gath,et al.  Unsupervised Optimal Fuzzy Clustering , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[26]  Raghu Krishnapuram,et al.  Fitting an unknown number of lines and planes to image data through compatible cluster merging , 1992, Pattern Recognit..

[27]  L. Hubert,et al.  Quadratic assignment as a general data analysis strategy. , 1976 .

[28]  Isak Gath,et al.  Detection and Separation of Ring-Shaped Clusters Using Fuzzy Clustering , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[29]  J. Bezdek Numerical taxonomy with fuzzy sets , 1974 .