Semi-Supervised Fuzzy Clustering with Pairwise-Constrained Competitive Agglomeration

Traditional clustering algorithms usually rely on a pre-defined similarity measure between unlabelled data to attempt to identify natural classes of items. When compared to what a human expert would provide on the same data, the results obtained may be disappointing if the similarity measure employed by the system is too different from the one a human would use. To obtain clusters fitting user expectations better, we can exploit, in addition to the unlabelled data, some limited form of supervision, such as constraints specifying whether two data items belong to a same cluster or not. The resulting approach is called semi-supervised clustering. In this paper, we put forward a new semi-supervised clustering algorithm, pairwise-constrained competitive agglomeration: clustering is performed by minimizing a competitive agglomeration cost function with a fuzzy term corresponding to the violation of constraints. We present comparisons performed on a simple benchmark and on an image database

[1]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[2]  Witold Pedrycz,et al.  Fuzzy clustering with partial supervision , 1997, IEEE Trans. Syst. Man Cybern. Part B.

[3]  Alexander J. Smola,et al.  Learning with kernels , 1998 .

[4]  Claire Cardie,et al.  Proceedings of the Eighteenth International Conference on Machine Learning, 2001, p. 577–584. Constrained K-means Clustering with Background Knowledge , 2022 .

[5]  Nozha Boujemaa,et al.  Upgrading Color Distributions for Image Retrieval: Can We Do Better? , 2000, VISUAL.

[6]  Rajesh N. Dave,et al.  Use Of The Adaptive Fuzzy Clustering Algorithm To Detect Lines In Digital Images , 1990, Other Conferences.

[7]  Arindam Banerjee,et al.  Semi-supervised Clustering by Seeding , 2002, ICML.

[8]  Donald Gustafson,et al.  Fuzzy clustering with a fuzzy covariance matrix , 1978, 1978 IEEE Conference on Decision and Control including the 17th Symposium on Adaptive Processes.

[9]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[10]  N. Boujemaa IKONA: INTERACTIVE SPECIFIC AND GENERIC IMAGE RETRIEVAL , 2003 .

[11]  Rajesh N. Davé,et al.  Characterization and detection of noise in clustering , 1991, Pattern Recognit. Lett..

[12]  R. Mooney,et al.  Comparing and Unifying Search-Based and Similarity-Based Approaches to Semi-Supervised Clustering , 2003 .

[13]  R. Fisher,et al.  Contributions to Mathematical Statistics , 1951 .

[14]  Michael I. Jordan,et al.  Distance Metric Learning with Application to Clustering with Side-Information , 2002, NIPS.

[15]  Nozha Boujemaa On competitive unsupervised clustering , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[16]  James C. Bezdek,et al.  Semi-supervised Point Prototype Clustering , 1998, Int. J. Pattern Recognit. Artif. Intell..

[17]  Carl G. Looney,et al.  Interactive clustering and merging with a new fuzzy expected value , 2002, Pattern Recognit..

[18]  Hichem Frigui,et al.  Clustering by competitive agglomeration , 1997, Pattern Recognit..