Algorithms of crisp, fuzzy, and probabilistic clustering with semi-supervision or pairwise constraints

An overview of several algorithms of semi-supervised clustering or constrained clustering based on crisp, fuzzy, or probabilistic framework is given with new results. First, equivalence between an EM algorithm for a semi-supervised mixture distribution model and an extended version of KL-information fuzzy c-means is shown. Second, algorithms of constrained clustering are compared, where an extended COP K-means is considered. Third class of algorithms is a two-stage version of a combination of COP K-means and agglomerative clustering. Numerical examples are shown to observe characteristics of the algorithms discussed herein.

[1]  Sadaaki Miyamoto,et al.  Semi-supervised agglomerative hierarchical clustering algorithms with pairwise constraints , 2010, International Conference on Fuzzy Systems.

[2]  Alexander J. Smola,et al.  Learning with kernels , 1998 .

[3]  Raymond J. Mooney,et al.  A probabilistic framework for semi-supervised clustering , 2004, KDD.

[4]  Arindam Banerjee,et al.  Active Semi-Supervision for Pairwise Constrained Clustering , 2004, SDM.

[5]  Xiaojin Zhu,et al.  Introduction to Semi-Supervised Learning , 2009, Synthesis Lectures on Artificial Intelligence and Machine Learning.

[6]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[7]  Tomer Hertz,et al.  Computing Gaussian Mixture Models with EM Using Equivalence Constraints , 2003, NIPS.

[8]  Yasunori Endo,et al.  On some hierarchical clustering algorithms using kernel functions , 2004, 2004 IEEE International Conference on Fuzzy Systems (IEEE Cat. No.04CH37542).

[9]  Alexander Zien,et al.  Semi-Supervised Learning , 2006 .

[10]  Claire Cardie,et al.  Proceedings of the Eighteenth International Conference on Machine Learning, 2001, p. 577–584. Constrained K-means Clustering with Background Knowledge , 2022 .

[11]  Hidetomo Ichihashi,et al.  Gaussian Mixture PDF Approximation and Fuzzy c-Means Clustering with Entropy Regularization , 2000 .

[12]  P. Sopp Cluster analysis. , 1996, Veterinary immunology and immunopathology.