Semi-supervised Fuzzy c-Means Clustering Using Clusterwise Tolerance Based Pairwise Constraints

Recently, semi-supervised clustering has been remarked and discussed in many research fields. In semi-supervised clustering, prior knowledge or information are often formulated as pairwise constraints, that is, must-link and cannot-link. Such pairwise constraints are frequently used in order to improve clustering properties. In this paper, we will propose a new semi-supervised fuzzy c-means clustering by using clusterwise tolerance and pairwise constraints. First, the concept of clusterwise tolerance and pairwise constraints are introduced. Second, the optimization problem of fuzzy cmeans clustering using clusterwise tolerance based pairwise constraint is formulated. Especially, must-link constraint is considered and introduced as pairwise constraints. Third, a new clustering algorithm is constructed based on the above discussions. Finally, the effectiveness of proposed algorithm is verified through numerical examples.

[1]  Endo Yasunori,et al.  On semi-supervised fuzzy c-means clustering , 2009, 2009 IEEE International Conference on Fuzzy Systems.

[2]  Arindam Banerjee,et al.  Active Semi-Supervision for Pairwise Constrained Clustering , 2004, SDM.

[3]  Sadaaki Miyamoto,et al.  On Tolerant Fuzzy c-Means Clustering with L1-Regularization , 2009, IFSA/EUSFLAT Conf..

[4]  Sadaaki Miyamoto,et al.  On Tolerant Fuzzy c-Means , 2008 .

[5]  Inderjit S. Dhillon,et al.  Semi-supervised graph clustering: a kernel approach , 2005, ICML '05.

[6]  Alexander Zien,et al.  Semi-Supervised Learning , 2006 .

[7]  Dan Klein,et al.  From Instance-level Constraints to Space-Level Constraints: Making the Most of Prior Knowledge in Data Clustering , 2002, ICML.

[8]  Carlotta Domeniconi,et al.  An Adaptive Kernel Method for Semi-supervised Clustering , 2006, ECML.

[9]  Raymond J. Mooney,et al.  A probabilistic framework for semi-supervised clustering , 2004, KDD.

[10]  S. S. Ravi,et al.  Agglomerative Hierarchical Clustering with Constraints: Theoretical and Empirical Results , 2005, PKDD.

[11]  Sadaaki Miyamoto,et al.  Fuzzy c-means as a regularization and maximum entropy approach , 1997 .

[12]  Claire Cardie,et al.  Constrained K-means Clustering with Background Knowledge , 2001, ICML.

[13]  Sadaaki Miyamoto,et al.  On Tolerant Fuzzy c -Means Clustering , 2009, J. Adv. Comput. Intell. Intell. Informatics.

[14]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[15]  Sadaaki Miyamoto,et al.  Algorithms for Fuzzy Clustering - Methods in c-Means Clustering with Applications , 2008, Studies in Fuzziness and Soft Computing.

[16]  Javier Béjar,et al.  Integrating Declarative Knowledge in Hierarchical Clustering Tasks , 1999, IDA.