Self-paced Consensus Clustering with Bipartite Graph

Consensus clustering provides a framework for combining multiple clustering results into a single, robust consensus result. Most existing consensus clustering methods use all of the data in ensemble learning and ignore the side effects caused by difficult or unreliable instances. To tackle this problem, we propose a novel self-paced consensus clustering method that gradually brings instances into the ensemble learning, from the more reliable to the less reliable. We first construct an initial bipartite graph from the multiple base clustering results, where the nodes represent the instances and the clusters, and an edge indicates that an instance belongs to a cluster. Then, we learn a structured bipartite graph from the initial one by self-paced learning, i.e., we automatically decide the reliability of each edge and involve the edges in graph learning in order of their reliability. Finally, we obtain the consensus clustering result from the learned bipartite graph. Extensive experimental results demonstrate the effectiveness and superiority of the proposed method.
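As a rough illustration of the first two steps, the sketch below builds the instance-cluster bipartite graph from a few base clusterings and then applies a hard-threshold self-paced selection to its edges. It is a minimal sketch under stated assumptions, not the method proposed here: the function names, the `reliability` vector, the edge loss `1 - reliability`, and the `age` parameter are all hypothetical, and the abstract does not specify how edge reliability is actually learned.

```python
import numpy as np


def build_bipartite_graph(base_labels):
    """Stack the one-hot cluster indicators of each base clustering.

    base_labels: list of 1-D integer label arrays, one per base clustering,
    all of the same length n. Returns an (n, c) 0/1 matrix B, where c is the
    total number of clusters over all base clusterings and B[i, j] = 1 means
    instance i belongs to cluster j.
    """
    blocks = []
    for labels in base_labels:
        labels = np.asarray(labels)
        # Map arbitrary label values to 0..k-1 within this base clustering.
        _, inverse = np.unique(labels, return_inverse=True)
        inverse = inverse.ravel()
        k = int(inverse.max()) + 1
        block = np.zeros((labels.size, k))
        block[np.arange(labels.size), inverse] = 1.0
        blocks.append(block)
    return np.hstack(blocks)


def hard_self_paced_mask(edge_loss, age):
    """Hard-threshold selection: keep entries whose loss is below `age`."""
    return (edge_loss < age).astype(float)


# Two toy base clusterings of five instances.
base_labels = [
    np.array([0, 0, 1, 1, 2]),
    np.array([1, 1, 1, 0, 0]),
]
B = build_bipartite_graph(base_labels)

# Hypothetical per-instance reliability; an edge's loss is 1 - reliability
# of its instance, and non-edges keep loss 1 so they are never selected.
reliability = np.array([0.9, 0.8, 0.3, 0.7, 0.6])
edge_loss = 1.0 - B * reliability[:, None]

# Raising the age threshold admits less reliable edges over time.
early = hard_self_paced_mask(edge_loss, age=0.5) * B
late = hard_self_paced_mask(edge_loss, age=0.8) * B
print(int(early.sum()), int(late.sum()))
```

Each row of `B` has exactly one nonzero entry per base clustering. With the small threshold, only the edges of the unreliable third instance are held back; the larger threshold admits them as well.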
