Self-paced Adaptive Bipartite Graph Learning for Consensus Clustering

Consensus clustering provides an elegant framework to aggregate multiple weak clustering results to learn a consensus one that is more robust and stable than a single result. However, most of the existing methods usually use all data for consensus learning, whereas ignoring the side effects caused by some unreliable or difficult data. To address this issue, in this article, we propose a novel self-paced consensus clustering method with adaptive bipartite graph learning to gradually involve data from more reliable to less reliable ones in consensus learning. At first, we construct an initial bipartite graph from the base results, where the nodes represent the clusters and instances, and the edges indicate that an instance belongs to a cluster. Then, we adaptively learn a structured bipartite graph from this initial one by self-paced learning, i.e., we automatically determine the reliability of each edge with adaptive cluster similarity measuring and involve the edges in bipartite graph learning in order of their reliability. At last, we obtain the final consensus result from the learned structured bipartite graph. We conduct extensive experiments on both toy and benchmark datasets, and the results show the effectiveness and superiority of our method. The codes of this article are released in http://Doctor-Nobody.github.io/codes/code_SCCABG.zip.

[1]  Peng Zhou,et al.  Balanced Spectral Feature Selection , 2022, IEEE Transactions on Cybernetics.

[2]  Peng Zhou,et al.  Clustering ensemble via structured hypergraph learning , 2022, Inf. Fusion.

[3]  En Zhu,et al.  One-Stage Incomplete Multi-view Clustering via Late Fusion , 2021, ACM Multimedia.

[4]  Liang Du,et al.  Tri-level Robust Clustering Ensemble with Multiple Graph Learning , 2021, AAAI.

[5]  Xinwang Liu,et al.  Adaptive Self-Paced Deep Clustering with Data Augmentation , 2020, IEEE Transactions on Knowledge and Data Engineering.

[6]  Liang Bai,et al.  A multiple k-means clustering ensemble algorithm to find nonlinearly separable clusters , 2020, Inf. Fusion.

[7]  Liang Du,et al.  Self-paced Consensus Clustering with Bipartite Graph , 2020, IJCAI.

[8]  Liang Du,et al.  Self-Paced Clustering Ensemble , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[9]  Zenglin Xu,et al.  Self-Paced Deep Regression Forests with Consideration on Underrepresented Samples , 2020, ECCV.

[10]  Chang Tang,et al.  Efficient and Effective Regularized Incomplete Multi-View Clustering , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Zenglin Xu,et al.  Large-scale Multi-view Subspace Clustering in Linear Time , 2019, AAAI.

[12]  Dinggang Shen,et al.  Late Fusion Incomplete Multi-View Clustering , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Jie Zhou,et al.  Ensemble clustering based on dense representation , 2019, Neurocomputing.

[14]  En Zhu,et al.  Multi-view Clustering via Late Fusion Alignment Maximization , 2019, IJCAI.

[15]  Yuhua Qian,et al.  Clustering ensemble based on sample's stability , 2019, Artif. Intell..

[16]  Yun Fu,et al.  Adversarial Graph Embedding for Ensemble Clustering , 2019, IJCAI.

[17]  Fan Ye,et al.  Incremental multi-view spectral clustering , 2019, Knowl. Based Syst..

[18]  Jiancheng Lv,et al.  COMIC: Multi-view Clustering Without Parameter Selection , 2019, ICML.

[19]  Chang-Dong Wang,et al.  Ultra-Scalable Spectral Clustering and Ensemble Clustering , 2019, IEEE Transactions on Knowledge and Data Engineering.

[20]  Yun Fu,et al.  Robust Spectral Ensemble Clustering via Rank Minimization , 2019, ACM Trans. Knowl. Discov. Data.

[21]  H. Parvin,et al.  Elite fuzzy clustering ensemble based on clustering diversity and quality measures , 2018, Applied Intelligence.

[22]  Chang-Dong Wang,et al.  Enhanced Ensemble Clustering via Fast Propagation of Cluster-Wise Similarities , 2018, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[23]  Qingming Huang,et al.  When to Learn What: Deep Cognitive Subspace Clustering , 2018, ACM Multimedia.

[24]  Zenglin Xu,et al.  Self-Paced Multi-Task Clustering , 2018, Neurocomputing.

[25]  Ming Shao,et al.  Infinite ensemble clustering , 2017, Data Mining and Knowledge Discovery.

[26]  Deyu Meng,et al.  A theoretical understanding of self-paced learning , 2017, Inf. Sci..

[27]  Yun Fu,et al.  From Ensemble Clustering to Multi-View Clustering , 2017, IJCAI.

[28]  Deyu Meng,et al.  Co-Saliency Detection via a Self-Paced Multiple-Instance Learning Framework , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29]  Yun Fu,et al.  Robust Spectral Ensemble Clustering , 2016, CIKM.

[30]  Chang-Dong Wang,et al.  Locally Weighted Ensemble Clustering , 2016, IEEE Transactions on Cybernetics.

[31]  Chang-Dong Wang,et al.  Robust Ensemble Clustering Using Probability Trajectories , 2016, IEEE Transactions on Knowledge and Data Engineering.

[32]  Changsheng Li,et al.  Self-Paced Multi-Task Learning , 2016, AAAI.

[33]  Chang-Dong Wang,et al.  Ensemble clustering using factor graph , 2016, Pattern Recognit..

[34]  Kewei Cheng,et al.  Feature Selection , 2016, ACM Comput. Surv..

[35]  H. Parvin,et al.  Clustering ensemble selection considering quality and diversity , 2015, Artificial Intelligence Review.

[36]  Liang Wang,et al.  Incomplete Multi-view Clustering via Subspace Learning , 2015, CIKM.

[37]  Dacheng Tao,et al.  Multi-View Learning With Incomplete Views , 2015, IEEE Transactions on Image Processing.

[38]  Junjie Wu,et al.  Spectral Ensemble Clustering , 2015, KDD.

[39]  Lei Shi,et al.  Learning a Robust Consensus Matrix for Clustering Ensemble via Kullback-Leibler Divergence Minimization , 2015, IJCAI.

[40]  Lei Shi,et al.  Recovery of Corrupted Multiple Kernels for Clustering , 2015, IJCAI.

[41]  B. Minaei-Bidgoli,et al.  A clustering ensemble framework based on selection of fuzzy weighted clusters in a locally adaptive clustering algorithm , 2015, Pattern Analysis and Applications.

[42]  Shiguang Shan,et al.  Self-Paced Curriculum Learning , 2015, AAAI.

[43]  Qi Xie,et al.  Self-Paced Learning for Matrix Factorization , 2015, AAAI.

[44]  Yunjun Gao,et al.  Hybrid clustering solution selection strategy , 2014, Pattern Recognit..

[45]  Feiping Nie,et al.  Clustering and projected clustering with adaptive neighbors , 2014, KDD.

[46]  Feiping Nie,et al.  Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence Multi-View K-Means Clustering on Big Data , 2022 .

[47]  Sumit Basu,et al.  Teaching Classification Boundaries to Humans , 2013, AAAI.

[48]  B. Minaei-Bidgoli,et al.  A clustering ensemble framework based on elite selection of weighted clusters , 2013, Adv. Data Anal. Classif..

[49]  Chun Chen,et al.  Clustering analysis using manifold kernel concept factorization , 2012, Neurocomputing.

[50]  Jiawei Han,et al.  Locally Consistent Concept Factorization for Document Clustering , 2011, IEEE Transactions on Knowledge and Data Engineering.

[51]  Daphne Koller,et al.  Self-Paced Learning for Latent Variable Models , 2010, NIPS.

[52]  Arindam Banerjee,et al.  Bayesian cluster ensembles , 2009, Stat. Anal. Data Min..

[53]  Xiaoli Z. Fern,et al.  Adaptive Cluster Ensemble Selection , 2009, IJCAI.

[54]  Fei Wang,et al.  Generalized Cluster Aggregation , 2009, IJCAI.

[55]  Jason Weston,et al.  Curriculum learning , 2009, ICML '09.

[56]  Chris H. Q. Ding,et al.  Weighted Consensus Clustering , 2008, SDM.

[57]  Chris H. Q. Ding,et al.  Solving Consensus and Semi-supervised Clustering Problems Using Nonnegative Matrix Factorization , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[58]  Wei Tang,et al.  Clusterer ensemble , 2006, Knowl. Based Syst..

[59]  George Karypis,et al.  Empirical and Theoretical Comparisons of Selected Criterion Functions for Document Clustering , 2004, Machine Learning.

[60]  Anil K. Jain,et al.  Combining multiple weak clusterings , 2003, Third IEEE International Conference on Data Mining.

[61]  Joydeep Ghosh,et al.  Cluster Ensembles --- A Knowledge Reuse Framework for Combining Multiple Partitions , 2002, J. Mach. Learn. Res..

[62]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[63]  K. Fan On a Theorem of Weyl Concerning Eigenvalues of Linear Transformations: II. , 1949, Proceedings of the National Academy of Sciences of the United States of America.

[64]  Feiping Nie,et al.  Learning A Structured Optimal Bipartite Graph for Co-Clustering , 2017, NIPS.

[65]  D. Gleich TRUST REGION METHODS , 2017 .

[66]  Jiawei Han,et al.  Multi-View Clustering via Joint Nonnegative Matrix Factorization , 2013, SDM.

[67]  Anil K. Jain,et al.  A Mixture Model for Clustering Ensembles , 2004, SDM.

[68]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[69]  K. Fan On a Theorem of Weyl Concerning Eigenvalues of Linear Transformations I. , 1949, Proceedings of the National Academy of Sciences of the United States of America.