Robust optimal graph clustering

Abstract Most graph-based clustering methods separate the graph construction and clustering into two independent processes. The manually pre-constructed graph may not be suitable for the subsequent clustering. Moreover, as real world data generally contains noises and outliers, the similarity graph directly learned from them will be unreliable and further impair the subsequent clustering performance. To tackle the problems, in this paper, we propose a novel clustering framework where a robust graph is learned with noise removal, and simultaneously, with desirable clustering structure. To this end, we first learn a discriminative representation of data samples via sparse reconstruction. Then, a robust graph is automatically constructed with adaptive neighbors to each data sample. Simultaneously, a reasonable rank constraint is imposed on the Laplacian matrix of similarity graph to pursue the ideal clustering structure, where the number of connected components in the learned graph is exactly equal to the number of clusters. We finally derive an alternate optimization algorithm guaranteed with convergence to solve the formulated unified learning framework to achieve better prediction accuracy. Experiments on both synthetic and real datasets demonstrate the superior performance of the proposed method compared with several state-of-the-art clustering techniques.

[1]  Feiping Nie,et al.  Clustering and projected clustering with adaptive neighbors , 2014, KDD.

[2]  Rajat Raina,et al.  Efficient sparse coding algorithms , 2006, NIPS.

[3]  Jonathan J. Hull,et al.  A Database for Handwritten Text Recognition Research , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Yang Liu,et al.  Learning with Adaptive Neighbors for Image Clustering , 2018, IJCAI.

[5]  Ivor W. Tsang,et al.  Dynamic vehicle routing with stochastic requests , 2003, IJCAI 2003.

[6]  David J. Kriegman,et al.  Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection , 1996, ECCV.

[7]  K. Fan On a Theorem of Weyl Concerning Eigenvalues of Linear Transformations: II. , 1949, Proceedings of the National Academy of Sciences of the United States of America.

[8]  Feiping Nie,et al.  Orthogonal and Nonnegative Graph Reconstruction for Large Scale Clustering , 2017, IJCAI.

[9]  B. Mohar THE LAPLACIAN SPECTRUM OF GRAPHS y , 1991 .

[10]  Xiaofei He,et al.  Robust non-negative matrix factorization , 2011 .

[11]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[12]  Jane You,et al.  Robust Dual Clustering with Adaptive Manifold Regularization , 2017, IEEE Transactions on Knowledge and Data Engineering.

[13]  Thomas S. Huang,et al.  Graph Regularized Nonnegative Matrix Factorization for Data Representation. , 2011, IEEE transactions on pattern analysis and machine intelligence.

[14]  Chun Chen,et al.  Graph Regularized Sparse Coding for Image Representation , 2011, IEEE Transactions on Image Processing.

[15]  Xuelong Li,et al.  Multi-View Clustering and Semi-Supervised Classification with Adaptive Neighbours , 2017, AAAI.

[16]  Shichao Zhang,et al.  Robust Joint Graph Sparse Coding for Unsupervised Spectral Feature Selection , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[17]  Andrew B. Kahng,et al.  New spectral methods for ratio cut partitioning and clustering , 1991, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..

[18]  Michael J. Lyons,et al.  Automatic Classification of Single Facial Images , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  Nicu Sebe,et al.  Flexible Manifold Learning With Optimal Graph for Image and Video Representation , 2018, IEEE Transactions on Image Processing.

[20]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[21]  Jitendra Malik,et al.  Normalized Cuts and Image Segmentation , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Feiping Nie,et al.  The Constrained Laplacian Rank Algorithm for Graph-Based Clustering , 2016, AAAI.

[23]  Xuelong Li,et al.  Unsupervised Feature Selection with Structured Graph Optimization , 2016, AAAI.

[24]  T. Steihaug The Conjugate Gradient Method and Trust Regions in Large Scale Optimization , 1983 .

[25]  Jiawei Han,et al.  Locally Consistent Concept Factorization for Document Clustering , 2011, IEEE Transactions on Knowledge and Data Engineering.