Locally discriminative spectral clustering with composite manifold

A large number of data are generated in many real-world applications, e.g., photos of albums in social networks. Discovering meaningful patterns from them is desirable and still remains a big challenge. To this end, spectral clustering has established itself as a very useful tool for data analysis. It considers the manifold geometrical structure of the data and desires to estimate the intrinsic manifold. However, there exists no principled way for estimating such manifold. Thus, the clustering performance might even degrade seriously when the estimated manifold deviates far from the intrinsic manifold. To address this problem, we propose to employ composite manifold to approximate the intrinsic manifold as much as possible. This composite manifold is derived from a convex combination of some pre-given candidate manifolds in a convex hull. The diversity nature of these manifolds provides richer structure information, which is helpful to maximally estimate the intrinsic manifold. Besides, traditional spectral clustering neglects the discriminant information latent in the data space, so we incorporate the locally discriminative structure into the partition matrix by explicitly using local linear regression, for better clustering. Therefore, in this paper, we present an integrated clustering approach named Spectral Clustering via Composite manifold and Local discriminant learning, referred to as SCCL for short, which addresses the aforementioned two problems. To optimize the objective function, an alternating optimization framework is adopted to derive both the cluster membership matrix and the composite manifold coefficient. Extensive experiments are carried out on several real-world databases. Results have validated the efficacy of the proposed algorithm.

[1]  Chun Chen,et al.  Clustering analysis using manifold kernel concept factorization , 2012, Neurocomputing.

[2]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[3]  Kenneth Steiglitz,et al.  Combinatorial Optimization: Algorithms and Complexity , 1981 .

[4]  LiuZongyi,et al.  The HumanID Gait Challenge Problem , 2005 .

[5]  Ivor W. Tsang,et al.  Spectral Embedded Clustering: A Framework for In-Sample and Out-of-Sample Spectral Clustering , 2011, IEEE Transactions on Neural Networks.

[6]  Chun Chen,et al.  Relational co-clustering via manifold ensemble learning , 2012, CIKM.

[7]  C. Ding,et al.  On the Equivalence of Nonnegative Matrix Factorization and K-means - Spectral Clustering , 2005 .

[8]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[9]  Greg Hamerly,et al.  Learning the k in k-means , 2003, NIPS.

[10]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[11]  Sudeep Sarkar,et al.  The humanID gait challenge problem: data sets, performance, and analysis , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Xian-Sheng Hua,et al.  Ensemble Manifold Regularization , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[14]  Marc Teboulle,et al.  Mirror descent and nonlinear projected subgradient methods for convex optimization , 2003, Oper. Res. Lett..

[15]  Jiawei Han,et al.  Locally Consistent Concept Factorization for Document Clustering , 2011, IEEE Transactions on Knowledge and Data Engineering.

[16]  Michael J. Lyons,et al.  Automatic Classification of Single Facial Images , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  David J. Kriegman,et al.  From Few to Many: Illumination Cone Models for Face Recognition under Variable Lighting and Pose , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Hujun Bao,et al.  Laplacian Regularized Gaussian Mixture Model for Data Clustering , 2011, IEEE Transactions on Knowledge and Data Engineering.

[19]  Bernhard Schölkopf,et al.  A Local Learning Approach for Clustering , 2006, NIPS.

[20]  Pietro Perona,et al.  Self-Tuning Spectral Clustering , 2004, NIPS.

[21]  Ivor W. Tsang,et al.  Dynamic vehicle routing with stochastic requests , 2003, IJCAI 2003.

[22]  Andrew W. Moore,et al.  X-means: Extending K-means with Efficient Estimation of the Number of Clusters , 2000, ICML.

[23]  James Ze Wang,et al.  Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.

[24]  Jianbo Shi,et al.  Multiclass spectral clustering , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[25]  Nenghai Yu,et al.  Annotating personal albums via web mining , 2008, ACM Multimedia.

[26]  Rongrong Ji,et al.  Nonnegative Spectral Clustering with Discriminative Regularization , 2011, AAAI.

[27]  Xiaojun Wu,et al.  Graph Regularized Nonnegative Matrix Factorization for Data Representation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[29]  D. Ruppert The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .

[30]  Deng Cai,et al.  Manifold Adaptive Experimental Design for Text Categorization , 2012, IEEE Transactions on Knowledge and Data Engineering.

[31]  Ran He,et al.  Robust Principal Component Analysis Based on Maximum Correntropy Criterion , 2011, IEEE Transactions on Image Processing.

[32]  Ran He,et al.  Agglomerative Mean-Shift Clustering , 2012, IEEE Transactions on Knowledge and Data Engineering.

[33]  Fei Wang,et al.  Clustering with Local and Global Regularization , 2007, IEEE Transactions on Knowledge and Data Engineering.

[34]  Jiawei Han,et al.  Speed up kernel discriminant analysis , 2011, The VLDB Journal.

[35]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[36]  Zaïd Harchaoui,et al.  DIFFRAC: a discriminative and flexible framework for clustering , 2007, NIPS.

[37]  Mikhail Belkin,et al.  Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples , 2006, J. Mach. Learn. Res..

[38]  Y. Jiang,et al.  Spectral Clustering on Multiple Manifolds , 2011, IEEE Transactions on Neural Networks.

[39]  Panos M. Pardalos,et al.  Combinatorial Optimization Algorithms , 2013 .

[40]  Yi Yang,et al.  Ranking with local regression and global alignment for cross media retrieval , 2009, ACM Multimedia.

[41]  Jieping Ye,et al.  Discriminative K-means for Clustering , 2007, NIPS.

[42]  James C. Bezdek,et al.  Convergence of Alternating Optimization , 2003, Neural Parallel Sci. Comput..

[43]  Chun Chen,et al.  Locally Discriminative Coclustering , 2012, IEEE Transactions on Knowledge and Data Engineering.

[44]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[45]  Shih-Fu Chang,et al.  Image Retrieval: Current Techniques, Promising Directions, and Open Issues , 1999, J. Vis. Commun. Image Represent..

[46]  Wei Liu,et al.  Nonnegative Local Coordinate Factorization for Image Representation , 2011, IEEE Transactions on Image Processing.

[47]  Christopher M. Bishop,et al.  Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[48]  Edward Y. Chang,et al.  Parallel Spectral Clustering in Distributed Systems , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[49]  Yi Yang,et al.  Image Clustering Using Local Discriminant Models and Global Integration , 2010, IEEE Transactions on Image Processing.