论文信息 - Learning transformations for clustering and classification

Learning transformations for clustering and classification

A low-rank transformation learning framework for subspace clustering and classification is here proposed. Many high-dimensional data, such as face images and motion sequences, approximately lie in a union of low-dimensional subspaces. The corresponding subspace clustering problem has been extensively studied in the literature to partition such high-dimensional data into clusters corresponding to their underlying low-dimensional subspaces. However, low-dimensional intrinsic structures are often violated for real-world observations, as they can be corrupted by errors or deviate from ideal models. We propose to address this by learning a linear transformation on subspaces using matrix rank, via its convex surrogate nuclear norm, as the optimization criteria. The learned linear transformation restores a low-rank structure for data from the same subspace, and, at the same time, forces a a maximally separated structure for data from different subspaces. In this way, we reduce variations within subspaces, and increase separation between subspaces for a more robust subspace clustering. This proposed learned robust subspace clustering framework significantly enhances the performance of existing subspace clustering methods. Basic theoretical results here presented help to further support the underlying framework. To exploit the low-rank structures of the transformed subspaces, we further introduce a fast subspace clustering technique, which efficiently combines robust PCA with sparse modeling. When class labels are present at the training stage, we show this low-rank transformation framework also significantly enhances classification performance. Extensive experiments using public datasets are presented, showing that the proposed approach significantly outperforms state-of-the-art methods for subspace clustering and classification.

Guillermo Sapiro | Qiang Qiu | G. Sapiro | Qiang Qiu

[1] Emmanuel J. Candès,et al. Robust Subspace Clustering , 2013, ArXiv.

[2] S. Shankar Sastry,et al. Generalized principal component analysis (GPCA) , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3] Tommi S. Jaakkola,et al. Maximum-Margin Matrix Factorization , 2004, NIPS.

[4] Terence Sim,et al. The CMU Pose, Illumination, and Expression (PIE) database , 2002, Proceedings of Fifth IEEE International Conference on Automatic Face Gesture Recognition.

[5] Rama Chellappa,et al. Domain Adaptive Dictionary Learning , 2012, ECCV.

[6] A. Robert Calderbank,et al. Communications-Inspired Projection Design with Application to Compressive Sensing , 2012, SIAM J. Imaging Sci..

[7] Yong Yu,et al. Robust Subspace Segmentation by Low-Rank Representation , 2010, ICML.

[8] Ying Wu,et al. A unified approach to salient object detection via low rank matrix recovery , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[9] Yi Ma,et al. TILT: Transform Invariant Low-Rank Textures , 2010, ACCV 2010.

[10] Gert R. G. Lanckriet,et al. Sparse eigen methods by D.C. programming , 2007, ICML '07.

[11] Allen Y. Yang,et al. Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12] T. P. Dinh,et al. Convex analysis approach to d.c. programming: Theory, Algorithm and Applications , 1997 .

[13] Terence Sim,et al. The CMU Pose, Illumination, and Expression Database , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[14] L. Saul,et al. An Introduction to Locally Linear Embedding , 2001 .

[15] Jason Weston,et al. Large Scale Transductive SVMs , 2006, J. Mach. Learn. Res..

[16] Guillermo Sapiro,et al. Learning Efficient Sparse and Low Rank Models , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17] John Wright,et al. RASL: Robust alignment by sparse and low-rank decomposition for linearly correlated images , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[18] Carlos D. Castillo,et al. Using Stereo Matching for 2-D Face Recognition Across Pose , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[19] Ulrike von Luxburg,et al. A tutorial on spectral clustering , 2007, Stat. Comput..

[20] David J. Kriegman,et al. From Few to Many: Illumination Cone Models for Face Recognition under Variable Lighting and Pose , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[21] S T Roweis,et al. Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[22] Guangliang Chen,et al. Spectral Curvature Clustering (SCC) , 2009, International Journal of Computer Vision.

[23] Gert R. G. Lanckriet,et al. A Proof of Convergence of the Concave-Convex Procedure Using Zangwill's Theory , 2012, Neural Computation.

[24] K. Fan,et al. Maximum Properties and Inequalities for the Eigenvalues of Completely Continuous Operators. , 1951, Proceedings of the National Academy of Sciences of the United States of America.

[25] Baoxin Li,et al. Discriminative K-SVD for dictionary learning in face recognition , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[26] Takeo Kanade,et al. Shape and motion from image streams under orthography: a factorization method , 1992, International Journal of Computer Vision.

[27] SapiroGuillermo,et al. Learning transformations for clustering and classification , 2015 .

[28] Alan L. Yuille,et al. The Concave-Convex Procedure , 2003, Neural Computation.

[29] Deva Ramanan,et al. Face detection, pose estimation, and landmark localization in the wild , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[30] Emmanuel J. Candès,et al. A Geometric Analysis of Subspace Clustering with Outliers , 2011, ArXiv.

[31] Marc Pollefeys,et al. A General Framework for Motion Segmentation: Independent, Articulated, Rigid, Non-rigid, Degenerate and Non-degenerate , 2006, ECCV.

[32] Patrice Y. Simard,et al. Metrics and Models for Handwritten Character Recognition , 1998 .

[33] Ronen Basri,et al. Lambertian reflectance and linear subspaces , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[34] Bede Liu,et al. An iterative algorithm for locating the minimal eigenvector of a symmetric matrix , 1984, ICASSP.

[35] Gilad Lerman,et al. Hybrid Linear Modeling via Local Best-Fit Flats , 2010, International Journal of Computer Vision.

[36] René Vidal,et al. Sparse Subspace Clustering: Algorithm, Theory, and Applications , 2012, IEEE transactions on pattern analysis and machine intelligence.

[37] Mikhail Belkin,et al. Laplacian Eigenmaps for Dimensionality Reduction and Data Representation , 2003, Neural Computation.

[38] G. Watson. Characterization of the subdifferential of some matrix norms , 1992 .

[39] Pablo A. Parrilo,et al. Guaranteed Minimum-Rank Solutions of Linear Matrix Equations via Nuclear Norm Minimization , 2007, SIAM Rev..

[40] Larry S. Davis,et al. Learning a discriminative dictionary for sparse coding via label consistent K-SVD , 2011, CVPR 2011.

[41] Carlos D. Castillo,et al. Using Stereo Matching with General Epipolar Geometry for 2D Face Recognition across Pose , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[42] René Vidal,et al. Segmenting Motions of Different Types by Unsupervised Manifold Clustering , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[43] Hans-Peter Kriegel,et al. Subspace clustering , 2012, WIREs Data Mining Knowl. Discov..

[44] Gabriele Steidl,et al. Combined SVM-Based Feature Selection and Classification , 2005, Machine Learning.

[45] Y. C. Pati,et al. Orthogonal matching pursuit: recursive function approximation with applications to wavelet decomposition , 1993, Proceedings of 27th Asilomar Conference on Signals, Systems and Computers.

[46] Yihong Gong,et al. Locality-constrained Linear Coding for image classification , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[47] Sun-Yuan Kung,et al. On gradient adaptation with unit-norm constraints , 2000, IEEE Trans. Signal Process..

[48] Yi Ma,et al. Robust principal component analysis? , 2009, JACM.

[49] V. Kshirsagar,et al. Face recognition using Eigenfaces , 2011, 2011 3rd International Conference on Computer Research and Development.

[50] Huan Xu,et al. Noisy Sparse Subspace Clustering , 2013, J. Mach. Learn. Res..

[51] John Wright,et al. Segmentation of Multivariate Mixed Data via Lossy Data Coding and Compression , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[52] Adi Ben-Israel,et al. On principal angles between subspaces in Rn , 1992 .

[53] G. Sapiro,et al. A collaborative framework for 3D alignment and classification of heterogeneous subvolumes in cryo-electron tomography. , 2013, Journal of structural biology.

[54] Guillermo Sapiro,et al. Learning Transformations for Classification Forests , 2013, ICLR.