Low-rank regularized tensor discriminant representation for image set classification

Abstract Although image set classification has attracted great attention in computer vision and pattern recognition communities, however, learning a compact and discriminative representation is still a challenge. In this paper, we present a novel tensor discriminant representation learning method to better solve the image classification task. Specifically, we first exploit the advantages of Grassmann manifold and tensor to model image sets as a high-order tensor. We then propose a transductive low-rank regularized tensor discriminant representation algorithm referred as LRRTDR to learn more intrinsic representations for image sets. Our proposed LRRTDR mainly contains two components: low-rank tensor embedding and discriminant graph embedding. The low-rank tensor embedding is to learn the lowest-rank representation from a low-dimensional subspace, which is spanned by a set of latent basis matrices. The discriminant graph embedding is to further enhance the discriminant ability of the learned representations under the graph embedding framework. To solve the optimization problem of LRRTDR, we develop an alternating direction scheme based on Iterative Shrinkage Thresholding Algorithm (ISTA). Experimental results on five publicly available datasets demonstrate that our proposed algorithm not only converges with few iterations, but also achieves better accuracy compared with state-of-the-art methods.

[1]  Chunxiao Liu,et al.  Embedding metric learning into set-based face recognition for video surveillance , 2015, Neurocomputing.

[2]  Larry S. Davis,et al.  Covariance discriminative learning: A natural and efficient approach to image set classification , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Changyin Sun,et al.  Kernel Low-Rank Representation for face recognition , 2015, Neurocomputing.

[4]  Brian C. Lovell,et al.  Graph embedding discriminant analysis on Grassmannian manifolds for improved image set matching , 2011, CVPR 2011.

[5]  Daniel D. Lee,et al.  Grassmann discriminant analysis: a unifying view on subspace-based learning , 2008, ICML '08.

[6]  Bernt Schiele,et al.  Analyzing appearance and contour based methods for object categorization , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[7]  Ruiping Wang,et al.  Manifold Discriminant Analysis , 2009, CVPR.

[8]  Pichao Wang,et al.  A Spectral and Spatial Approach of Coarse-to-Fine Blurred Image Region Detection , 2016, IEEE Signal Processing Letters.

[9]  Yi Ma,et al.  Robust principal component analysis? , 2009, JACM.

[10]  Josef Kittler,et al.  Discriminative Learning and Recognition of Image Set Classes Using Canonical Correlations , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Xiao Jin,et al.  High-Order Temporal Correlation Model Learning for Time-Series Prediction , 2019, IEEE Transactions on Cybernetics.

[12]  Brian C. Lovell,et al.  Dictionary Learning and Sparse Coding on Grassmann Manifolds: An Extrinsic Solution , 2013, 2013 IEEE International Conference on Computer Vision.

[13]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[14]  Yong Yu,et al.  Robust Subspace Segmentation by Low-Rank Representation , 2010, ICML.

[15]  John Shawe-Taylor,et al.  Canonical Correlation Analysis: An Overview with Application to Learning Methods , 2004, Neural Computation.

[16]  Jing Zhang,et al.  Tensor-driven low-rank discriminant analysis for image set classification , 2017, Multimedia Tools and Applications.

[17]  David W. Jacobs,et al.  Generalized Multiview Analysis: A discriminative latent space , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[18]  Jiawei Han,et al.  SRDA: An Efficient Algorithm for Large-Scale Discriminant Analysis , 2008, IEEE Transactions on Knowledge and Data Engineering.

[19]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[20]  Jing Zhang,et al.  A Tensor-Driven Temporal Correlation Model for Video Sequence Classification , 2016, IEEE Signal Processing Letters.

[21]  Ke Lu,et al.  Low-Rank Discriminant Embedding for Multiview Learning , 2017, IEEE Transactions on Cybernetics.

[22]  Jun Wang,et al.  LRSR: Low-Rank-Sparse representation for subspace clustering , 2016, Neurocomputing.

[23]  Hakan Cevikalp,et al.  Face recognition based on image sets , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[24]  Mehrtash Tafazzoli Harandi,et al.  Beyond Gauss: Image-Set Matching on the Riemannian Manifold of PDFs , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[25]  Junbin Gao,et al.  Low Rank Representation on Grassmann Manifolds , 2014, ACCV.

[26]  Josef Kittler,et al.  Learning Discriminative Canonical Correlations for Object Recognition with Image Sets , 2006, ECCV.

[27]  J. Ross Beveridge,et al.  Action classification on product manifolds , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[28]  David J. Kriegman,et al.  Video-based face recognition using probabilistic appearance manifolds , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[29]  Huan Xu,et al.  Provable Subspace Clustering: When LRR Meets SSC , 2013, IEEE Transactions on Information Theory.

[30]  Ralph Gross,et al.  The CMU Motion of Body (MoBo) Database , 2001 .

[31]  Peyman Milanfar,et al.  Face Verification Using the LARK Representation , 2011, IEEE Transactions on Information Forensics and Security.

[32]  Jing Zhang,et al.  Low-Rank Regularized Heterogeneous Tensor Decomposition for Subspace Clustering , 2018, IEEE Signal Processing Letters.

[33]  Yun Fu,et al.  Low-Rank Common Subspace for Multi-view Learning , 2014, 2014 IEEE International Conference on Data Mining.

[34]  Guangming Shi,et al.  Low-Rank Tensor Approximation with Laplacian Scale Mixture Modeling for Multiframe Image Denoising , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[35]  Wen Gao,et al.  Manifold-Manifold Distance with application to face recognition based on image set , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[36]  Alan Edelman,et al.  The Geometry of Algorithms with Orthogonality Constraints , 1998, SIAM J. Matrix Anal. Appl..

[37]  Xiaofei He,et al.  Locality Preserving Projections , 2003, NIPS.