Grassmannian sparse representations

Abstract. We present Grassmannian sparse representations (GSR), a sparse representation Grassmann learning framework for efficient classification. Sparse representation classification offers a powerful approach for recognition in a variety of contexts. However, a major drawback of sparse representation methods is their computational performance and memory utilization for high-dimensional data. A Grassmann manifold is a space that promotes smooth surfaces where points represent subspaces and the relationship between points is defined by the mapping of an orthogonal matrix. Grassmann manifolds are well suited for computer vision problems because they promote high between-class discrimination and within-class clustering, while offering computational advantages by mapping each subspace onto a single point. The GSR framework combines Grassmannian kernels and sparse representations, including regularized least squares and least angle regression, to improve high accuracy recognition while overcoming the drawbacks of performance and dependencies on high dimensional data distributions. The effectiveness of GSR is demonstrated on computationally intensive multiview action sequences, three-dimensional action sequences, and face recognition datasets.

[1]  Martin Jaggi,et al.  An Exponential Lower Bound on the Complexity of Regularization Paths , 2009, J. Comput. Geom..

[2]  H. Nkansah Least squares optimization with L1-norm regularization , 2017 .

[3]  Tanaya Guha,et al.  Learning Sparse Representations for Human Action Recognition , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  David J. Kriegman,et al.  Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection , 1996, ECCV.

[5]  P. Absil,et al.  Riemannian Geometry of Grassmann Manifolds with a View on Algorithmic Computation , 2004 .

[6]  R. Tibshirani,et al.  Least angle regression , 2004, math/0406456.

[7]  Moataz M. Abdelwahab,et al.  2DHOOF-2DPCA contour based optical flow algorithm for human activity recognition , 2013, 2013 IEEE 56th International Midwest Symposium on Circuits and Systems (MWSCAS).

[8]  Marios Savvides,et al.  The multifactor extension of Grassmann manifolds for face recognition , 2011, Face and Gesture 2011.

[9]  Fuji Ren,et al.  Detect and track the dynamic deformation human body with the active shape model modified by motion vectors , 2011, 2011 IEEE International Conference on Cloud Computing and Intelligence Systems.

[10]  Alejandro F. Frangi,et al.  Two-dimensional PCA: a new approach to appearance-based face representation and recognition , 2004 .

[11]  Moataz M. Abdelwahab,et al.  Multi-view human action recognition system employing 2DPCA , 2011, 2011 IEEE Workshop on Applications of Computer Vision (WACV).

[12]  Guangfeng Lin,et al.  Human Action Recognition Using Latent-Dynamic Condition Random Fields , 2009, 2009 International Conference on Artificial Intelligence and Computational Intelligence.

[13]  Yuemin Zhu,et al.  Sparse representation based MRI denoising with total variation , 2008, 2008 9th International Conference on Signal Processing.

[14]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[15]  Matti Pietikäinen,et al.  Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  John M. Lee Introduction to Smooth Manifolds , 2002 .

[17]  Ci Wang,et al.  Noisy image super-resolution with sparse mixing estimators , 2011, 2011 4th International Congress on Image and Signal Processing.

[18]  David J. Kriegman,et al.  Recognition using class specific linear projection , 1997 .

[19]  Yongtian Wang,et al.  Sparse representation for action recognition , 2010, 2010 3rd International Congress on Image and Signal Processing.

[20]  Geoffrey B. West,et al.  The origin of universal scaling laws in biology , 1999 .

[21]  B. Scholkopf,et al.  Fisher discriminant analysis with kernels , 1999, Neural Networks for Signal Processing IX: Proceedings of the 1999 IEEE Signal Processing Society Workshop (Cat. No.98TH8468).

[22]  Allen Y. Yang,et al.  Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Andreas E. Savakis,et al.  Grassmannian Sparse Representations and Motion Depth Surfaces for 3D Action Recognition , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[24]  David J. Kriegman,et al.  Acquiring linear subspaces for face recognition under variable lighting , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Mubarak Shah,et al.  Human Action Recognition in Videos Using Kinematic Features and Multiple Instance Learning , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Wanqing Li,et al.  Action recognition based on a bag of 3D points , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[27]  Yihong Gong,et al.  Linear spatial pyramid matching using sparse coding for image classification , 2009, CVPR.

[28]  Ioannis Pitas,et al.  The i3DPost Multi-View and 3D Human Action/Interaction Database , 2009, 2009 Conference for Visual Media Production.

[29]  Matti Pietikäinen,et al.  Face Description with Local Binary Patterns: Application to Face Recognition , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30]  Paul Van Dooren,et al.  An efficient particle filtering technique on the Grassmann manifold , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[31]  Xihong Wu,et al.  Boosting Local Binary Pattern (LBP)-Based Face Recognition , 2004, SINOBIOMETRICS.

[32]  Mubarak Shah,et al.  Learning 4D action feature models for arbitrary view action recognition , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[33]  Xiaoyang Tan,et al.  Enhanced Local Texture Feature Sets for Face Recognition Under Difficult Lighting Conditions , 2007, IEEE Transactions on Image Processing.

[34]  Julien Mairal,et al.  Complexity Analysis of the Lasso Regularization Path , 2012, ICML.

[35]  Krishna P. Gummadi,et al.  Measurement and analysis of online social networks , 2007, IMC '07.

[36]  Baba C. Vemuri,et al.  On A Nonlinear Generalization of Sparse Coding and Dictionary Learning , 2013, ICML.

[37]  Brian C. Lovell,et al.  Kernel analysis over Riemannian manifolds for visual recognition of actions, pedestrians and textures , 2012, 2012 IEEE Workshop on the Applications of Computer Vision (WACV).

[38]  Brian C. Lovell,et al.  Improved Image Set Classification via Joint Sparse Approximated Nearest Subspaces , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[39]  Guillermo Sapiro,et al.  Sparse Representation for Computer Vision and Pattern Recognition , 2010, Proceedings of the IEEE.

[40]  Kazufumi Kaneda,et al.  Face sequence recognition using Grassmann Distances and Grassmann Kernels , 2012, The 2012 International Joint Conference on Neural Networks (IJCNN).

[41]  Pingkun Yan,et al.  Alternatively Constrained Dictionary Learning For Image Superresolution , 2014, IEEE Transactions on Cybernetics.

[42]  James W. Davis,et al.  The representation and recognition of human movement using temporal templates , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[43]  Bo Zhang,et al.  General image classification based on sparse representation , 2010, 9th IEEE International Conference on Cognitive Informatics (ICCI'10).

[44]  Shuicheng Yan,et al.  Neighborhood preserving embedding , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[45]  Rama Chellappa,et al.  Statistical analysis on Stiefel and Grassmann manifolds with applications in computer vision , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[46]  Jiawei Han,et al.  Spectral Regression for Efficient Regularized Subspace Learning , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[47]  Yaakov Tsaig,et al.  Fast Solution of $\ell _{1}$ -Norm Minimization Problems When the Solution May Be Sparse , 2008, IEEE Transactions on Information Theory.

[48]  David J. Kriegman,et al.  Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection , 1996, ECCV.

[49]  Hongdong Li,et al.  Kernel Methods on the Riemannian Manifold of Symmetric Positive Definite Matrices , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[50]  Guillermo Sapiro,et al.  Online Learning for Matrix Factorization and Sparse Coding , 2009, J. Mach. Learn. Res..

[51]  Ajmal S. Mian,et al.  Sparse approximated nearest points for image set classification , 2011, CVPR 2011.

[52]  Chun Chen,et al.  Graph Regularized Sparse Coding for Image Representation , 2011, IEEE Transactions on Image Processing.

[53]  Joseph Moses Juran,et al.  The Non-Pareto Principle; Mea Culpa , 1994 .

[54]  Andreas E. Savakis,et al.  A spatiotemporal descriptor based on radial distances and 3D joint tracking for action classification , 2012, 2012 19th IEEE International Conference on Image Processing.

[55]  Rémi Ronfard,et al.  Free viewpoint action recognition using motion history volumes , 2006, Comput. Vis. Image Underst..

[56]  Brian C. Lovell,et al.  Graph embedding discriminant analysis on Grassmannian manifolds for improved image set matching , 2011, CVPR 2011.

[57]  Shuiwang Ji,et al.  SLEP: Sparse Learning with Efficient Projections , 2011 .

[58]  Liang-Tien Chia,et al.  Sparse Representation With Kernels , 2013, IEEE Transactions on Image Processing.

[59]  Xiaofei He,et al.  Locality Preserving Projections , 2003, NIPS.

[60]  Andreas E. Savakis,et al.  LGE-KSVD: Robust Sparse Representation Classification , 2014, IEEE Transactions on Image Processing.

[61]  Yang Yang,et al.  Human action recognition using sparse representation , 2009, 2009 IEEE International Conference on Intelligent Computing and Intelligent Systems.

[62]  Yuxiao Hu,et al.  Face recognition using Laplacianfaces , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[63]  Peyman Milanfar,et al.  Action Recognition from One Example , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[64]  Michael Elad,et al.  Stable recovery of sparse overcomplete representations in the presence of noise , 2006, IEEE Transactions on Information Theory.

[65]  Z. Liu,et al.  A real time system for dynamic hand gesture recognition with a depth sensor , 2012, 2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO).