Paper information - Riemannian coding and dictionary learning: Kernels to the rescue

While sparse coding on non-flat Riemannian manifolds has recently become increasingly popular, existing solutions either are dedicated to specific manifolds, or rely on optimization problems that are difficult to solve, especially when it comes to dictionary learning. In this paper, we propose to make use of kernels to perform coding and dictionary learning on Riemannian manifolds. To this end, we introduce a general Riemannian coding framework with its kernel-based counterpart. This lets us (i) generalize beyond the special case of sparse coding; (ii) introduce efficient solutions to two coding schemes; (iii) learn the kernel parameters; (iv) perform unsupervised and supervised dictionary learning in a much simpler manner than previous Riemannian coding methods. We demonstrate the effectiveness of our approach on three different types of non-flat manifolds, and illustrate its generality by applying it to Euclidean spaces, which also are Riemannian manifolds.

Mehrtash Tafazzoli Harandi | Mathieu Salzmann
