论文信息 - Positive Definite Matrices : Data Representation and Applications to Computer Vision

Positive Definite Matrices : Data Representation and Applications to Computer Vision

Numerous applications in computer vision and machine learning rely on representations of data that are compact, discriminative, and robust while satisfying several desirable invariances. One such recently successful representation is offered by symmetric positive definite (SPD) matrices. However, the modeling power of SPD matrices comes at a price: rather than a flat Euclidean view, SPD matrices are more naturally viewed through curved geometry (Riemannian or otherwise) which often complicates matters. We focus on models and algorithms that rely on the geometry of SPD matrices, and make our discussion concrete by casting it in terms of covariance descriptors for images. We summarize various commonly used distance metrics on SPD matrices, before highlighting formulations and algorithms for solving sparse coding and dictionary learning problems involving SPD data. Through empirical results, we showcase the benefits of mathematical models that exploit the curved geometry of SPD data across a diverse set of computer vision applications.

Anoop Cherian | Suvrit Sra | S. Sra | A. Cherian

[1] Sudeep Sarkar,et al. Rate-Invariant Analysis of Trajectories on Riemannian Manifolds with Application in Visual Speech Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[2] Phil Brodatz,et al. Textures: A Photographic Album for Artists and Designers , 1966 .

[3] Laurent Ferro-Famil,et al. Unsupervised classification of multifrequency and fully polarimetric SAR images based on the H/A/Alpha-Wishart classifier , 2001, IEEE Trans. Geosci. Remote. Sens..

[4] S Marcelja,et al. Mathematical description of the responses of simple cortical cells. , 1980, Journal of the Optical Society of America.

[5] Harry Wechsler,et al. The FERET database and evaluation procedure for face-recognition algorithms , 1998, Image Vis. Comput..

[6] Shree K. Nayar,et al. Reflectance and texture of real-world surfaces , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[7] Vassilios Morellas,et al. Compact covariance descriptors in 3D point clouds for object recognition , 2012, 2012 IEEE International Conference on Robotics and Automation.

[8] Brian C. Lovell,et al. Sparse Coding on Symmetric Positive Definite Manifolds Using Bregman Divergences , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[9] Anoop Cherian,et al. Generalized Dictionary Learning for Symmetric Positive Definite Matrices with Application to Nearest Neighbor Retrieval , 2011, ECML/PKDD.

[10] U. Fano. Description of States in Quantum Mechanics by Density Matrix and Operator Techniques , 1957 .

[11] Larry S. Davis,et al. Learning Discriminative Appearance-Based Models Using Partial Least Squares , 2009, 2009 XXII Brazilian Symposium on Computer Graphics and Image Processing.

[12] Xavier Pennec,et al. A Riemannian Framework for Tensor Computing , 2005, International Journal of Computer Vision.

[13] Baba C. Vemuri,et al. On A Nonlinear Generalization of Sparse Coding and Dictionary Learning , 2013, ICML.

[14] Anoop Cherian,et al. Jensen-Bregman LogDet Divergence with Application to Efficient Similarity Search for Covariance Matrices , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15] Fatih Murat Porikli,et al. Covariance Tracking using Model Update Based on Lie Algebra , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[16] J. Borwein,et al. Two-Point Step Size Gradient Methods , 1988 .

[17] Vassilios Morellas,et al. Action recognition using global spatio-temporal features derived from sparse representations , 2014, Comput. Vis. Image Underst..

[18] Matti Pietikäinen,et al. A comparative study of texture measures with classification based on featured distributions , 1996, Pattern Recognit..

[19] Fatih Murat Porikli,et al. Region Covariance: A Fast Descriptor for Detection and Classification , 2006, ECCV.

[20] Vittorio Murino,et al. Multi-class Classification on Riemannian Manifolds for Video Surveillance , 2010, ECCV.

[21] Anoop Cherian,et al. Riemannian Dictionary Learning and Sparse Coding for Positive Definite Matrices , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[22] R. Bhatia. Positive Definite Matrices , 2007 .

[23] Kristen Grauman,et al. Kernelized locality-sensitive hashing for scalable image search , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[24] Anoop Cherian. Nearest Neighbors Using Compact Sparse Codes , 2014, ICML.

[25] Luc Van Gool,et al. Depth and Appearance for Mobile Scene Analysis , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[26] Mehrtash Harandi,et al. Bag of riemannian words for virus classification , 2014 .

[27] Xuelong Li,et al. Gabor-Based Region Covariance Matrices for Face Recognition , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[28] Brian C. Lovell,et al. Sparse Coding and Dictionary Learning for Symmetric Positive Definite Matrices: A Kernel Approach , 2012, ECCV.

[29] Lei Zhang,et al. Log-Euclidean Kernels for Sparse Representation and Dictionary Learning , 2013, 2013 IEEE International Conference on Computer Vision.

[30] Jana Reinhard,et al. Textures A Photographic Album For Artists And Designers , 2016 .

[31] Bamdev Mishra,et al. Manopt, a matlab toolbox for optimization on manifolds , 2013, J. Mach. Learn. Res..

[32] Robert E. Mahony,et al. Optimization Algorithms on Matrix Manifolds , 2007 .

[33] Biju Issac,et al. Case Studies in Intelligent Computing: Achievements and Trends , 2014 .

[34] Takashi Masuko,et al. Covariance clustering on Riemannian manifolds for acoustic model compression , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[35] Jean Ponce,et al. Sparse Modeling for Image and Vision Processing , 2014, Found. Trends Comput. Graph. Vis..

[36] Vassilios Morellas,et al. Tensor Sparse Coding for Region Covariances , 2010, ECCV.

[37] S. Sra. Positive definite matrices and the S-divergence , 2011, 1110.1773.

[38] Thomas S. Huang,et al. Emotion Recognition from Arbitrary View Facial Images , 2010, ECCV.

[39] Jorge Nocedal,et al. On the limited memory BFGS method for large scale optimization , 1989, Math. Program..

[40] Larry S. Davis,et al. Covariance discriminative learning: A natural and efficient approach to image set classification , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[41] Anoop Cherian,et al. Riemannian Sparse Coding for Positive Definite Matrices , 2014, ECCV.

[42] N. Ayache,et al. Log‐Euclidean metrics for fast and simple calculus on diffusion tensors , 2006, Magnetic resonance in medicine.

[43] José Mario Martínez,et al. Nonmonotone Spectral Projected Gradient Methods on Convex Sets , 1999, SIAM J. Optim..

[44] Mark W. Schmidt,et al. Optimizing Costly Functions with Simple Constraints: A Limited-Memory Projected Quasi-Newton Algorithm , 2009, AISTATS.

[45] Hongdong Li,et al. Kernel Methods on the Riemannian Manifold of Symmetric Positive Definite Matrices , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[46] Dieter Fox,et al. A large-scale hierarchical multi-view RGB-D object dataset , 2011, 2011 IEEE International Conference on Robotics and Automation.

[47] Chengjun Liu,et al. Gabor-based kernel PCA with fractional power polynomial models for face recognition , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[48] Vittorio Murino,et al. Kernel-based classification for brain connectivity graphs on the Riemannian manifold of positive definite matrices , 2015, 2015 IEEE 12th International Symposium on Biomedical Imaging (ISBI).

[49] Duc Fehr. Covariance Based Point Cloud Descriptors for Object Detection and Classification , 2013 .

[50] Janusz Konrad,et al. Action Recognition Using Sparse Representation on Covariance Manifolds of Optical Flow , 2010, 2010 7th IEEE International Conference on Advanced Video and Signal Based Surveillance.

[51] Hyeonjoon Moon,et al. The FERET Evaluation Methodology for Face-Recognition Algorithms , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[52] Bingpeng Ma,et al. BiCov: a novel image representation for person re-identification and face verification , 2012, BMVC.

[53] James C. Gee,et al. Spatial transformations of diffusion tensor magnetic resonance images , 2001, IEEE Transactions on Medical Imaging.

[54] José Mario Martínez,et al. Algorithm 813: SPG—Software for Convex-Constrained Optimization , 2001, TOMS.

[55] Mehrtash Tafazzoli Harandi,et al. Riemannian coding and dictionary learning: Kernels to the rescue , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[56] Pierre-Antoine Absil,et al. Trust-Region Methods on Riemannian Manifolds , 2007, Found. Comput. Math..

[57] Michael Elad,et al. Image Denoising Via Learned Dictionaries and Sparse representation , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).