Similarity search in visual data

Contemporary times have witnessed a significant increase in the amount of data available on the Internet. Organizing such big data so that it is easily and promptly accessible, is a necessity that has been growing in importance. Among the various data modalities such as text, audio, etc., visual data (in the form of images and videos) constitute a major share of this available content. Contrary to other data modalities, visual data pose several significant challenges to storage and retrieval, namely (i) choosing an appropriate representation that can capture the essence of visual data is often non-trivial, and (ii) visual search and retrieval are often subjective, as a result computing semantically meaningful results is hard. On the other hand, visual data possesses rich structure. Exploiting this structure might help address these challenges. Motivated by these observations, this thesis explores new algorithms for efficient similarity search in structured visual data; “structure” is synonymous with the mathematical representation that captures desirable data properties. We will deal with two classes of such structures that are common in computer vision, namely (i) symmetric positive definite matrices as covariances, and (ii) sparse data representations in a dictionary learned from the data. Covariance valued data has found immense success in several mainstream computer vision applications such as visual surveillance, emotion recognition, face recognition, etc. Moreover, it is of fundamental importance in several other disciplines such as magnetic resonance imaging, speech recognition, etc. A technical challenge in computing similarities on such matrix valued data is their non-Euclidean nature. These matrices belong to a curved manifold where distances between data points are no more along straight lines, but along curved geodesics. As a result, state-of-the-art measures for comparing covariances tend to be slow. To address this issue, we propose a novel similarity measure on covariance matrices-the Jensen-Bregman LogDet divergence -which is fast, but at the same time preserves the accuracy of retrieval compared to natural distances on the manifold. To scale our retrieval framework for large covariance datasets, we propose a metric tree data structure on this new measure. Next, as clustering forms an important ingredient for several search algorithms, we investigate this component independently and propose a novel unsupervised algorithm based on the Dirichlet process mixture model for clustering covariance valued data. The second part of this thesis addresses similarity search problems for high dimensional vector valued data. Such data is ubiquitous not only in computer vision, but also in several other disciplines including data mining, machine learning, and robotics. As the dimensionality of the data increases, computing meaningful similarities becomes increasingly difficult due to the curse of dimensionality. Our approach to deal with this problem is inspired from the principles of dictionary learning and sparse coding. Our main idea is to learn an overcomplete dictionary of subspaces from the data so that each data point can be approximated by a sparse linear combination of these subspaces. We introduce a tuple based data descriptor on these sparse combinations-Subspace Combination Tuple-that is storage efficient, fast in retrieval, and provides superior accuracy for NN retrieval against the state-of-the-art. These benefits come at a price; the sparse representations are often sensitive to data perturbations. To circumvent this issue, we propose several algorithms for robust dictionary learning and sparse coding. Extending the sparse coding framework to matrix valued data for hashing covariances forms the content for the third part of this thesis. Towards this end, we propose our novel Generalized dictionary learning framework. We describe the theoretical motivations and provide extensive experimental evidence for demonstrating the benefits of our algorithms.

[1]  Hanan Samet,et al.  Foundations of multidimensional and metric data structures , 2006, Morgan Kaufmann series in data management systems.

[2]  Rui Caseiro,et al.  Foreground Segmentation via Background Modeling on Riemannian Manifolds , 2010, 2010 20th International Conference on Pattern Recognition.

[3]  Zhizhou Wang,et al.  A constrained variational principle for direct estimation and smoothing of the diffusion tensor field from complex DWI , 2004, IEEE Transactions on Medical Imaging.

[4]  Kristen Grauman,et al.  Kernelized locality-sensitive hashing for scalable image search , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[5]  Harry Wechsler,et al.  The FERET database and evaluation procedure for face-recognition algorithms , 1998, Image Vis. Comput..

[6]  Andrew W. Moore,et al.  An Investigation of Practical Approximate Nearest Neighbor Algorithms , 2004, NIPS.

[7]  Antonio Torralba,et al.  Spectral Hashing , 2008, NIPS.

[8]  Michael Elad,et al.  Learning Multiscale Sparse Representations for Image and Video Restoration , 2007, Multiscale Model. Simul..

[9]  Albert Wang,et al.  The In-Crowd Algorithm for Fast Basis Pursuit Denoising , 2011, IEEE Transactions on Signal Processing.

[10]  Vassilios Morellas,et al.  Metric learning for semi-supervised clustering of Region Covariance Descriptors , 2009, 2009 Third ACM/IEEE International Conference on Distributed Smart Cameras (ICDSC).

[11]  Max Donath,et al.  American Control Conference , 1993 .

[12]  James Ze Wang,et al.  Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.

[13]  Daniel Boley,et al.  Symmetrized Bregman Divergences and Metrics , 2009 .

[14]  Lee Atkinson,et al.  Using C , 1990 .

[15]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[16]  Rama Chellappa,et al.  Nearest-neighbor search algorithms on non-Euclidean manifolds for computer vision applications , 2010, ICVGIP '10.

[17]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[18]  W. Reed The Normal-Laplace Distribution and Its Relatives , 2006 .

[19]  Norbert Schuff,et al.  Alzheimer’s Disease Neuroimaging Initiative: A one-year follow up study using Tensor-Based Morphometry correlating degenerative rates, biomarkers and cognition , 2009 .

[20]  Janusz Konrad,et al.  Action Recognition in Video by Covariance Matching of Silhouette Tunnels , 2009, 2009 XXII Brazilian Symposium on Computer Graphics and Image Processing.

[21]  Thomas J. Asaki,et al.  A Variational Approach to Reconstructing Images Corrupted by Poisson Noise , 2007, Journal of Mathematical Imaging and Vision.

[22]  BSTRACT,et al.  1 NONPARAMETRIC FUNCTIONAL DATA ANALYSIS THROUGH BAYESIAN DENSITY ESTIMATION , 2007 .

[23]  G. Sapiro,et al.  Universal priors for sparse modeling , 2009, 2009 3rd IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing (CAMSAP).

[24]  Charles M. Grinstead,et al.  Introduction to probability , 1999, Statistics for the Behavioural Sciences.

[25]  Jitendra Malik,et al.  Shape contexts enable efficient retrieval of similar shapes , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[26]  Guillermo Sapiro,et al.  Supervised Dictionary Learning , 2008, NIPS.

[27]  Piotr Indyk On approximate nearest neighbors in non-Euclidean spaces , 1998, Proceedings 39th Annual Symposium on Foundations of Computer Science (Cat. No.98CB36280).

[28]  Svetlana Lazebnik,et al.  Locality-sensitive binary codes from shift-invariant kernels , 2009, NIPS.

[29]  Inderjit S. Dhillon,et al.  Information-theoretic metric learning , 2006, ICML '07.

[30]  Jitendra Malik,et al.  Learning Globally-Consistent Local Distance Functions for Shape-Based Image Retrieval and Classification , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[31]  T. Ferguson A Bayesian Analysis of Some Nonparametric Problems , 1973 .

[32]  M. West,et al.  Hyperparameter estimation in Dirichlet process mixture models , 1992 .

[33]  Piotr Indyk,et al.  Approximate nearest neighbors: towards removing the curse of dimensionality , 1998, STOC '98.

[34]  Nizar Bouguila,et al.  A Dirichlet Process Mixture of Generalized Dirichlet Distributions for Proportional Data Modeling , 2010, IEEE Transactions on Neural Networks.

[35]  Lei Wang,et al.  Scalable Large-Margin Mahalanobis Distance Metric Learning , 2010, IEEE Transactions on Neural Networks.

[36]  Jimmie D. Lawson,et al.  The Geometric Mean, Matrices, Metrics, and More , 2001, Am. Math. Mon..

[37]  Donald E. Knuth,et al.  Sorting and Searching , 1973 .

[38]  Inderjit S. Dhillon,et al.  Matrix Nearness Problems with Bregman Divergences , 2007, SIAM J. Matrix Anal. Appl..

[39]  Dimitri P. Bertsekas,et al.  Nonlinear Programming , 1997 .

[40]  Chabane Djeraba,et al.  Covariance Matrices for Crowd Behaviour Monitoring on the Escalator Exits , 2008, ISVC.

[41]  Inderjit S. Dhillon,et al.  Low-Rank Kernel Learning with Bregman Matrix Divergences , 2009, J. Mach. Learn. Res..

[42]  Christine Guillemot,et al.  Approximate nearest neighbors using sparse representations , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[43]  Lei Wang,et al.  PSDBoost: Matrix-Generation Linear Programming for Positive Semidefinite Matrices Learning , 2008, NIPS.

[44]  R. Muirhead Aspects of Multivariate Statistical Theory , 1982, Wiley Series in Probability and Statistics.

[45]  Radu Horaud,et al.  SHREC '11: Robust Feature Detection and Description Benchmark , 2011, 3DOR@Eurographics.

[46]  Antonin Guttman,et al.  R-trees: a dynamic index structure for spatial searching , 1984, SIGMOD '84.

[47]  Emmanuel J. Candès,et al.  A Singular Value Thresholding Algorithm for Matrix Completion , 2008, SIAM J. Optim..

[48]  Fatih Murat Porikli,et al.  Human Detection via Classification on Riemannian Manifolds , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[49]  Inderjit S. Dhillon,et al.  Clustering with Bregman Divergences , 2005, J. Mach. Learn. Res..

[50]  Cordelia Schmid,et al.  Learning realistic human actions from movies , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[51]  Ivan W. Selesnick,et al.  The Estimation of Laplace Random Vectors in Additive White Gaussian Noise , 2008, IEEE Transactions on Signal Processing.

[52]  Junfeng Yang,et al.  Alternating Direction Algorithms for 1-Problems in Compressive Sensing , 2009, SIAM J. Sci. Comput..

[53]  A. Bruckstein,et al.  K-SVD : An Algorithm for Designing of Overcomplete Dictionaries for Sparse Representation , 2005 .

[54]  Roded Sharan,et al.  Bayesian haplo-type inference via the dirichlet process , 2004, ICML.

[55]  Léon Bottou,et al.  On-line learning and stochastic approximations , 1999 .

[56]  G. C. Tiao,et al.  Bayesian inference in statistical analysis , 1973 .

[57]  Samy Bengio,et al.  An Online Algorithm for Large Scale Image Similarity Learning , 2009, NIPS.

[58]  Chun Chen,et al.  Speech Emotion Classification on a Riemannian Manifold , 2008, PCM.

[59]  Sergey Brin,et al.  Near Neighbor Search in Large Metric Spaces , 1995, VLDB.

[60]  Radford M. Neal Markov Chain Sampling Methods for Dirichlet Process Mixture Models , 2000 .

[61]  Nicole Immorlica,et al.  Locality-sensitive hashing scheme based on p-stable distributions , 2004, SCG '04.

[62]  Guillermo Sapiro,et al.  Online dictionary learning for sparse coding , 2009, ICML '09.

[63]  Takashi Masuko,et al.  Covariance clustering on Riemannian manifolds for acoustic model compression , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[64]  Steven M. Seitz,et al.  Photo tourism: exploring photo collections in 3D , 2006, ACM Trans. Graph..

[65]  Shih-Fu Chang,et al.  Semi-supervised hashing for scalable image retrieval , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[66]  Frank Nielsen,et al.  Jensen-Bregman Voronoi Diagrams and Centroidal Tessellations , 2010, 2010 International Symposium on Voronoi Diagrams in Science and Engineering.

[67]  Jitendra Malik,et al.  SVM-KNN: Discriminative Nearest Neighbor Classification for Visual Category Recognition , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[68]  K. Kreutz-Delgado,et al.  Sparse image coding using learned overcomplete dictionaries , 2004, Proceedings of the 2004 14th IEEE Signal Processing Society Workshop Machine Learning for Signal Processing, 2004..

[69]  J. T. Robinson,et al.  The K-D-B-tree: a search structure for large multidimensional dynamic indexes , 1981, SIGMOD '81.

[70]  Cordelia Schmid,et al.  Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study , 2006, CVPR Workshops.

[71]  Jian Yu,et al.  Restoration of images corrupted by mixed Gaussian-impulse noise via l1-l0 minimization , 2011, Pattern Recognit..

[72]  Emilio Maggio,et al.  Particle PHD Filtering for Multi-Target Visual Tracking , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[73]  Paul M. Thompson,et al.  Generalized Tensor-Based Morphometry of HIV/AIDS Using Multivariate Statistics on Deformation Tensors , 2008, IEEE Transactions on Medical Imaging.

[74]  E. Candès,et al.  Curvelets: A Surprisingly Effective Nonadaptive Representation for Objects with Edges , 2000 .

[75]  Frank K. Soong,et al.  On divergence based clustering of normal distributions and its application to HMM adaptation , 2003, INTERSPEECH.

[76]  Ronen Basri,et al.  Actions as space-time shapes , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[77]  D. Narmadha,et al.  A Survey on Image Denoising Techniques , 2012 .

[78]  Kenneth L. Clarkson,et al.  An algorithm for approximate closest-point queries , 1994, SCG '94.

[79]  David L Donoho,et al.  Compressed sensing , 2006, IEEE Transactions on Information Theory.

[80]  Lawrence Cayton,et al.  Fast nearest neighbor retrieval for bregman divergences , 2008, ICML '08.

[81]  Michael A. Saunders,et al.  Atomic Decomposition by Basis Pursuit , 1998, SIAM J. Sci. Comput..

[82]  Paul M. Thompson,et al.  3 D pattern of brain atrophy in HIV / AIDS visualized using tensor-based morphometry , 2006 .

[83]  Hao Xu,et al.  Learning Sparse Representations of High Dimensional Data on Large Scale Dictionaries , 2011, NIPS.

[84]  Jorge Nocedal,et al.  On the limited memory BFGS method for large scale optimization , 1989, Math. Program..

[85]  Pavel Zezula,et al.  M-tree: An Efficient Access Method for Similarity Search in Metric Spaces , 1997, VLDB.

[86]  Rachid Deriche,et al.  Unsupervised Segmentation Incorporating Colour, Texture, and Motion , 2003, CAIP.

[87]  Michael Elad,et al.  Efficient Implementation of the K-SVD Algorithm using Batch Orthogonal Matching Pursuit , 2008 .

[88]  Inderjit S. Dhillon,et al.  Online Metric Learning and Fast Similarity Search , 2008, NIPS.

[89]  Inderjit S. Dhillon,et al.  Orthogonal Matching Pursuit with Replacement , 2011, NIPS.

[90]  J. Ibrahim,et al.  Statistical Analysis of Diffusion Tensors in Diffusion-Weighted Magnetic Resonance Imaging Data , 2007 .

[91]  M. R. Osborne,et al.  On the LASSO and its Dual , 2000 .

[92]  Anoop Cherian,et al.  Efficient similarity search for covariance matrices via the Jensen-Bregman LogDet Divergence , 2011, 2011 International Conference on Computer Vision.

[93]  Daniel Fink A Compendium of Conjugate Priors , 1997 .

[94]  Xavier Pennec,et al.  A Riemannian Framework for Tensor Computing , 2005, International Journal of Computer Vision.

[95]  Alan L. Yuille,et al.  The Concave-Convex Procedure , 2003, Neural Computation.

[96]  Frank Nielsen,et al.  On the Centroids of Symmetrized Bregman Divergences , 2007, ArXiv.

[97]  L. Eon Bottou Online Learning and Stochastic Approximations , 1998 .

[98]  Jon M. Kleinberg,et al.  Two algorithms for nearest-neighbor search in high dimensions , 1997, STOC '97.

[99]  Emmanuel J. Candès,et al.  Matrix Completion With Noise , 2009, Proceedings of the IEEE.

[100]  Mohamed-Jalal Fadili,et al.  Wavelets, Ridgelets, and Curvelets for Poisson Noise Removal , 2008, IEEE Transactions on Image Processing.

[101]  Thomas S. Huang,et al.  Emotion Recognition from Arbitrary View Facial Images , 2010, ECCV.

[102]  Alexandr Andoni,et al.  Near-Optimal Hashing Algorithms for Approximate Nearest Neighbor in High Dimensions , 2006, 2006 47th Annual IEEE Symposium on Foundations of Computer Science (FOCS'06).

[103]  Joachim M. Buhmann,et al.  Empirical evaluation of dissimilarity measures for color and texture , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[104]  W. Förstner,et al.  A Metric for Covariance Matrices , 2003 .

[105]  Hyeonjoon Moon,et al.  The FERET evaluation methodology for face-recognition algorithms , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[106]  Christian Böhm,et al.  Searching in high-dimensional spaces: Index structures for improving the performance of multimedia databases , 2001, CSUR.

[107]  Nuno Vasconcelos,et al.  Bridging the Gap: Query by Semantic Example , 2007, IEEE Transactions on Multimedia.

[108]  Michael Elad,et al.  Sparse Representation for Color Image Restoration , 2008, IEEE Transactions on Image Processing.

[109]  Quanquan Gu,et al.  A similarity measure under Log-Euclidean metric for stereo matching , 2008, 2008 19th International Conference on Pattern Recognition.

[110]  Cordelia Schmid,et al.  Product Quantization for Nearest Neighbor Search , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[111]  Erik Learned-Miller,et al.  FDDB: A benchmark for face detection in unconstrained settings , 2010 .

[112]  Frank Nielsen,et al.  The Burbea-Rao and Bhattacharyya Centroids , 2010, IEEE Transactions on Information Theory.

[113]  Charles R. Johnson,et al.  Matrix analysis , 1985, Statistical Inference for Engineers and Data Scientists.

[114]  S. Li Concise Formulas for the Area and Volume of a Hyperspherical Cap , 2011 .

[115]  Sunil Arya,et al.  An optimal algorithm for approximate nearest neighbor searching fixed dimensions , 1998, JACM.

[116]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[117]  L. Bregman The relaxation method of finding the common point of convex sets and its application to the solution of problems in convex programming , 1967 .

[118]  Yuri Ivanov,et al.  Fast Approximate Nearest Neighbor Methods for Non-Euclidean Manifolds with Applications to Human Activity Analysis in Videos , 2010, ECCV.

[119]  Lieven Vandenberghe,et al.  Interior-Point Method for Nuclear Norm Approximation with Application to System Identification , 2009, SIAM J. Matrix Anal. Appl..

[120]  Guillermo Sapiro,et al.  Online Learning for Matrix Factorization and Sparse Coding , 2009, J. Mach. Learn. Res..

[121]  J. Pitman Combinatorial Stochastic Processes , 2006 .

[122]  Martin J. Wainwright,et al.  Image denoising using scale mixtures of Gaussians in the wavelet domain , 2003, IEEE Trans. Image Process..

[123]  Charles E. Heckler,et al.  Applied Multivariate Statistical Analysis , 2005, Technometrics.

[124]  Erik B. Sudderth Graphical models for visual object recognition and tracking , 2006 .

[125]  Thomas S. Huang,et al.  Close the loop: Joint blind image restoration and recognition with sparse representation prior , 2011, 2011 International Conference on Computer Vision.

[126]  Stéphane Mallat,et al.  Matching pursuits with time-frequency dictionaries , 1993, IEEE Trans. Signal Process..

[127]  D. Blackwell,et al.  Ferguson Distributions Via Polya Urn Schemes , 1973 .

[128]  Dinh Van Huynh,et al.  Algebra and Its Applications , 2006 .

[129]  Cordelia Schmid,et al.  Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study , 2006, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06).

[130]  Vassilios Morellas,et al.  Feature-based covariance matching for a moving target in multi-robot following , 2011, 2011 19th Mediterranean Conference on Control & Automation (MED).

[131]  Wei Liu,et al.  Hashing with Graphs , 2011, ICML.

[132]  Subhash C. Kak,et al.  Quantum Information and Entropy , 2006, ArXiv.

[133]  Michael Elad,et al.  Sparse and Redundant Representations - From Theory to Applications in Signal and Image Processing , 2010 .

[134]  N. Ayache,et al.  Log‐Euclidean metrics for fast and simple calculus on diffusion tensors , 2006, Magnetic resonance in medicine.

[135]  Y. Censor,et al.  Parallel Optimization: Theory, Algorithms, and Applications , 1997 .

[136]  Shree K. Nayar,et al.  Reflectance and texture of real-world surfaces , 1999, TOGS.

[137]  Enric Martí,et al.  Manifold parametrization of the left ventricle for a statistical modelling of its complete anatomy , 2010, Medical Imaging.

[138]  Fatih Murat Porikli,et al.  Robust License Plate Detection Using Covariance Descriptor in a Neural Network Framework , 2006, 2006 IEEE International Conference on Video and Signal Based Surveillance.

[139]  Hervé Delingette,et al.  Statistical Comparison of Cardiac Fibre Architectures , 2007, FIMH.

[140]  Fatih Murat Porikli,et al.  Covariance Tracking using Model Update Based on Lie Algebra , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[141]  E. S. Pearson,et al.  On the Problem of the Most Efficient Tests of Statistical Hypotheses , 1933 .

[142]  Fatih Murat Porikli,et al.  Region Covariance: A Fast Descriptor for Detection and Classification , 2006, ECCV.

[143]  Maher Moakher,et al.  Symmetric Positive-Definite Matrices: From Geometry to Applications and Visualization , 2006, Visualization and Processing of Tensor Fields.

[144]  Kjersti Engan,et al.  Multi-frame compression: theory and design , 2000, Signal Process..

[145]  John N. Tsitsiklis,et al.  Introduction to Probability , 2002 .

[146]  Nicholas Ayache,et al.  Geometric Means in a Novel Vector Space Structure on Symmetric Positive-Definite Matrices , 2007, SIAM J. Matrix Anal. Appl..

[147]  Geoffrey E. Hinton,et al.  Semantic hashing , 2009, Int. J. Approx. Reason..

[148]  Volker Roth,et al.  The Translation-invariant Wishart-Dirichlet Process for Clustering Distance Data , 2010, ICML.

[149]  M. Elad,et al.  $rm K$-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation , 2006, IEEE Transactions on Signal Processing.

[150]  Stephen P. Boyd,et al.  An Interior-Point Method for Large-Scale $\ell_1$-Regularized Least Squares , 2007, IEEE Journal of Selected Topics in Signal Processing.

[151]  Inderjit S. Dhillon,et al.  Differential Entropic Clustering of Multivariate Gaussians , 2006, NIPS.

[152]  Adrian Hilton,et al.  A survey of advances in vision-based human motion capture and analysis , 2006, Comput. Vis. Image Underst..

[153]  Shree K. Nayar,et al.  Reflectance and texture of real-world surfaces , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[154]  Arkadi Nemirovski,et al.  Robust solutions of uncertain linear programs , 1999, Oper. Res. Lett..

[155]  Xuelong Li,et al.  Gabor-Based Region Covariance Matrices for Face Recognition , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[156]  Antonio Torralba,et al.  Describing Visual Scenes using Transformed Dirichlet Processes , 2005, NIPS.

[157]  M. Gromov,et al.  Monotonicity of the volume of intersection of balls , 1987 .

[158]  Vassilios Morellas,et al.  Tensor Sparse Coding for Region Covariances , 2010, ECCV.

[159]  Trevor Darrell,et al.  Fast pose estimation with parameter-sensitive hashing , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[160]  David J. Field,et al.  Sparse coding with an overcomplete basis set: A strategy employed by V1? , 1997, Vision Research.

[161]  Anja Vogler,et al.  An Introduction to Multivariate Statistical Analysis , 2004 .

[162]  Xiaoqin Zhang,et al.  Visual tracking via incremental Log-Euclidean Riemannian subspace learning , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[163]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[164]  Jianjun Yu,et al.  A scene recognition algorithm based on covariance descriptor , 2008, 2008 IEEE Conference on Cybernetics and Intelligent Systems.

[165]  J. Sethuraman A CONSTRUCTIVE DEFINITION OF DIRICHLET PRIORS , 1991 .

[166]  S. Sra Positive definite matrices and the Symmetric Stein Divergence , 2011 .

[167]  Bo Ma,et al.  Covariance Matching for PDE-based Contour Tracking , 2011, 2011 Sixth International Conference on Image and Graphics.

[168]  Cordelia Schmid,et al.  A performance evaluation of local descriptors , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[169]  Stephen M. Omohundro,et al.  Five Balltree Construction Algorithms , 2009 .

[170]  Parikshit Ram,et al.  Rank-Approximate Nearest Neighbor Search: Retaining Meaning and Speed in High Dimensions , 2009, NIPS.

[171]  Robert H. Halstead,et al.  Matrix Computations , 2011, Encyclopedia of Parallel Computing.

[172]  Cordelia Schmid,et al.  Human Detection Using Oriented Histograms of Flow and Appearance , 2006, ECCV.

[173]  Nicholas Ayache,et al.  A Riemannian Framework for the Processing of Tensor-Valued Images , 2005, DSSCV.

[174]  I. Daubechies,et al.  An iterative thresholding algorithm for linear inverse problems with a sparsity constraint , 2003, math/0307152.

[175]  Piotr Indyk,et al.  Stable distributions, pseudorandom generators, embeddings, and data stream computation , 2006, JACM.

[176]  Xin-She Yang,et al.  Introduction to Algorithms , 2021, Nature-Inspired Optimization Algorithms.

[177]  J. Borwein,et al.  Two-Point Step Size Gradient Methods , 1988 .

[178]  William M. Rand,et al.  Objective Criteria for the Evaluation of Clustering Methods , 1971 .

[179]  Kazuhiko Sumi,et al.  A New Approach for In-Vehicle Camera Traffic Sign Detection and Recognition , 2009, MVA.

[180]  Gert R. G. Lanckriet,et al.  On the Convergence of the Concave-Convex Procedure , 2009, NIPS.

[181]  Julien Mairal,et al.  Complexity Analysis of the Lasso Regularization Path , 2012, ICML.

[182]  M. L. Eaton Multivariate statistics : a vector space approach , 1985 .

[183]  Matti Pietikäinen,et al.  Matching Groups of People by Covariance Descriptor , 2010, 2010 20th International Conference on Pattern Recognition.

[184]  Rajat Raina,et al.  Efficient sparse coding algorithms , 2006, NIPS.

[185]  I. Dryden,et al.  Non-Euclidean statistics for covariance matrices, with applications to diffusion tensor imaging , 2009, 0910.1656.

[186]  Stephen J. Maybank,et al.  Human Action Recognition under Log-Euclidean Riemannian Metric , 2009, ACCV.

[187]  P. Jaccard,et al.  Etude comparative de la distribution florale dans une portion des Alpes et des Jura , 1901 .

[188]  David Thomas,et al.  The Art in Computer Programming , 2001 .

[189]  James C. Gee,et al.  Spatial transformations of diffusion tensor magnetic resonance images , 2001, IEEE Transactions on Medical Imaging.

[190]  Antonio Torralba,et al.  Object Detection and Localization Using Local and Global Features , 2006, Toward Category-Level Object Recognition.

[191]  Dmitry M. Malioutov,et al.  Homotopy continuation for sparse signal representation , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..