Learning dictionaries for local sparse coding in image classification

Low dimensional embedding of data samples lying on a manifold can be performed using locally linear modeling. By incorporating suitable locality constraints, sparse coding can be adapted to modeling local regions of a manifold. This has been coupled with the spatial pyramid matching algorithm to achieve state-of-the-art performance in object recognition. In this paper, we propose an algorithm to learn dictionaries for computing local sparse codes of descriptors extracted from image patches. The algorithm iterates between a local sparse coding step and an update step that searches for a better dictionary. Evaluation of the local sparse code for a data sample is simplified by first estimating its neighbors using the proposed distance metric and then computing the minimum ℓ1 solution using only the neighbors. The proposed dictionary update ensures that the neighborhood of a training sample is not changed from one iteration to the next. Simulation results demonstrate that the sparse codes computed using the proposed dictionary achieve improved classification accuracies when compared to using a K-means dictionary with standard image datasets.

[1]  Michael A. Saunders,et al.  Atomic Decomposition by Basis Pursuit , 1998, SIAM J. Sci. Comput..

[2]  D. Donoho,et al.  Atomic Decomposition by Basis Pursuit , 2001 .

[3]  R. Tibshirani,et al.  Least angle regression , 2004, math/0406456.

[4]  Pietro Perona,et al.  Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[5]  Lawrence Cayton,et al.  Algorithms for manifold learning , 2005 .

[6]  A. Bruckstein,et al.  K-SVD : An Algorithm for Designing of Overcomplete Dictionaries for Sparse Representation , 2005 .

[7]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[8]  M. Elad,et al.  $rm K$-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation , 2006, IEEE Transactions on Signal Processing.

[9]  G. Griffin,et al.  Caltech-256 Object Category Dataset , 2007 .

[10]  Yihong Gong,et al.  Nonlinear Learning using Local Coordinate Coding , 2009, NIPS.

[11]  Yihong Gong,et al.  Linear spatial pyramid matching using sparse coding for image classification , 2009, CVPR.

[12]  Michael Elad,et al.  Dictionaries for Sparse Representation Modeling , 2010, Proceedings of the IEEE.

[13]  Yihong Gong,et al.  Locality-constrained Linear Coding for image classification , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[14]  Karthikeyan Natesan Ramamurthy,et al.  Optimality and stability of the K-hyperline clustering algorithm , 2011, Pattern Recognit. Lett..

[15]  Andreas Spanias,et al.  Multilevel dictionary learning for sparse representation of images , 2011, 2011 Digital Signal Processing and Signal Processing Education Meeting (DSP/SPE).