Supervised local sparse coding of sub-image features for image retrieval

The success of sparse representations in image modeling and recovery has motivated their use in computer vision applications. Image retrieval and classification tasks require extracting features that discriminate between image classes. State-of-the-art object recognition methods based on sparse coding use spatial pyramid features obtained from dense descriptors. In this paper, we develop a feature extraction method that uses multiple global and local features extracted from large overlapping regions of an image, which we refer to as sub-images. We propose a procedure for dictionary design and supervised local sparse coding of the heterogeneous sub-image features. We perform image retrieval on the Microsoft Research Cambridge image dataset and show that the proposed features outperform spatial pyramid features obtained using dense descriptors.
