Orthogonal and Smooth Subspace Based on Sparse Coding for Image Classification

Many real-world problems usually deal with high-dimensional data, such as images, videos, text, web documents and so on. In fact, the classification algorithms used to process these high-dimensional data often suffer from the low accuracy and high computational complexity. Therefore, we propose a framework of transforming images from a high-dimensional image space to a low-dimensional target image space, based on learning an orthogonal smooth subspace for the SIFT sparse codes SC-OSS. It is a two stage framework for subspace learning. Firstly, a sparse coding followed by spatial pyramid max pooling is used to get the image representation. Then, the image descriptor is mapped into an orthonormal and smooth subspace to classify images in low dimension. The proposed algorithm adds the orthogonality and a Laplacian smoothing penalty to constrain the projective function coefficient to be orthogonal and spatially smooth. The experimental results on the public datasets have shown that the proposed algorithm outperforms other subspace methods.

[1]  Furui Liu,et al.  Locality enhanced spectral embedding and spatially smooth spectral regression for face recognition , 2012, 2012 IEEE International Conference on Information and Automation.

[2]  Jieping Ye,et al.  Characterization of a Family of Algorithms for Generalized Discriminant Analysis on Undersampled Problems , 2005, J. Mach. Learn. Res..

[3]  Thomas Serre,et al.  Object recognition with features inspired by visual cortex , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[4]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[5]  Rajat Raina,et al.  Efficient sparse coding algorithms , 2006, NIPS.

[6]  Aijun Xu,et al.  An Image Classification Algorithm Based on Bag of Visual Words and Multi-kernel Learning , 2014, J. Multim..

[7]  Yihong Gong,et al.  Linear spatial pyramid matching using sparse coding for image classification , 2009, CVPR.

[8]  Feiping Nie,et al.  Learning an Orthogonal and Smooth Subspace for Image Classification , 2009, IEEE Signal Processing Letters.

[9]  Yihong Gong,et al.  Locality-constrained Linear Coding for image classification , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[10]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[11]  Yu-Jin Zhang,et al.  Discriminant projection embedding for face and palmprint recognition , 2008, Neurocomputing.

[12]  Er-wei Bai,et al.  Big data: The curse of dimensionality in modeling , 2014, Proceedings of the 33rd Chinese Control Conference.

[13]  Jing-Yu Yang,et al.  Multiscale saliency detection using principle component analysis , 2012, The 2012 International Joint Conference on Neural Networks (IJCNN).