Hierarchical feature concatenation-based kernel sparse representations for image categorization

In order to obtain improved performance in complicated visual categorization tasks, considerable research has adopted multiple kernel learning based on dozens of different features. However, it is a complex process that needs to extract a multitude of features and seeks the optimal combination of multiple kernels. Inspired by the key idea of hierarchical learning, in this paper, we propose to find sparse representation based on feature concatenation using hierarchical kernel orthogonal matching pursuit (HKOMP). In addition to commonly used spatial pyramid feature for kernel representation, our method only employs one type of generic image feature, i.e., p.d.f gradient-based orientation histogram for concatenation of sparse codes. Next, the resulting concatenated features kernelized with widely used Gaussian radial basis kernel function form compact sparse representations in the second layer for linear support vector machine. HKOMP algorithm combines the advantages of building image representations layer-by-layer and kernel learning. Several publicly available image datasets are used to evaluate the presented approach and empirical results for various datasets show that the proposed scheme outperforms many kernel learning based and other competitive image categorization algorithms.

[1]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[2]  Ting Wang,et al.  Kernel Sparse Representation-Based Classifier , 2012, IEEE Transactions on Signal Processing.

[3]  Nello Cristianini,et al.  Kernel Methods for Pattern Analysis , 2003, ICTAI.

[4]  Xuelong Li,et al.  Image Classification With Densely Sampled Image Windows and Generalized Adaptive Multiple Kernel Learning , 2015, IEEE Transactions on Cybernetics.

[5]  Guizhong Liu,et al.  Localized Multiple Kernel Learning Via Sample-Wise Alternating Optimization , 2014, IEEE Transactions on Cybernetics.

[6]  Jun Sun,et al.  Fast Kernel Sparse Representation , 2011, 2011 International Conference on Digital Image Computing: Techniques and Applications.

[7]  Hervé Glotin,et al.  Cooperative Sparse Representation in Two Opposite Directions for Semi-Supervised Image Annotation , 2012, IEEE Transactions on Image Processing.

[8]  Karthikeyan Natesan Ramamurthy,et al.  Multiple Kernel Sparse Representations for Supervised and Unsupervised Learning , 2013, IEEE Transactions on Image Processing.

[9]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[10]  Guizhong Liu,et al.  Probability-Confidence-Kernel-Based Localized Multiple Kernel Learning With $l_{p}$ Norm , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[11]  Matti Pietikäinen,et al.  Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Dieter Fox,et al.  Multipath Sparse Coding Using Hierarchical Matching Pursuit , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Bingyuan Liu,et al.  Regularized Hierarchical Feature Learning with Non-negative Sparsity and Selectivity for Image Classification , 2014, 2014 22nd International Conference on Pattern Recognition.

[14]  Sebastian Nowozin,et al.  On feature combination for multiclass object classification , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[15]  Xi Liu,et al.  A co-boost framework for learning object categories from Google Images with 1st and 2nd order features , 2012, The Visual Computer.

[16]  Yao Zhao,et al.  Extracting shared subspace incrementally for multi-label image classification , 2013, The Visual Computer.

[17]  Eli Shechtman,et al.  Matching Local Self-Similarities across Images and Videos , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[18]  Jean Ponce,et al.  Learning mid-level features for recognition , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[19]  Xuelong Li,et al.  Spectral-Spatial Constraint Hyperspectral Image Classification , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[20]  Yves Grandvalet,et al.  More efficiency in multiple kernel learning , 2007, ICML '07.

[21]  Chunhong Pan,et al.  Visual Tracking Via Kernel Sparse Representation With Multikernel Fusion , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[22]  Wen Gao,et al.  Group-Sensitive Multiple Kernel Learning for Object Recognition , 2012, IEEE Transactions on Image Processing.

[23]  Manik Varma,et al.  Learning The Discriminative Power-Invariance Trade-Off , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[24]  Shengyong Chen,et al.  Efficient kernel discriminative common vectors for classification , 2014, The Visual Computer.

[25]  S. V. N. Vishwanathan,et al.  SPF-GMKL: generalized multiple kernel learning with a million kernels , 2012, KDD.

[26]  Junqing Yu,et al.  On-Device Mobile Visual Location Recognition by Integrating Vision and Inertial Sensors , 2013, IEEE Transactions on Multimedia.

[27]  Xu-Dong Zhang,et al.  Learning to Rank from Noisy Data , 2015, ACM Trans. Intell. Syst. Technol..

[28]  James M. Rehg,et al.  Beyond the Euclidean distance: Creating effective visual codebooks using the Histogram Intersection Kernel , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[29]  Andrew Zisserman,et al.  Automated Flower Classification over a Large Number of Classes , 2008, 2008 Sixth Indian Conference on Computer Vision, Graphics & Image Processing.

[30]  Zhong Jin,et al.  Kernel sparse representation based classification , 2012, Neurocomputing.

[31]  Jitendra Malik,et al.  Geometric blur for template matching , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[32]  Liang-Tien Chia,et al.  Sparse Representation With Kernels , 2013, IEEE Transactions on Image Processing.

[33]  Meng Jian,et al.  Class-Discriminative Kernel Sparse Representation-Based Classification Using Multi-Objective Optimization , 2013, IEEE Transactions on Signal Processing.

[34]  Andrew Zisserman,et al.  Multiple kernels for object detection , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[35]  Tinne Tuytelaars,et al.  Dense interest points , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[36]  Muhammad Arif,et al.  Cortex-inspired multilayer hierarchy based object detection system using PHOG descriptors and ensemble classification , 2015, The Visual Computer.

[37]  Rama Chellappa,et al.  Multiple Kernel Learning for Sparse Representation-Based Classification , 2014, IEEE Transactions on Image Processing.

[38]  Junqing Yu,et al.  Efficient BOF Generation and Compression for On-Device Mobile Visual Location Recognition , 2014, IEEE MultiMedia.

[39]  Rama Chellappa,et al.  Design of Non-Linear Kernel Dictionaries for Object Recognition , 2013, IEEE Transactions on Image Processing.

[40]  Andrew Zisserman,et al.  Representing shape with a spatial pyramid kernel , 2007, CIVR '07.

[41]  Fabio Aiolli,et al.  EasyMKL: a scalable multiple kernel learning algorithm , 2015, Neurocomputing.

[42]  Yongqiang Zhao,et al.  Hyperspectral Image Denoising via Sparse Representation and Low-Rank Constraint , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[43]  Shuicheng Yan,et al.  Visual classification with multi-task joint sparse representation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[44]  Nicolas Pinto,et al.  Why is Real-World Visual Object Recognition Hard? , 2008, PLoS Comput. Biol..

[45]  James Hays,et al.  SUN attribute database: Discovering, annotating, and recognizing scene attributes , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[46]  John D. Lafferty,et al.  Learning image representations from the pixel level via hierarchical sparse coding , 2011, CVPR 2011.

[47]  Krista A. Ehinger,et al.  SUN database: Large-scale scene recognition from abbey to zoo , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[48]  Ethem Alpaydin,et al.  Multiple Kernel Learning Algorithms , 2011, J. Mach. Learn. Res..

[49]  Rong Jin,et al.  Multiple Kernel Learning for Visual Object Recognition: A Review , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[50]  Rama Chellappa,et al.  Kernel dictionary learning , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[51]  Yue Gao,et al.  3-D Object Retrieval and Recognition With Hypergraph Analysis , 2012, IEEE Transactions on Image Processing.

[52]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[53]  Gunnar Rätsch,et al.  Large Scale Multiple Kernel Learning , 2006, J. Mach. Learn. Res..

[54]  Dieter Fox,et al.  Hierarchical Matching Pursuit for Image Classification: Architecture and Fast Algorithms , 2011, NIPS.

[55]  Takumi Kobayashi,et al.  BFO Meets HOG: Feature Extraction Based on Histograms of Oriented p.d.f. Gradients for Image Classification , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[56]  Weidong Zhou,et al.  Iris recognition based on a novel variation of local binary pattern , 2014, The Visual Computer.

[57]  Fatih Murat Porikli,et al.  Human Detection via Classification on Riemannian Manifolds , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[58]  Rongrong Ji,et al.  On-Device Mobile Landmark Recognition Using Binarized Descriptor with Multifeature Fusion , 2015, ACM Trans. Intell. Syst. Technol..

[59]  Shang-Hong Lai,et al.  Learning Component-Level Sparse Representation for Image and Video Categorization , 2013, IEEE Transactions on Image Processing.

[60]  Ling Shao,et al.  Learning Object-to-Class Kernels for Scene Classification , 2014, IEEE Transactions on Image Processing.

[61]  Xuelong Li,et al.  Visual-Textual Joint Relevance Learning for Tag-Based Social Image Search , 2013, IEEE Transactions on Image Processing.

[62]  Xindong Wu,et al.  3-D Object Retrieval With Hausdorff Distance Learning , 2014, IEEE Transactions on Industrial Electronics.

[63]  Hongbin Zha,et al.  Supervised Kernel Descriptors for Visual Recognition , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.