Discriminative Spatial Tree for Image Classification

Spatial pyramid is a very popular method for preserving the spatial information of local features, which partitions an image into multiple blocks at different resolution levels. Nevertheless, the strategy for partitioning an image is designed by hand and same for all the codewords in a codebook. To address this problem, we propose a novel partition method named discriminative spatial tree (ST), and implement it from two different viewpoints: class and image. Discriminative ST builds one spatial tree for each codeword, and each node in each tree corresponds to one image region. For better performance evaluation, we also introduce a simplified coding scheme. Experimental results on two challenging datasets show that our simplified coding scheme leads to comparable results to some sophisticated ones, and discriminative ST can achieve better classification performance than spatial pyramid.

[1]  Fereshteh Sadeghi,et al.  Latent Pyramidal Regions for Recognizing Scenes , 2012, ECCV.

[2]  Jian Yu,et al.  A boosting approach to learning receptive fields for scene categorization , 2013, 2013 IEEE International Conference on Image Processing.

[3]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[4]  Pietro Perona,et al.  Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[5]  Yi Yang,et al.  Weakly supervised sparse coding with geometric consistency pooling , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Bingbing Ni,et al.  Geometric ℓp-norm feature pooling for image classification , 2011, CVPR 2011.

[7]  Liang Wang,et al.  Multiple spatial pooling for visual object recognition , 2014, Neurocomputing.

[8]  Trevor Darrell,et al.  Beyond spatial pyramids: Receptive field learning for pooled image features , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Yihong Gong,et al.  Locality-constrained Linear Coding for image classification , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[10]  Lei Wang,et al.  In defense of soft-assignment coding , 2011, 2011 International Conference on Computer Vision.

[11]  Yasuo Kuniyoshi,et al.  Discriminative spatial pyramid , 2011, CVPR 2011.

[12]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[13]  Cordelia Schmid,et al.  Discriminative spatial saliency for image classification , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[14]  Tieniu Tan,et al.  Spatial Graph for Image Classification , 2012, ACCV.