Scene Classification Using Generalized Local Correlation

Feature extraction is an important issue for generic image recognition. In recent years, methods based on the bag-of-keypoints technique have been quite successful and are widely used. However, this technique requires the quantization of local patches to build visual words as a preprocessing step, the computational cost of which is enormous. On the other hand, methods based on global image features have been used for a long time. Because global image features can be extracted rapidly, it is relatively easy to use them in practical large-scale systems. However, the performance of global feature methods is usually poor compared to bag-of-keypoints. Therefore, it is essential to develop a more powerful scheme of global feature extraction for achieving practical applications of generic image recognition. In this paper, we show that we can boost the performance of global image features by considering the correlations of local features in addition to the mean. We experimentally verify the effectiveness of our method using standard scene classification benchmark datasets.

[1]  Andrew Zisserman,et al.  Scene Classification Using a Hybrid Generative/Discriminative Approach , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[3]  Yong Wang,et al.  Conditional Random Field for Natural Scene Categorization , 2007, BMVC.

[4]  Gabriela Csurka,et al.  Visual categorization with bags of keypoints , 2002, eccv 2004.

[5]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[6]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Frédéric Jurie,et al.  Sampling Strategies for Bag-of-Features Image Classification , 2006, ECCV.

[8]  Nicolas Hervé,et al.  Image annotation: which approach for realistic databases? , 2007, CIVR '07.

[9]  Sergey Ioffe,et al.  Probabilistic Linear Discriminant Analysis , 2006, ECCV.

[10]  Pietro Perona,et al.  A Bayesian hierarchical model for learning natural scene categories , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[11]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.