Scene Categorization from Tiny Images

Scene categorization can play a crucial role in applications such as image retrieval and object recognition. It seems reasonable that scene category labels like indoor, outdoor, kitchen, or beach can be assigned to images at lower spatial and radiometric resolution than may be required for categorization and localization of objects in the scenes. This paper presents a systematic investigation of resolution requirements for scene categorization. We introduce a novel, challenging database of tiny images for 20 dierent scene categories, present experimental results on human categorization performance, as well as an algorithm to eciently learn and perform scene categorization. Our model’s best recall rate of 46% significantly outperforms human recall of 30% on 32 ◊ 32 images. These results should encourage future research in the use of categorization on low resolution thumbnails as an important preprocessing step to model context in categorization, recognition and localization.

[1]  Andrew Zisserman,et al.  An Affine Invariant Salient Region Detector , 2004, ECCV.

[2]  Jitendra Malik,et al.  Shape matching and object recognition using low distortion correspondences , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[3]  Jitendra Malik,et al.  SVM-KNN: Discriminative Nearest Neighbor Classification for Visual Category Recognition , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[4]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[5]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[6]  Trevor Darrell,et al.  The pyramid match kernel: discriminative classification with sets of image features , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[7]  Frédéric Jurie,et al.  Sampling Strategies for Bag-of-Features Image Classification , 2006, ECCV.

[8]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[9]  Trevor Darrell,et al.  Fast pose estimation with parameter-sensitive hashing , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[10]  Antonio Torralba,et al.  Building the gist of a scene: the role of global image features in recognition. , 2006, Progress in brain research.

[11]  David G. Lowe,et al.  Perceptual Organization and Visual Recognition , 2012 .

[12]  Trevor Darrell,et al.  Pyramid Match Kernels: Discriminative Classification with Sets of Image Features (version 2) , 2006 .

[13]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[14]  Svetlana Lazebnik,et al.  Learning Nearest-Neighbor Quantizers from Labeled Data by Information Loss Minimization , 2007, AISTATS.

[15]  Cor J. Veenman,et al.  Robust Scene Categorization by Learning Image Statistics in Context , 2006, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06).