Image Retrieval Based on Statistical and Geometry Features

The most popular approach to large scale image retrieval is based on the bag-of-word (BoW) representation of images. There is an important trick how the statistical and geometry features of BoW are efficiently used. We present a two-step approach for image retrieval with statistical features and spatial geometry information been considered in different step. In the first step, the statistical features of the images’ BoW are achieved to capture the underlying image topic to screen those images. In the second step, images from same topic are ranked using the concept of co-occurrence features (a type of geometry features). Computational cost of the retrieval is reduced because the first step does not consider computing the expensive spatial geometry information and the second step only uses significant features to rank images. Experiments on the Oxford 5K benchmark show that the proposed technique can stably achieve nearly the same result compared with a state-of-the-art retrieval method [1] while only spending about a tenth of the time the method takes.

[1]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[2]  Antonio Torralba,et al.  Spectral Hashing , 2008, NIPS.

[3]  Jiri Matas,et al.  Efficient representation of local geometry for large scale object retrieval , 2009, CVPR.

[4]  Cordelia Schmid,et al.  Aggregating local descriptors into a compact image representation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[5]  Changhu Wang,et al.  Spatial-bag-of-features , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[6]  D. B. Davis,et al.  Intel Corp. , 1993 .

[7]  Tsuhan Chen,et al.  Image retrieval with geometry-preserving visual phrases , 2011, CVPR 2011.

[8]  Cordelia Schmid,et al.  Hamming Embedding and Weak Geometric Consistency for Large Scale Image Search , 2008, ECCV.

[9]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[10]  David Nistér,et al.  Scalable Recognition with a Vocabulary Tree , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[11]  Neil A. Dodgson,et al.  Proceedings Ninth IEEE International Conference on Computer Vision , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[12]  Cordelia Schmid,et al.  Packing bag-of-features , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[13]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[14]  Michael Isard,et al.  Object retrieval with large vocabularies and fast spatial matching , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Michael Isard,et al.  Lost in quantization: Improving particular object retrieval in large scale image databases , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Michael Isard,et al.  Total Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[17]  Fangyuan Wang,et al.  Image retrieval using multiple orders of Geometry-preserving Visual Phrases , 2012, 2012 International Conference on Image Analysis and Signal Processing.