Efficient re-ranking in vocabulary tree based image retrieval

Image retrieval using a large vocabulary tree of local invariant features can efficiently handle databases with millions of images. However, a costly re-ranking step is generally required to re-order the top candidate images to enforce spatial consistency among local features. In this paper, we propose an efficient re-ranking approach which takes advantage of the vocabulary tree quantization to conduct fast feature matching. The proposed re-ranking algorithm involves no operations in the high-dimensional feature space and does not assume a global transform between a pair of images, thus, it not only dramatically reduces the computational complexity but also improves the retrieval precision, which is validated using 1.26 million images in the public ImageNet dataset and the San Francisco Landmark dataset including 1.7 million images.

[1]  Antonio Torralba,et al.  Spectral Hashing , 2008, NIPS.

[2]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[3]  David Nistér,et al.  Scalable Recognition with a Vocabulary Tree , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[4]  Myron Flickner,et al.  Query by Image and Video Content , 1995 .

[5]  Ming Yang,et al.  Contextual weighting for vocabulary tree based image retrieval , 2011, 2011 International Conference on Computer Vision.

[6]  Cordelia Schmid,et al.  Improving Bag-of-Features for Large Scale Image Search , 2010, International Journal of Computer Vision.

[7]  Laurent Itti,et al.  Ieee Transactions on Pattern Analysis and Machine Intelligence 1 Rapid Biologically-inspired Scene Classification Using Features Shared with Visual Attention , 2022 .

[8]  Thomas S. Huang,et al.  Relevance feedback: a power tool for interactive content-based image retrieval , 1998, IEEE Trans. Circuits Syst. Video Technol..

[9]  Qi Tian,et al.  Spatial coding for large scale partial-duplicate web image search , 2010, ACM Multimedia.

[10]  Xin Chen,et al.  City-scale landmark identification on mobile devices , 2011, CVPR 2011.

[11]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[12]  Antonio Torralba,et al.  Small codes and large image databases for recognition , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Michael Isard,et al.  Object retrieval with large vocabularies and fast spatial matching , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[14]  Jia Deng,et al.  A large-scale hierarchical image database , 2009, CVPR 2009.

[15]  Fei-Fei Li,et al.  ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Bernd Girod,et al.  Fast geometric re-ranking for image-based retrieval , 2010, 2010 IEEE International Conference on Image Processing.

[17]  Andrea Vedaldi,et al.  Vlfeat: an open and portable library of computer vision algorithms , 2010, ACM Multimedia.

[18]  Dragutin Petkovic,et al.  Query by Image and Video Content: The QBIC System , 1995, Computer.