Improving bag-of-words representation with efficient twin feature integration

In recent years, the Bag-of-Words (BoW) model has been widely used in most state-of-the-art large-scale image retrieval systems. However, the standard BoW based systems suffer from low discriminative power of local features as well as quantization errors that significantly affect the retrieval performance. In this paper, twin feature is employed and well combined with two advanced techniques including Hamming Embedding (HE) and Multiple Assignment (MA) to construct a discriminative image retrieval system on BoW representation in an efficient way. Experimental results on two benchmark datasets Oxford5k and Paris6k demonstrate that the proposed technique can greatly refine the visual matching process and enhance the final performance for image retrieval.

[1]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[2]  David Nistér,et al.  Scalable Recognition with a Vocabulary Tree , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[3]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[4]  Lei Wang,et al.  Twin Feature and Similarity Maximal Matching for Image Retrieval , 2015, ICMR.

[5]  Cordelia Schmid,et al.  Improving Bag-of-Features for Large Scale Image Search , 2010, International Journal of Computer Vision.

[6]  Cordelia Schmid,et al.  Hamming Embedding and Weak Geometric Consistency for Large Scale Image Search , 2008, ECCV.

[7]  Hongxun Yao,et al.  Nested-SIFT for Efficient Image Matching and Retrieval , 2013, IEEE MultiMedia.

[8]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[9]  Michael Isard,et al.  Object retrieval with large vocabularies and fast spatial matching , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Michael Isard,et al.  Lost in quantization: Improving particular object retrieval in large scale image databases , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Cordelia Schmid,et al.  Scale & Affine Invariant Interest Point Detectors , 2004, International Journal of Computer Vision.

[12]  Cordelia Schmid,et al.  A contextual dissimilarity measure for accurate and efficient image search , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Changhu Wang,et al.  Spatial-bag-of-features , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.