论文信息 - Cross-Indexing of Binary SIFT Codes for Large-Scale Image Search

Cross-Indexing of Binary SIFT Codes for Large-Scale Image Search

In recent years, there has been growing interest in mapping visual features into compact binary codes for applications on large-scale image collections. Encoding high-dimensional data as compact binary codes reduces the memory cost for storage. Besides, it benefits the computational efficiency since the computation of similarity can be efficiently measured by Hamming distance. In this paper, we propose a novel flexible scale invariant feature transform (SIFT) binarization (FSB) algorithm for large-scale image search. The FSB algorithm explores the magnitude patterns of SIFT descriptor. It is unsupervised and the generated binary codes are demonstrated to be dispreserving. Besides, we propose a new searching strategy to find target features based on the cross-indexing in the binary SIFT space and original SIFT space. We evaluate our approach on two publicly released data sets. The experiments on large-scale partial duplicate image retrieval system demonstrate the effectiveness and efficiency of the proposed algorithm.

[1] Shih-Fu Chang,et al. Spherical hashing , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[2] Gary R. Bradski,et al. ORB: An efficient alternative to SIFT or SURF , 2011, 2011 International Conference on Computer Vision.

[3] Qi Tian,et al. Latent visual context learning for web image applications , 2011, Pattern Recognit..

[4] O. Chum,et al. Geometric min-Hashing: Finding a (thick) needle in a haystack , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[5] Qi Tian,et al. Lp-Norm IDF for Large Scale Image Search , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[6] Qi Tian,et al. Embedding spatial context information into inverted filefor large-scale image retrieval , 2012, ACM Multimedia.

[7] Tsuhan Chen,et al. Image retrieval with geometry-preserving visual phrases , 2011, CVPR 2011.

[8] Jiri Matas,et al. Robust wide-baseline stereo from maximally stable extremal regions , 2004, Image Vis. Comput..

[9] Jon Louis Bentley,et al. K-d trees for semidynamic point sets , 1990, SCG '90.

[10] Michael Isard,et al. Bundling features for large scale partial-duplicate web image search , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[11] Michael Isard,et al. Object retrieval with large vocabularies and fast spatial matching , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[12] Shengjin Wang,et al. Visual Phraselet: Refining Spatial Constraints for Large Scale Image Search , 2013, IEEE Signal Processing Letters.

[13] Qi Tian,et al. SIFT match verification by geometric coding for large-scale partial-duplicate web image search , 2013, TOMCCAP.

[14] Roland Siegwart,et al. BRISK: Binary Robust invariant scalable keypoints , 2011, 2011 International Conference on Computer Vision.

[15] G LoweDavid,et al. Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[16] Cordelia Schmid,et al. An Affine Invariant Interest Point Detector , 2002, ECCV.

[17] O. Chum,et al. ENHANCING RANSAC BY GENERALIZED MODEL OPTIMIZATION Onďrej Chum, Jǐ , 2003 .

[18] Shih-Fu Chang,et al. Mobile product search with Bag of Hash Bits and boundary reranking , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[19] Jinhui Tang,et al. Strong geometrical consistency in large scale partial-duplicate image search , 2013, ACM Multimedia.

[20] Tat-Seng Chua,et al. Semantic-Gap-Oriented Active Learning for Multilabel Image Annotation , 2012, IEEE Transactions on Image Processing.

[21] Qi Tian,et al. Towards Codebook-Free: Scalable Cascaded Hashing for Mobile Image Search , 2014, IEEE Transactions on Multimedia.

[22] Antonio Torralba,et al. Spectral Hashing , 2008, NIPS.

[23] Christopher Hunt,et al. Notes on the OpenSURF Library , 2009 .

[24] Changhu Wang,et al. Spatial-bag-of-features , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[25] Qi Tian,et al. Contextual Hashing for Large-Scale Image Search , 2014, IEEE Transactions on Image Processing.

[26] Qi Tian,et al. Scalar quantization for large scale image search , 2012, ACM Multimedia.

[27] Ming Yang,et al. Contextual weighting for vocabulary tree based image retrieval , 2011, 2011 International Conference on Computer Vision.

[28] Shiliang Zhang,et al. Edge-SIFT: Discriminative Binary Descriptor for Scalable Partial-Duplicate Mobile Search , 2013, IEEE Transactions on Image Processing.

[29] Gang Hua,et al. Building contextual visual vocabulary for large-scale image applications , 2010, ACM Multimedia.

[30] Chong-Wah Ngo,et al. Fast Semantic Diffusion for Large-Scale Context-Based Image and Video Annotation , 2012, IEEE Transactions on Image Processing.

[31] Moses Charikar,et al. Similarity estimation techniques from rounding algorithms , 2002, STOC '02.

[32] Andrew Zisserman,et al. Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[33] David Nistér,et al. Scalable Recognition with a Vocabulary Tree , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[34] Kristen Grauman,et al. Kernelized locality-sensitive hashing for scalable image search , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[35] Jiri Matas,et al. Total recall II: Query expansion revisited , 2011, CVPR 2011.

[36] Andrew Zisserman,et al. Three things everyone should know to improve object retrieval , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[37] Shuicheng Yan,et al. Inferring semantic concepts from community-contributed images and noisy tags , 2009, ACM Multimedia.

[38] Shih-Fu Chang,et al. Query-Adaptive Image Search With Hash Codes , 2013, IEEE Transactions on Multimedia.

[39] Pierre Vandergheynst,et al. FREAK: Fast Retina Keypoint , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.