Information Gain Product Quantization for Image Retrieval

Approximate nearest neighbor (ANN) research has become the key method of content-based image retrieval. Product quantization (PQ) is a popular method for approximate nearest neighbor search. However, the informativeness may not be uniformly distributed across the subspace in product quantization. Allocating the same number of bits for the subspace will bring large quantization loss. To address this issue, in this paper, we propose an improved product quantization method, which adaptively allocates different numbers of bits to subspace via information gain. The key of our method is to find the optimal bit allocation strategy. To this end, our method maximizes the summation of information gain for each subspace, and also takes the limited length of codes into account. Experimental results on two large-scale bench-marks GIST1M and 22K-LabelMe demonstrate that our approach significantly outperforms state-of-the-art methods in terms of accuracy.

[1]  Antonio Torralba,et al.  Spectral Hashing , 2008, NIPS.

[2]  Zhi Zeng,et al.  Adaptive bit allocation hashing for approximate nearest neighbor search , 2015, Neurocomputing.

[3]  Jonathon Shlens,et al.  Fast, Accurate Detection of 100,000 Object Classes on a Single Machine , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Gang Chen,et al.  Adaptive Quantization for Hashing: An Information-Based Approach to Learning Binary Codes , 2014, SDM.

[5]  Svetlana Lazebnik,et al.  Iterative quantization: A procrustean approach to learning binary codes , 2011, CVPR 2011.

[6]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[7]  Cordelia Schmid,et al.  Product Quantization for Nearest Neighbor Search , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Rongrong Ji,et al.  Supervised hashing with kernels , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Cordelia Schmid,et al.  Aggregating local descriptors into a compact image representation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[10]  Antonio Torralba,et al.  LabelMe: A Database and Web-Based Tool for Image Annotation , 2008, International Journal of Computer Vision.

[11]  David Suter,et al.  A General Two-Step Approach to Learning-Based Hashing , 2013, 2013 IEEE International Conference on Computer Vision.

[12]  Antonio Torralba,et al.  Small codes and large image databases for recognition , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Nicole Immorlica,et al.  Locality-sensitive hashing scheme based on p-stable distributions , 2004, SCG '04.

[14]  Daniel C. Laughlin,et al.  The intrinsic dimensionality of plant traits and its relevance to community assembly , 2014 .

[15]  Zhi-Hua Zhou,et al.  Learning to hash for big data: Current status and future trends , 2015 .

[16]  R Bellman,et al.  On the Theory of Dynamic Programming. , 1952, Proceedings of the National Academy of Sciences of the United States of America.