Core-set based product quantization for large-scale multimedia search

Approximate nearest neighbor (ANN) search has been well-studied and weightily applied in the field of large scale multimedia search. In recent years, Product Quantization (PQ) based methods have achieved greate success in ANN search. However, both the time and space complexity for PQ learning are linear to the data amount, making it infeasible for large-scale datasets. In this paper, we propose a more efficient learning strategy for product quantization. Instead of learning PQ on the whole dataset, we construct a small set of representative elements, from which PQ can be approximated learned with much lower computation overhead. Extensive experiments are conducted on three large-scale datasets. The codebook learning process of our approach can achieve up to tens times speed-up than existing PQ-based methods, while the memory consumption is only logarithmic to the number of data points. At the same time, the ANN search accuracy of our method is still comparable with state-of-the-art methods on all datasets.

[1]  Dan Feldman,et al.  Turning big data into tiny data: Constant-size coresets for k-means, PCA and projective clustering , 2013, SODA.

[2]  Michael Langberg,et al.  A unified framework for approximating and clustering data , 2011, STOC.

[3]  Christian Sohler,et al.  StreamKM++: A clustering algorithm for data streams , 2010, JEAL.

[4]  Cordelia Schmid,et al.  Product Quantization for Nearest Neighbor Search , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Christian Sohler,et al.  A fast k-means implementation using coresets , 2006, SCG '06.

[6]  Charles R. Johnson,et al.  Matrix analysis , 1985, Statistical Inference for Engineers and Data Scientists.

[7]  Antonio Torralba,et al.  Ieee Transactions on Pattern Analysis and Machine Intelligence 1 80 Million Tiny Images: a Large Dataset for Non-parametric Object and Scene Recognition , 2022 .

[8]  Antonio Torralba,et al.  Spectral Hashing , 2008, NIPS.

[9]  Yingyu Liang,et al.  Distributed k-Means and k-Median Clustering on General Topologies , 2013, NIPS 2013.

[10]  Shih-Fu Chang,et al.  Spherical hashing , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Jian Sun,et al.  Optimized Product Quantization for Approximate Nearest Neighbor Search , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Nicole Immorlica,et al.  Locality-sensitive hashing scheme based on p-stable distributions , 2004, SCG '04.

[13]  Matthijs Douze,et al.  Searching in one billion vectors: Re-rank with source coding , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[14]  Svetlana Lazebnik,et al.  Asymmetric Distances for Binary Embeddings , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Eli Shechtman,et al.  In defense of Nearest-Neighbor based image classification , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[17]  Svetlana Lazebnik,et al.  Iterative quantization: A procrustean approach to learning binary codes , 2011, CVPR 2011.