论文信息 - A neural network catalyzer for multi-dimensional similarity search

A neural network catalyzer for multi-dimensional similarity search

This paper aims at learning a function mapping input vectors to an output space in a way that improves high-dimensional similarity search. As a proxy objective, we design and train a neural network that favors uniformity in the spherical output space, while preserving the neighborhood structure after the mapping. For this purpose, we propose a new regularizer derived from the Kozachenko-Leonenko differential entropy estimator and combine it with a locality-aware triplet loss. Our method operates as a catalyzer for traditional indexing methods such as locality sensitive hashing or iterative quantization, boosting the overall recall. Additionally, the network output distribution makes it possible to leverage structured quantizers with efficient algebraic encoding, in particular spherical lattice quantizers such as the Gosset lattice E8. Our experiments show that this approach is competitive with state-of-the-art methods such as optimized product quantization.

[1] G LoweDavid,et al. Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[2] Antonio Torralba,et al. Small codes and large image databases for recognition , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[3] Jian Sun,et al. Optimized Product Quantization for Approximate Nearest Neighbor Search , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[4] Jeff Johnson,et al. Billion-Scale Similarity Search with GPUs , 2017, IEEE Transactions on Big Data.

[5] Geoffrey E. Hinton,et al. Stochastic Neighbor Embedding , 2002, NIPS.

[6] Victor S. Lempitsky,et al. Efficient Indexing of Billion-Scale Datasets of Deep Descriptors , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.

[8] Piotr Indyk,et al. Approximate nearest neighbors: towards removing the curse of dimensionality , 1998, STOC '98.

[9] L. Györfi,et al. Nonparametric entropy estimation. An overview , 1997 .

[10] Cordelia Schmid,et al. Product Quantization for Nearest Neighbor Search , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11] Patrick Pérez,et al. SuBiC: A Supervised, Structured Binary Code for Image Search , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[12] Wei Liu,et al. Learning to Hash for Indexing Big Data—A Survey , 2015, Proceedings of the IEEE.

[13] Heng Tao Shen,et al. Hashing for Similarity Search: A Survey , 2014, ArXiv.

[14] Laurent Amsaleg,et al. Locality sensitive hashing: A comparison of hash function types and querying mechanisms , 2010, Pattern Recognit. Lett..

[15] Armand Joulin,et al. Unsupervised Learning by Predicting Noise , 2017, ICML.

[16] Cordelia Schmid,et al. Query adaptative locality sensitive hashing , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[17] Svetlana Lazebnik,et al. Iterative quantization: A procrustean approach to learning binary codes , 2011, CVPR 2011.

[18] Lior Wolf,et al. In Defense of Product Quantization , 2017, ArXiv.

[19] Hervé Jégou,et al. Beyond “project and sign” for cosine estimation with binary codes , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[20] Geoffrey E. Hinton,et al. Visualizing Data using t-SNE , 2008 .

[21] Yang Song,et al. Learning Fine-Grained Image Similarity with Deep Ranking , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[22] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.

[23] Matthijs Douze,et al. Searching in one billion vectors: Re-rank with source coding , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[24] Samy Bengio,et al. Large Scale Online Learning of Image Similarity Through Ranking , 2009, J. Mach. Learn. Res..

[25] David G. Lowe,et al. Scalable Nearest Neighbor Algorithms for High Dimensional Data , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26] Tim Kraska,et al. The Case for Learned Index Structures , 2018 .

[27] J. Snyders,et al. Efficient decoding of the Gosset, Coxeter-Todd and the Barnes-Wall lattices , 1998, Proceedings. 1998 IEEE International Symposium on Information Theory (Cat. No.98CH36252).

[28] Jiwen Lu,et al. Deep hashing for compact binary codes learning , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[29] Eric O. Postma,et al. Dimensionality Reduction: A Comparative Review , 2008 .

[30] Yury A. Malkov,et al. Efficient and Robust Approximate Nearest Neighbor Search Using Hierarchical Navigable Small World Graphs , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31] F ROSENBLATT,et al. The perceptron: a probabilistic model for information storage and organization in the brain. , 1958, Psychological review.

[32] Geoffrey E. Hinton,et al. Regularizing Neural Networks by Penalizing Confident Output Distributions , 2017, ICLR.

[33] Patrick Gallinari,et al. Ranking with ordered weighted pairwise classification , 2009, ICML '09.

[34] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[35] Victor Lempitsky,et al. Additive Quantization for Extreme Vector Compression , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[36] Patrick Pérez,et al. Approximate Search with Quantized Sparse Representations , 2016, ECCV.

[37] Matthijs Douze,et al. Polysemous Codes , 2016, ECCV.

[38] Nicole Immorlica,et al. Locality-sensitive hashing scheme based on p-stable distributions , 2004, SCG '04.

[39] Carl Doersch,et al. Tutorial on Variational Autoencoders , 2016, ArXiv.

[40] Kai Li,et al. Image similarity search with compact data structures , 2004, CIKM '04.

[41] Piotr Indyk,et al. Similarity Search in High Dimensions via Hashing , 1999, VLDB.

[42] Moses Charikar,et al. Similarity estimation techniques from rounding algorithms , 2002, STOC '02.

[43] Jinhui Tang,et al. Sparse composite quantization , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[44] Alexandr Andoni,et al. Near-Optimal Hashing Algorithms for Approximate Nearest Neighbor in High Dimensions , 2006, 2006 47th Annual IEEE Symposium on Foundations of Computer Science (FOCS'06).

[45] Matthijs Douze,et al. How should we evaluate supervised hashing? , 2016, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).