Deep Learning for Approximate Nearest Neighbour Search: A Survey and Future Directions

Approximate nearest neighbour search (ANNS) in high-dimensional space is an essential and fundamental operation in many applications from many domains such as multimedia database, information retrieval and computer vision. With the rapidly growing volume of data and the dramatically increasing demands of users, traditional heuristic-based ANNS solutions have been facing great challenges in terms of both efficiency and accuracy. Inspired by the recent successes of deep learning in many fields, substantial efforts have been devoted to applying deep learning techniques to ANNS for learning to index and learning to search, resulting in numerous algorithms that achieve state-of-the-art performance compared with conventional methods. In this survey paper, we comprehensively review the different types of deep learning-based ANNS methods according to two learning paradigms: learning to index and learning to search. We provide a comprehensive overview and analysis of these methods in a systematic manner. Based on the overview, we point out that end-to-end learning will be a new and promising research direction for deep learning-based ANNS, i.e., applying deep learning techniques to jointly learn the indexing and searching together, such that the underlying knowledge learned from data can directly contribute to the final searching performance. Finally, we conduct experiments and provide general performance analyses for the representative deep learning-based ANNS algorithms.

[1]  Hai Jin,et al.  Milvus: A Purpose-Built Vector Data Management System , 2021, SIGMOD Conference.

[2]  Xiaoliang Xu,et al.  A Comprehensive Survey and Experimental Comparison of Graph-Based Approximate Nearest Neighbor Search , 2021, Proc. VLDB Endow..

[3]  Dan Meng,et al.  Deep Unsupervised Hybrid-similarity Hadamard Hashing , 2020, ACM Multimedia.

[4]  Xin Luo,et al.  Supervised Hierarchical Deep Hashing for Cross-Modal Retrieval , 2020, ACM Multimedia.

[5]  Minjia Zhang,et al.  Improving Approximate Nearest Neighbor Search through Learned Adaptive Early Termination , 2020, SIGMOD Conference.

[6]  Jinhui Tang,et al.  Weakly-supervised Semantic Guided Hashing for Social Image Retrieval , 2020, International Journal of Computer Vision.

[7]  Ivor W. Tsang,et al.  I/O Efficient Approximate Nearest Neighbour Search based on Learned Functions , 2020, 2020 IEEE 36th International Conference on Data Engineering (ICDE).

[8]  Lukasz Kaiser,et al.  Reformer: The Efficient Transformer , 2020, ICLR.

[9]  Artem Babenko,et al.  Towards Similarity Graphs Constructed by Deep Reinforcement Learning , 2019, ArXiv.

[10]  Jun Zhou,et al.  Deep Hashing by Discriminating Hard Examples , 2019, ACM Multimedia.

[11]  Jiwen Lu,et al.  Neighborhood Preserving Hashing for Scalable Video Retrieval , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[12]  Tao Mei,et al.  Deep Collaborative Embedding for Social Image Understanding , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Francis E. H. Tay,et al.  Central Similarity Quantization for Efficient Image and Video Retrieval , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Jingyuan Wang,et al.  Empowering A* Search Algorithms with Neural Networks for Personalized Route Recommendation , 2019, KDD.

[15]  Junjie Chen,et al.  Similarity Preserving Deep Asymmetric Quantization for Image Retrieval , 2019, AAAI.

[16]  Ling Shao,et al.  Deep Self-Taught Hashing for Image Retrieval. , 2019, IEEE transactions on cybernetics.

[17]  Jing Liu,et al.  Deep Incremental Hashing Network for Efficient Image Retrieval , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  Artem Babenko,et al.  Learning to Route in Similarity Graphs , 2019, ICML.

[19]  Dacheng Tao,et al.  DistillHash: Unsupervised Deep Hashing by Distilling Data Pairs , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Jinhui Tang,et al.  Deep Semantic-Preserving Ordinal Hashing for Cross-Modal Similarity Search , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[21]  Wei Wang,et al.  I-LSH: I/O Efficient c-Approximate Nearest Neighbor Search in High-Dimensional Space , 2019, 2019 IEEE 35th International Conference on Data Engineering (ICDE).

[22]  Piotr Indyk,et al.  Learning Space Partitions for Nearest Neighbor Search , 2019, ICLR.

[23]  Jinhui Tang,et al.  Deep Semantic Multimodal Hashing Network for Scalable Image-Text and Video-Text Retrievals , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[24]  Heng Tao Shen,et al.  Unsupervised Deep Hashing with Similarity-Adaptive and Discrete Optimization , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Geoffrey J. Gordon,et al.  Learning Beam Search Policies via Imitation Learning , 2018, NeurIPS.

[26]  Bin Liu,et al.  Deep Triplet Quantization , 2018, ACM Multimedia.

[27]  Philip S. Yu,et al.  Deep Priority Hashing , 2018, ACM Multimedia.

[28]  Jiliang Tang,et al.  Signed Graph Convolutional Networks , 2018, 2018 IEEE International Conference on Data Mining (ICDM).

[29]  Chih-Yi Chiu,et al.  Learning to Index for Nearest Neighbor Search , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30]  Aditya Bhaskara,et al.  Distributed Clustering via LSH Based Data Partitioning , 2018, ICML.

[31]  Wei Liu,et al.  Semantic Structure-based Unsupervised Deep Hashing , 2018, IJCAI.

[32]  Alexandr Andoni,et al.  Data-dependent hashing via nonlinear spectral gaps , 2018, STOC.

[33]  Jianmin Wang,et al.  Deep Cauchy Hashing for Hamming Space Retrieval , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[34]  Jianmin Wang,et al.  HashGAN: Deep Learning to Hash with Pair Conditional Wasserstein GAN , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[35]  Jinhui Tang,et al.  Deep Ordinal Hashing With Spatial Attention , 2018, IEEE Transactions on Image Processing.

[36]  Dik Lun Lee,et al.  Billion-scale Commodity Embedding for E-commerce Recommendation in Alibaba , 2018, KDD.

[37]  Stan Sclaroff,et al.  Hashing with Mutual Information , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38]  Lawrence V. Snyder,et al.  Reinforcement Learning for Solving the Vehicle Routing Problem , 2018, NeurIPS.

[39]  Lior Wolf,et al.  End-To-End Supervised Product Quantization for Image Search and Retrieval , 2017, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[40]  Hong Cheng,et al.  PQBF: I/O-Efficient Approximate Nearest Neighbor Search by Product Quantization , 2017, CIKM.

[41]  Hanqing Lu,et al.  Pseudo Label based Unsupervised Deep Discriminative Hashing for Image Retrieval , 2017, ACM Multimedia.

[42]  Yang Yang,et al.  Deep Asymmetric Pairwise Hashing , 2017, ACM Multimedia.

[43]  Bingbing Ni,et al.  Deep Progressive Hashing for Image Retrieval , 2017, IEEE Transactions on Multimedia.

[44]  Demis Hassabis,et al.  Mastering the game of Go without human knowledge , 2017, Nature.

[45]  Jure Leskovec,et al.  Representation Learning on Graphs: Methods and Applications , 2017, IEEE Data Eng. Bull..

[46]  Marie-Francine Moens,et al.  Fast and Flexible Top-k Similarity Search on Large Networks , 2017, ACM Trans. Inf. Syst..

[47]  Fan Yang,et al.  LoSHa: A General Framework for Scalable Locality Sensitive Hashing , 2017, SIGIR.

[48]  Wu-Jun Li,et al.  Deep Discrete Supervised Hashing , 2017, IEEE Transactions on Image Processing.

[49]  Deng Cai,et al.  Fast Approximate Nearest Neighbor Search With The Navigating Spreading-out Graph , 2017, Proc. VLDB Endow..

[50]  Wu-Jun Li,et al.  Asymmetric Deep Supervised Hashing , 2017, AAAI.

[51]  Jure Leskovec,et al.  Inductive Representation Learning on Large Graphs , 2017, NIPS.

[52]  Tieniu Tan,et al.  Deep Supervised Discrete Hashing , 2017, NIPS.

[53]  Kun He,et al.  Hashing as Tie-Aware Learning to Rank , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[54]  Elias Boutros Khalil,et al.  Learning Combinatorial Optimization Algorithms over Graphs , 2017, NIPS.

[55]  Aaron C. Courville,et al.  Improved Training of Wasserstein GANs , 2017, NIPS.

[56]  Jeff Johnson,et al.  Billion-Scale Similarity Search with GPUs , 2017, IEEE Transactions on Big Data.

[57]  Philip S. Yu,et al.  HashNet: Deep Learning to Hash by Continuation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[58]  Xuelong Li,et al.  Learning k for kNN Classification , 2017, ACM Trans. Intell. Syst. Technol..

[59]  Xuemin Lin,et al.  Approximate Nearest Neighbor Search on High Dimensional Data — Experiments, Analyses, and Improvement , 2016, IEEE Transactions on Knowledge and Data Engineering.

[60]  Qi Tian,et al.  Linear Distance Preserving Pseudo-Supervised and Unsupervised Hashing , 2016, ACM Multimedia.

[61]  Max Welling,et al.  Semi-Supervised Classification with Graph Convolutional Networks , 2016, ICLR.

[62]  Ngai-Man Cheung,et al.  Learning to Hash with Binary Deep Neural Network , 2016, ECCV.

[63]  Xavier Bresson,et al.  Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering , 2016, NIPS.

[64]  Shiguang Shan,et al.  Deep Supervised Hashing for Fast Image Retrieval , 2016, International Journal of Computer Vision.

[65]  Jiwen Lu,et al.  Learning Compact Binary Descriptors with Unsupervised Deep Neural Networks , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[66]  Tom Drummond,et al.  FANNG: Fast Approximate Nearest Neighbour Graphs , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[67]  Anthony K. H. Tung,et al.  LazyLSH: Approximate Nearest Neighbor Search for Multiple Distance Functions with a Single Index , 2016, SIGMOD Conference.

[68]  Stefano Ermon,et al.  Generative Adversarial Imitation Learning , 2016, NIPS.

[69]  Nicu Sebe,et al.  A Survey on Learning to Hash , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[70]  Yury A. Malkov,et al.  Efficient and Robust Approximate Nearest Neighbor Search Using Hierarchical Navigable Small World Graphs , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[71]  Jianmin Wang,et al.  Deep Quantization Network for Efficient Image Retrieval , 2016, AAAI.

[72]  Jianmin Wang,et al.  Deep Hashing Network for Efficient Similarity Retrieval , 2016, AAAI.

[73]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[74]  Venkatesh Saligrama,et al.  Efficient Training of Very Deep Neural Networks for Supervised Hashing , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[75]  Wu-Jun Li,et al.  Feature Learning Based Deep Supervised Hashing with Pairwise Labels , 2015, IJCAI.

[76]  Ling Shao,et al.  Deep Self-Taught Hashing for Image Retrieval , 2019, IEEE Transactions on Cybernetics.

[77]  Jingkuan Song,et al.  Scalable Multimedia Retrieval by Deep Learning Hashing with Relative Similarity Learning , 2015, ACM Multimedia.

[78]  Wei Liu,et al.  Learning to Hash for Indexing Big Data—A Survey , 2015, Proceedings of the IEEE.

[79]  Qiang Huang,et al.  Query-Aware Locality-Sensitive Hashing for Approximate Nearest Neighbor Search , 2015, Proc. VLDB Endow..

[80]  Lei Zhang,et al.  Bit-Scalable Deep Hashing With Regularized Similarity Learning for Image Retrieval and Person Re-Identification , 2015, IEEE Transactions on Image Processing.

[81]  Jiwen Lu,et al.  Deep hashing for compact binary codes learning , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[82]  Hanjiang Lai,et al.  Simultaneous feature learning and hash coding with deep neural networks , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[83]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[84]  Michael I. Jordan,et al.  Trust Region Policy Optimization , 2015, ICML.

[85]  Robert P. Sheridan,et al.  Deep Neural Nets as a Method for Quantitative Structure-Activity Relationships , 2015, J. Chem. Inf. Model..

[86]  Tieniu Tan,et al.  Deep semantic ranking based hashing for multi-label image retrieval , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[87]  B. Frey,et al.  The human splicing code reveals new insights into the genetic determinants of disease , 2015, Science.

[88]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[89]  Yoshua Bengio,et al.  On Using Very Large Target Vocabulary for Neural Machine Translation , 2014, ACL.

[90]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[91]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[92]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[93]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[94]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[95]  Vladimir Krylov,et al.  Approximate nearest neighbor algorithm based on navigable small world graphs , 2014, Inf. Syst..

[96]  Xuemin Lin,et al.  SRS: Solving c-Approximate Nearest Neighbor Queries in High Dimensional Euclidean Space with a Tiny Index , 2014, Proc. VLDB Endow..

[97]  Alexandr Andoni,et al.  Spectral Approaches to Nearest Neighbor Search , 2014, 2014 IEEE 55th Annual Symposium on Foundations of Computer Science.

[98]  Hanjiang Lai,et al.  Supervised Hashing for Image Retrieval via Image Representation Learning , 2014, AAAI.

[99]  Jingdong Wang,et al.  Composite Quantization for Approximate Nearest Neighbor Search , 2014, ICML.

[100]  Jason Weston,et al.  Question Answering with Subgraph Embeddings , 2014, EMNLP.

[101]  Jonathan Tompson,et al.  Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation , 2014, NIPS.

[102]  Ming Yang,et al.  DeepFace: Closing the Gap to Human-Level Performance in Face Verification , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[103]  David G. Lowe,et al.  Scalable Nearest Neighbor Algorithms for High Dimensional Data , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[104]  Xiang Zhang,et al.  OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks , 2013, ICLR.

[105]  Parikshit Ram,et al.  Which Space Partitioning Tree to Use for Search? , 2013, NIPS.

[106]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[107]  Tara N. Sainath,et al.  Improvements to Deep Convolutional Neural Networks for LVCSR , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.

[108]  Camille Couprie,et al.  Learning Hierarchical Features for Scene Labeling , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[109]  Thomas L. Dean,et al.  Fast, Accurate Detection of 100,000 Object Classes on a Single Machine , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[110]  Anton van den Hengel,et al.  Learning Compact Binary Codes for Visual Tracking , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[111]  Peter Sanders,et al.  Think Locally, Act Globally: Highly Balanced Graph Partitioning , 2013, SEA.

[112]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[113]  David J. Fleet,et al.  Hamming Distance Metric Learning , 2012, NIPS.

[114]  Antonio Torralba,et al.  Multidimensional Spectral Hashing , 2012, ECCV.

[115]  Victor S. Lempitsky,et al.  The Inverted Multi-Index , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[116]  David J. Fleet,et al.  Fast search in Hamming space with multi-index hashing , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[117]  Wilfred Ng,et al.  Locality-sensitive hashing scheme based on dynamic collision counting , 2012, SIGMOD Conference.

[118]  Yihong Gong,et al.  Learning to Search Efficiently in High Dimensions , 2011, NIPS.

[119]  Wei Liu,et al.  Hashing with Graphs , 2011, ICML.

[120]  Tal Hassner,et al.  Face recognition in unconstrained videos with matched background similarity , 2011, CVPR 2011.

[121]  Kai Li,et al.  Efficient k-nearest neighbor graph construction for generic similarity measures , 2011, WWW.

[122]  Panos Kalnis,et al.  Efficient and accurate nearest neighbor and closest pair search in high-dimensional space , 2010, TODS.

[123]  Lucas C. Parra,et al.  Maximum Likelihood in Cost-Sensitive Learning: Model Specification, Approximations, and Upper Bounds , 2010, J. Mach. Learn. Res..

[124]  Yousef Saad,et al.  Fast Approximate kNN Graph Construction for High Dimensional Data via Recursive Lanczos Bisection , 2009, J. Mach. Learn. Res..

[125]  Tat-Seng Chua,et al.  NUS-WIDE: a real-world web image database from National University of Singapore , 2009, CIVR '09.

[126]  Aapo Hyvärinen,et al.  Natural Image Statistics - A Probabilistic Approach to Early Computational Vision , 2009, Computational Imaging and Vision.

[127]  Antonio Torralba,et al.  Spectral Hashing , 2008, NIPS.

[128]  Richard I. Hartley,et al.  Optimised KD-trees for fast image descriptor matching , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[129]  Sanjoy Dasgupta,et al.  Random projection trees and low dimensional manifolds , 2008, STOC.

[130]  Sanjoy Dasgupta,et al.  A learning framework for nearest neighbor search , 2007, NIPS.

[131]  Alan Fern,et al.  On learning linear ranking functions for beam search , 2007, ICML '07.

[132]  Nicole Immorlica,et al.  Locality-sensitive hashing scheme based on p-stable distributions , 2004, SCG '04.

[133]  S. Hochreiter,et al.  Long Short-Term Memory , 1997, Neural Computation.

[134]  Lawrence D. Jackel,et al.  Backpropagation Applied to Handwritten Zip Code Recognition , 1989, Neural Computation.

[135]  由希 辻 Representation , 2020, The SAGE International Encyclopedia of Mass Media and Society.

[136]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[137]  Tara N. Sainath,et al.  The shared views of four research groups ) , 2012 .

[138]  Cordelia Schmid,et al.  Product Quantization for Nearest Neighbor Search , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[139]  Peter Stone,et al.  Reinforcement learning , 2019, Scholarpedia.

[140]  Alex Krizhevsky,et al.  Learning Multiple Layers of Features from Tiny Images , 2009 .

[141]  David G. Lowe,et al.  Fast Approximate Nearest Neighbors with Automatic Algorithm Configuration , 2009, VISAPP.

[142]  Jaana Kekäläinen,et al.  IR evaluation methods for retrieving highly relevant documents , 2000, SIGIR '00.

[143]  Michael A. Arbib,et al.  The handbook of brain theory and neural networks , 1995, A Bradford book.

[144]  A. Krizhevsky ImageNet Classification with Deep Convolutional Neural Networks , 2022 .