Deep Image Retrieval: A Survey

In recent years a vast amount of visual content has been generated and shared from various fields, such as social media platforms, medical images, and robotics. This abundance of content creation and sharing has introduced new challenges. In particular, searching databases for similar content, i.e., content based image retrieval (CBIR), is a long-established research area, and more efficient and accurate methods are needed for real time retrieval. Artificial intelligence has made progress in CBIR and has significantly facilitated the process of intelligent search. In this survey we organize and review recent CBIR works that are developed based on deep learning algorithms and techniques, including insights and techniques from recent papers. We identify and present the commonly-used databases, benchmarks, and evaluation methods used in the field. We collect common challenges and propose promising future directions. More specifically, we focus on image retrieval with deep learning and organize the state of the art methods according to the types of deep network structure, deep features, feature enhancement methods, and network fine-tuning strategies. Our survey considers a wide variety of recent methods, aiming to promote a global view of the field of category-based CBIR.

[1]  Jiwen Lu,et al.  Deep hashing for compact binary codes learning , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  C. Lawrence Zitnick,et al.  Edge Boxes: Locating Object Proposals from Edges , 2014, ECCV.

[3]  Atsuto Maki,et al.  Visual Instance Retrieval with Deep Convolutional Networks , 2014, ICLR.

[4]  Jiwen Lu,et al.  Hardness-Aware Deep Metric Learning , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Yonghong Tian,et al.  CNN vs. SIFT for Image Retrieval: Alternative or Complementary? , 2016, ACM Multimedia.

[6]  Avik Bhattacharya,et al.  Siamese graph convolutional network for content based remote sensing image retrieval , 2019, Comput. Vis. Image Underst..

[7]  Leo Sampaio Ferraz Ribeiro,et al.  Sketching out the details: Sketch-based image retrieval using convolutional neural networks with multi-stage regression , 2018, Comput. Graph..

[8]  Suha Kwak,et al.  Proxy Anchor Loss for Deep Metric Learning , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Kamalraj Subramaniam,et al.  A Review on Multiple Approaches to Medical Image Retrieval System , 2020 .

[10]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Chunheng Wang,et al.  Iterative Manifold Embedding Layer Learned by Incomplete Data for Large-Scale Image Retrieval , 2017, IEEE Transactions on Multimedia.

[12]  Jing Liu,et al.  Deep Incremental Hashing Network for Efficient Image Retrieval , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Yoshua Bengio,et al.  How transferable are features in deep neural networks? , 2014, NIPS.

[14]  Tao Xiang,et al.  Deep Spatial-Semantic Attention for Fine-Grained Sketch-Based Image Retrieval , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[15]  Ling-Yu Duan,et al.  DeepHash for Image Instance Retrieval: Getting Regularization, Depth and Fine-Tuning Right , 2017, ICMR.

[16]  Chunyan Miao,et al.  Online multimodal deep similarity learning with application to image retrieval , 2013, ACM Multimedia.

[17]  Xiu-Shen Wei,et al.  Selective Convolutional Descriptor Aggregation for Fine-Grained Image Retrieval , 2016, IEEE Transactions on Image Processing.

[18]  Guanbin Li,et al.  Visual Saliency Detection Based on Multiscale Deep CNN Features. , 2016, IEEE transactions on image processing : a publication of the IEEE Signal Processing Society.

[19]  Noel E. O'Connor,et al.  Bags of Local Convolutional Features for Scalable Instance Search , 2016, ICMR.

[20]  Jungmin Lee,et al.  Attention-based Ensemble for Deep Metric Learning , 2018, ECCV.

[21]  Hongxun Yao,et al.  Exploiting the complementary strengths of multi-layer CNN features for image retrieval , 2017, Neurocomputing.

[22]  Miroslaw Bober,et al.  Siamese Network of Deep Fisher-Vector Descriptors for Image Retrieval , 2017, ArXiv.

[23]  Giorgos Tolias,et al.  Fine-Tuning CNN Image Retrieval with No Human Annotation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Xian-Ling Mao,et al.  Object Detection based Deep Unsupervised Hashing , 2019, IJCAI.

[25]  Ji Wan,et al.  Deep Learning for Content-Based Image Retrieval: A Comprehensive Study , 2014, ACM Multimedia.

[26]  Weilin Huang,et al.  Cross-Batch Memory for Embedding Learning , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Qi Tian,et al.  Regularized Diffusion Process on Bidirectional Context for Object Retrieval , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  Albert Gordo,et al.  End-to-End Learning of Deep Visual Representations for Image Retrieval , 2016, International Journal of Computer Vision.

[29]  Stefan Carlsson,et al.  CNN Features Off-the-Shelf: An Astounding Baseline for Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[30]  Ivan Laptev,et al.  Learning and Transferring Mid-level Image Representations Using Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[31]  Daniel Carlos Guimarães Pedronette,et al.  Graph-based selective rank fusion for unsupervised image retrieval , 2020, Pattern Recognit. Lett..

[32]  Ronan Sicre,et al.  Particular object retrieval with integral max-pooling of CNN activations , 2015, ICLR.

[33]  Yannis Avrithis,et al.  Efficient Diffusion on Region Manifolds: Recovering Small Objects with Compact CNN Representations , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[34]  Ondrej Chum,et al.  CNN Image Retrieval Learns from BoW: Unsupervised Fine-Tuning with Hard Examples , 2016, ECCV.

[35]  Alberto Del Bimbo,et al.  Image Tag Assignment, Refinement and Retrieval , 2015, ACM Multimedia.

[36]  Jack Sim,et al.  Unifying Deep Local and Global Features for Efficient Image Search , 2020, ArXiv.

[37]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[38]  Yannis Avrithis,et al.  Revisiting Oxford and Paris: Large-Scale Image Retrieval Benchmarking , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[39]  Thomas Mensink,et al.  Image Classification with the Fisher Vector: Theory and Practice , 2013, International Journal of Computer Vision.

[40]  Stan Sclaroff,et al.  Hashing with Mutual Information , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[41]  Philip S. Yu,et al.  Maximum-Margin Hamming Hashing , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[42]  Ming-Hsuan Yang,et al.  Dynamic Match Kernel With Deep Convolutional Features for Image Retrieval , 2018, IEEE Transactions on Image Processing.

[43]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[44]  Sinisa Todorovic,et al.  Ensemble Deep Manifold Similarity Learning Using Hard Proxies , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[45]  Wenyin Liu,et al.  A novel feature representation: Aggregating convolution kernels for image retrieval , 2020, Neural Networks.

[46]  Marcello Pelillo,et al.  Multi-feature Fusion for Image Retrieval Using Constrained Dominant Sets , 2018, Image Vis. Comput..

[47]  Anastasios Tefas,et al.  Deep convolutional image retrieval: A general framework , 2018, Signal Process. Image Commun..

[48]  Jie Lin,et al.  A practical guide to CNNs and Fisher Vectors for image instance retrieval , 2015, Signal Process..

[49]  Shin'ichi Satoh,et al.  Faster R-CNN Features for Instance Search , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[50]  Chu-Song Chen,et al.  Cross-batch Reference Learning for Deep Classification and Retrieval , 2016, ACM Multimedia.

[51]  Qi Tian,et al.  Recent Advance in Content-based Image Retrieval: A Literature Survey , 2017, ArXiv.

[52]  George Vogiatzis,et al.  Learning Non-Metric Visual Similarity for Image Retrieval , 2017, Image Vis. Comput..

[53]  Lei Song,et al.  Selective deep ensemble for instance retrieval , 2018, Multimedia Tools and Applications.

[54]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[55]  Kaiqi Huang,et al.  Beyond Triplet Loss: A Deep Quadruplet Network for Person Re-identification , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[56]  Zi Huang,et al.  Quartet-net Learning for Visual Instance Retrieval , 2016, ACM Multimedia.

[57]  Krystian Mikolajczyk,et al.  SOLAR: Second-Order Loss and Attention for Image Retrieval , 2020, ECCV.

[58]  Shin'ichi Satoh,et al.  Efficient Image Retrieval via Decoupling Diffusion into Online and Offline Processing , 2018, AAAI.

[59]  Noel E. O'Connor,et al.  Shallow and Deep Convolutional Networks for Saliency Prediction , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[60]  Qi Li,et al.  Learning deep similarity models with focus ranking for fabric image retrieval , 2017, Image Vis. Comput..

[61]  Chunheng Wang,et al.  Unsupervised Part-Based Weighting Aggregation of Deep Convolutional Features for Image Retrieval , 2017, AAAI.

[62]  Cordelia Schmid,et al.  Convolutional Kernel Networks , 2014, NIPS.

[63]  Tiejun Huang,et al.  Deep Relative Distance Learning: Tell the Difference between Similar Vehicles , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[64]  Hua Li,et al.  A multi-layer deep fusion convolutional neural network for sketch based image retrieval , 2018, Neurocomputing.

[65]  Maksims Volkovs,et al.  Guided Similarity Separation for Image Retrieval , 2019, NeurIPS.

[66]  Xiaogang Wang,et al.  DeepFashion: Powering Robust Clothes Recognition and Retrieval with Rich Annotations , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[67]  Noel E. O'Connor,et al.  Saliency Weighted Convolutional Features for Instance Search , 2017, 2018 International Conference on Content-Based Multimedia Indexing (CBMI).

[68]  Larry S. Davis,et al.  An Analysis of Object Embeddings for Image Retrieval , 2019, ArXiv.

[69]  David Stutz,et al.  Neural Codes for Image Retrieval , 2015 .

[70]  Albert Gordo,et al.  Deep Image Retrieval: Learning Global Representations for Image Search , 2016, ECCV.

[71]  Andrew Zisserman,et al.  Return of the Devil in the Details: Delving Deep into Convolutional Nets , 2014, BMVC.

[72]  Weihong Deng,et al.  Hybrid-Attention Based Decoupled Metric Learning for Zero-Shot Image Retrieval , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[73]  Ngai-Man Cheung,et al.  Simultaneous Feature Aggregating and Hashing for Large-Scale Image Search , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[74]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[75]  Zi Huang,et al.  Where to Focus: Query Adaptive Matching for Instance Retrieval Using Convolutional Feature Maps , 2016, ArXiv.

[76]  Michael Isard,et al.  Object retrieval with large vocabularies and fast spatial matching , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[77]  Yu Liu,et al.  DeepIndex for Accurate and Efficient Image Retrieval , 2015, ICMR.

[78]  Yong Rui,et al.  Image search—from thousands to billions in 20 years , 2013, TOMCCAP.

[79]  Chunheng Wang,et al.  Spatial weighted fisher vector for image retrieval , 2017, 2017 IEEE International Conference on Multimedia and Expo (ICME).

[80]  Toshihiko Yamasaki,et al.  Efficient and Interactive Spatial-Semantic Image Retrieval , 2018, MMM.

[81]  Jian Wang,et al.  Deep Metric Learning with Angular Loss , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[82]  Jaeyoon Kim,et al.  Regional Attention Based Deep Feature for Image Retrieval , 2018, BMVC.

[83]  Yang Hua,et al.  Ranked List Loss for Deep Metric Learning , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[84]  Jon Almazán,et al.  Learning With Average Precision: Training Image Retrieval With a Listwise Loss , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[85]  Ling Shao,et al.  Auto-Encoding Twin-Bottleneck Hashing , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[86]  Zhen Yang,et al.  Semi-Supervised Metric Learning-Based Anchor Graph Hashing for Large-Scale Image Retrieval , 2019, IEEE Transactions on Image Processing.

[87]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[88]  Qi Tian,et al.  Good Practice in CNN Feature Transfer , 2016, ArXiv.

[89]  Miguel Á. Carreira-Perpiñán,et al.  Hashing with binary autoencoders , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[90]  Max Welling,et al.  Semi-Supervised Classification with Graph Convolutional Networks , 2016, ICLR.

[91]  Jean Ponce,et al.  A Theoretical Analysis of Feature Pooling in Visual Recognition , 2010, ICML.

[92]  Abbes Amira,et al.  Content-based image retrieval with compact deep convolutional features , 2017, Neurocomputing.

[93]  Larry S. Davis,et al.  Exploiting local features from deep networks for image retrieval , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[94]  Yizhou Yu,et al.  Visual Saliency Detection Based on Multiscale Deep CNN Features , 2016, IEEE Transactions on Image Processing.

[95]  Yang Song,et al.  Learning Fine-Grained Image Similarity with Deep Ranking , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[96]  Yannis Avrithis,et al.  Fast Spectral Ranking for Similarity Search , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[97]  Yannis Avrithis,et al.  Mining on Manifolds: Metric Learning Without Labels , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[98]  Jianmin Wang,et al.  HashGAN: Deep Learning to Hash with Pair Conditional Wasserstein GAN , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[99]  Qi Tian,et al.  Image Classification and Retrieval are ONE , 2015, ICMR.

[100]  Yafei Zhang,et al.  Nonlinear embedding neural codes for visual instance retrieval , 2018, Neurocomputing.

[101]  Miroslaw Bober,et al.  REMAP: Multi-Layer Entropy-Guided Pooling of Dense CNN Features for Image Retrieval , 2019, IEEE Transactions on Image Processing.

[102]  Anastasios Tefas,et al.  Exploiting supervised learning for finetuning deep CNNs in content based image retrieval , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[103]  Xiang Zhang,et al.  OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks , 2013, ICLR.

[104]  Jie Li,et al.  Unsupervised Semantic-Preserving Adversarial Hashing for Image Search , 2019, IEEE Transactions on Image Processing.

[105]  Tomasz Trzcinski,et al.  BinGAN: Learning Compact Binary Descriptors with a Regularized GAN , 2018, NeurIPS.

[106]  Haofeng Zhang,et al.  Clustering-driven unsupervised deep hashing for image retrieval , 2019, Neurocomputing.

[107]  Zi Huang,et al.  Local Deep Descriptors in Bag-of-Words for Image Retrieval , 2017, ACM Multimedia.

[108]  Louis Chevallier,et al.  Hybrid multi-layer deep CNN/aggregator feature for image classification , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[109]  Atsuto Maki,et al.  Factors of Transferability for a Generic ConvNet Representation , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[110]  Robert Pless,et al.  Deep Randomized Ensembles for Metric Learning , 2018, ECCV.

[111]  Kihyuk Sohn,et al.  Improved Deep Metric Learning with Multi-class N-pair Loss Objective , 2016, NIPS.

[112]  Dacheng Tao,et al.  DistillHash: Unsupervised Deep Hashing by Distilling Data Pairs , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[113]  Cordelia Schmid,et al.  Hamming Embedding and Weak Geometric Consistency for Large Scale Image Search , 2008, ECCV.

[114]  Nam Ik Cho,et al.  Regional deep feature aggregation for image retrieval , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[115]  Nicu Sebe,et al.  Content-based multimedia information retrieval: State of the art and challenges , 2006, TOMCCAP.

[116]  Chong-Wah Ngo,et al.  A Hamming Embedding Kernel with Informative Bag-of-Visual Words for Video Semantic Indexing , 2014, TOMCCAP.

[117]  Cordelia Schmid,et al.  Aggregating local descriptors into a compact image representation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[118]  Maksims Volkovs,et al.  Explore-Exploit Graph Traversal for Image Retrieval , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[119]  Heng Tao Shen,et al.  Deep Region Hashing for Efficient Large-scale Instance Search from Images , 2017, ArXiv.

[120]  Dacheng Tao,et al.  Two-Stream Deep Hashing With Class-Specific Centers for Supervised Image Search , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[121]  Silvio Savarese,et al.  Deep Metric Learning via Lifted Structured Feature Embedding , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[122]  Yinghuan Shi,et al.  Modelling Diffusion Process by Deep Neural Networks for Image Retrieval , 2018, BMVC.

[123]  Horst Bischof,et al.  Diffusion Processes for Retrieval Revisited , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[124]  Qi Tian,et al.  SIFT Meets CNN: A Decade Survey of Instance Retrieval , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[125]  R. Venkatesh Babu,et al.  Object level deep feature pooling for compact image representation , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[126]  Xavier Giró-i-Nieto,et al.  Class-Weighted Convolutional Features for Visual Instance Search , 2017, BMVC.

[127]  Long Chen,et al.  Dress Fashionably: Learn Fashion Collocation With Deep Mixed-Category Metric Learning , 2018, AAAI.

[128]  Shiguang Shan,et al.  Deep Supervised Hashing for Fast Image Retrieval , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[129]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[130]  Yao Zhao,et al.  Two-stream Attentive CNNs for Image Retrieval , 2017, ACM Multimedia.

[131]  Iasonas Kokkinos,et al.  Discriminative Learning of Deep Convolutional Feature Point Descriptors , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[132]  Rita Cucchiara,et al.  Predicting Human Eye Fixations via an LSTM-Based Saliency Attentive Model , 2016, IEEE Transactions on Image Processing.

[133]  Qi Tian,et al.  Exploiting Hierarchical Activations of Neural Network for Image Retrieval , 2016, ACM Multimedia.

[134]  Andrew Zisserman,et al.  Smooth-AP: Smoothing the Path Towards Large-Scale Image Retrieval , 2020, ECCV.

[135]  Jongtack Kim,et al.  Combination of Multiple Global Descriptors for Image Retrieval , 2019, ArXiv.

[136]  Michael Isard,et al.  Lost in quantization: Improving particular object retrieval in large scale image databases , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[137]  Kohei Ozaki,et al.  Large-scale Landmark Retrieval/Recognition under a Noisy and Diverse Dataset , 2019, ArXiv.

[138]  Yuxin Peng,et al.  SSDH: Semi-Supervised Deep Hashing for Large Scale Image Retrieval , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[139]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[140]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[141]  Bohyung Han,et al.  Large-Scale Image Retrieval with Attentive Deep Local Features , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[142]  Ling Shao,et al.  Deep Sketch Hashing: Fast Free-Hand Sketch-Based Image Retrieval , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[143]  Qi Tian,et al.  Retrieval Oriented Deep Feature Learning With Complementary Supervision Mining , 2018, IEEE Transactions on Image Processing.

[144]  David Haussler,et al.  Exploiting Generative Models in Discriminative Classifiers , 1998, NIPS.

[145]  Tao Mei,et al.  Deep Domain Adaptation Hashing with Adversarial Learning , 2018, SIGIR.

[146]  Qi Tian,et al.  Accurate Image Search with Multi-Scale Contextual Evidences , 2016, International Journal of Computer Vision.

[147]  Simon Osindero,et al.  Cross-Dimensional Weighting for Aggregated Deep Convolutional Features , 2015, ECCV Workshops.

[148]  Jingkuan Song,et al.  Binary Generative Adversarial Networks for Image Retrieval , 2017, AAAI.

[149]  Nicu Sebe,et al.  A Survey on Learning to Hash , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[150]  Ling Shao,et al.  Deep Self-Taught Hashing for Image Retrieval , 2019, IEEE Transactions on Cybernetics.

[151]  Victor S. Lempitsky,et al.  Aggregating Local Deep Features for Image Retrieval , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[152]  Gustavo Carneiro,et al.  Smart Mining for Deep Metric Learning , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[153]  David Nistér,et al.  Scalable Recognition with a Vocabulary Tree , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[154]  Tieniu Tan,et al.  Deep semantic ranking based hashing for multi-label image retrieval , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[155]  Ngai-Man Cheung,et al.  From Selective Deep Convolutional Features to Compact Binary Representations for Image Retrieval , 2018, ACM Trans. Multim. Comput. Commun. Appl..

[156]  Giorgio Giacinto,et al.  Information fusion in content based image retrieval: A comprehensive overview , 2017, Inf. Fusion.

[157]  Atsuto Maki,et al.  A Baseline for Visual Instance Retrieval with Deep Convolutional Networks , 2014, ICLR 2015.

[158]  Abbes Amira,et al.  Semantic content-based image retrieval: A comprehensive study , 2015, J. Vis. Commun. Image Represent..

[159]  Yair Movshovitz-Attias,et al.  No Fuss Distance Metric Learning Using Proxies , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[160]  Cordelia Schmid,et al.  Aggregating Local Image Descriptors into Compact Codes , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[161]  Chang Zhou,et al.  Learning Feature Embedding with Strong Neural Activations for Fine-Grained Retrieval , 2017, ACM Multimedia.

[162]  Matti Pietikäinen,et al.  Deep Learning for Generic Object Detection: A Survey , 2018, International Journal of Computer Vision.

[163]  Cheng Deng,et al.  Unsupervised Deep Generative Adversarial Hashing Network , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[164]  Asifullah Khan,et al.  A survey of the recent architectures of deep convolutional neural networks , 2019, Artificial Intelligence Review.

[165]  Qi Tian,et al.  Scalable Person Re-identification: A Benchmark , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[166]  Yan Lu,et al.  Local Descriptors Optimized for Average Precision , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[167]  Svetlana Lazebnik,et al.  Multi-scale Orderless Pooling of Deep Convolutional Activation Features , 2014, ECCV.

[168]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[169]  Jiwen Lu,et al.  Deep Metric Learning for Visual Understanding: An Overview of Recent Advances , 2017, IEEE Signal Processing Magazine.

[170]  Yan Pan,et al.  Object-Location-Aware Hashing for Multi-Label Image Retrieval via Automatic Mask Learning , 2018, IEEE Transactions on Image Processing.

[171]  Tinne Tuytelaars,et al.  On the Exploration of Incremental Learning for Fine-grained Image Retrieval , 2020, BMVC.

[172]  Jianmin Wang,et al.  Deep Quantization Network for Efficient Image Retrieval , 2016, AAAI.