Multi-granularity feature learning network for deep hashing

Abstract With the ever-increasing growth of massive high-dimensional data, deep learning to hash technology has been widely used for approximate nearest neighbor search on large-scale datasets, due to its remarkable efficiency and retrieval performance. In this paper, we propose a novel supervised deep hashing method, named Multi-granularity Feature Learning Hashing (MFLH), to learn compact binary descriptors. Specifically, we design an end-to-end trainable network to jointly learn feature representations and hash codes, in which a global stream and a local stream are responsible for learning feature representations with different granularities, and a hashing stream is devoted to encoding multi-granularity features into binary codes. In the local stream, a Cyclic Shift Mechanism (CSM) strategy is developed to assist mining more discriminative local fine information. In the meantime, an approximate sign activation function, which can be used for continuous optimization, is introduced to reduce quantization error. Furthermore, an improved variant of the triplet loss is presented to enhance the representation of pair-wise similarity for hash codes. Extensive experiments demonstrate that our proposed method significantly outperforms state-of-the-art hashing methods on the benchmark datasets, thereby verifying the effectiveness of our approach. Source code is provided for reproducibility.

[1]  Piotr Indyk,et al.  Similarity Search in High Dimensions via Hashing , 1999, VLDB.

[2]  Svetlana Lazebnik,et al.  Iterative quantization: A procrustean approach to learning binary codes , 2011, CVPR 2011.

[3]  Wu-Jun Li,et al.  Feature Learning Based Deep Supervised Hashing with Pairwise Labels , 2015, IJCAI.

[4]  Neha Jain,et al.  Deep semantic preserving hashing for large scale image retrieval , 2018, Multimedia Tools and Applications.

[5]  Xin Zhao,et al.  On Trivial Solution and High Correlation Problems in Deep Supervised Hashing , 2018, AAAI.

[6]  Xiaoqiang Lu,et al.  Deep discrete hashing with pairwise correlation learning , 2020, Neurocomputing.

[7]  Stan Sclaroff,et al.  Hashing with Mutual Information , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Rongrong Ji,et al.  Supervised hashing with kernels , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Geoffrey E. Hinton,et al.  Semantic hashing , 2009, Int. J. Approx. Reason..

[10]  Jie Zhou,et al.  Unsupervised Variational Video Hashing With 1D-CNN-LSTM Networks , 2020, IEEE Transactions on Multimedia.

[11]  Jie Li,et al.  Unsupervised Semantic-Preserving Adversarial Hashing for Image Search , 2019, IEEE Transactions on Image Processing.

[12]  Yuanjie Zheng,et al.  Solving Jigsaw Puzzles via Nonconvex Quadratic Programming With the Projected Power Method , 2021, IEEE Transactions on Multimedia.

[13]  Cordelia Schmid,et al.  Product Quantization for Nearest Neighbor Search , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Bingbing Ni,et al.  Loopy Residual Hashing: Filling the Quantization Gap for Image Retrieval , 2020, IEEE Transactions on Multimedia.

[15]  Jianmin Wang,et al.  Deep Cauchy Hashing for Hamming Space Retrieval , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[16]  Hong Liu,et al.  Towards Optimal Discrete Online Hashing with Balanced Similarity , 2019, AAAI.

[17]  Wu-Jun Li,et al.  Deep Discrete Supervised Hashing , 2017, IEEE Transactions on Image Processing.

[18]  Ian D. Reid,et al.  Fast Training of Triplet-Based Deep Binary Embedding Networks , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[19]  Jianmin Wang,et al.  Deep Hashing Network for Efficient Similarity Retrieval , 2016, AAAI.

[20]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[21]  Yunchao Wei,et al.  Horizontal Pyramid Matching for Person Re-identification , 2018, AAAI.

[22]  Jinhui Tang,et al.  Discriminative Deep Hashing for Scalable Face Image Retrieval , 2017, IJCAI.

[23]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[24]  Jiwen Lu,et al.  Deep hashing for compact binary codes learning , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Wei Liu,et al.  Hashing with Graphs , 2011, ICML.

[26]  Hanjiang Lai,et al.  Simultaneous feature learning and hash coding with deep neural networks , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Philip S. Yu,et al.  HashNet: Deep Learning to Hash by Continuation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[28]  Jianmin Wang,et al.  Deep Quantization Network for Efficient Image Retrieval , 2016, AAAI.

[29]  Jingkuan Song,et al.  Binary Generative Adversarial Networks for Image Retrieval , 2017, AAAI.

[30]  Nicu Sebe,et al.  A Survey on Learning to Hash , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  Rongrong Ji,et al.  Towards Optimal Binary Code Learning via Ordinal Embedding , 2016, AAAI.

[32]  Huaxiang Zhang,et al.  Deep Collaborative Multi-View Hashing for Large-Scale Image Search , 2020, IEEE Transactions on Image Processing.

[33]  Yue Gao,et al.  Deep Multi-View Enhancement Hashing for Image Retrieval , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[35]  Zhi-Hua Zhou,et al.  Column Sampling Based Discrete Supervised Hashing , 2016, AAAI.

[36]  Jiwen Lu,et al.  Deep Hashing for Scalable Image Search , 2017, IEEE Transactions on Image Processing.

[37]  Bo Zhang,et al.  Scalable Discrete Supervised Multimedia Hash Learning With Clustering , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[38]  Jianmin Wang,et al.  HashGAN: Deep Learning to Hash with Pair Conditional Wasserstein GAN , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[39]  Xiaoqiang Lu,et al.  Deep Category-Level and Regularized Hashing With Global Semantic Similarity Learning , 2020, IEEE Transactions on Cybernetics.

[40]  Haofeng Zhang,et al.  Clustering-driven unsupervised deep hashing for image retrieval , 2019, Neurocomputing.

[41]  Jiwen Lu,et al.  Deep Hashing via Discrepancy Minimization , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[42]  Bingbing Ni,et al.  Deep Progressive Hashing for Image Retrieval , 2019, IEEE Transactions on Multimedia.

[43]  Hanjiang Lai,et al.  Supervised Hashing for Image Retrieval via Image Representation Learning , 2014, AAAI.

[44]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.