Deep multiscale divergence hashing for image retrieval

Abstract. Image retrieval based on deep learning of hash has made great progress. The hash method increases retrieval speed greatly while saving storage space. However, some problems exist, such as the distinctiveness of image feature still needs to be improved and some image features are still redundant. We propose a new deep learning to hash method, namely, deep multiscale divergence hashing, which provides a high diversity and compact image feature for image retrieval. The discriminative features from deep neural networks are identified by the importance criterion in network pruning techniques and the feature redundancy is reduced. Then, the selected features across different layers are fused in a certain proportion to increase the diversity of features for image retrieval. We also present a hybrid loss function in hash space, which consists of the weighted pairwise cross-entropy loss function and the KL-divergence. It tends to minimize the hamming distance between similar images and maximize the hamming distance between different images, which helps improve the accuracy. Massive experimental results show that our method achieves better feature distinguishability and more advanced image retrieval accuracy, and surpasses HashNet by 11.46%, 7.58%, and 13.86% on MS COCO, NUS-WIDE, and CIFAR-10 datasets, respectively.

[1]  Jian Cheng,et al.  Semi-supervised Generative Adversarial Hashing for Image Retrieval , 2018, ECCV.

[2]  Xiangyu Zhang,et al.  Channel Pruning for Accelerating Very Deep Neural Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[3]  Heng Tao Shen,et al.  Semi-Paired Discrete Hashing: Learning Latent Hash Codes for Semi-Paired Cross-View Retrieval , 2017, IEEE Transactions on Cybernetics.

[4]  Piotr Indyk,et al.  Similarity Search in High Dimensions via Hashing , 1999, VLDB.

[5]  Rabul Hussain Laskar,et al.  Image authentication based on robust image hashing with geometric correction , 2018, Multimedia Tools and Applications.

[6]  Jianmin Wang,et al.  Deep Cauchy Hashing for Hamming Space Retrieval , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[7]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Shih-Fu Chang,et al.  Semi-Supervised Hashing for Large-Scale Search , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Xiushan Nie,et al.  Efficient weakly-supervised discrete hashing for large-scale social image retrieval , 2020, Pattern Recognit. Lett..

[10]  Antonio Torralba,et al.  Context-based vision system for place and object recognition , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[11]  Jianmin Wang,et al.  HashGAN: Deep Learning to Hash with Pair Conditional Wasserstein GAN , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[12]  Wilfred Ng,et al.  Locality-sensitive hashing scheme based on dynamic collision counting , 2012, SIGMOD Conference.

[13]  Song Han,et al.  Deep Compression: Compressing Deep Neural Network with Pruning, Trained Quantization and Huffman Coding , 2015, ICLR.

[14]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[15]  Jiwen Lu,et al.  Deep Hashing via Discrepancy Minimization , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[16]  Mathieu Salzmann,et al.  Learning the Number of Neurons in Deep Networks , 2016, NIPS.

[17]  Jie Yang,et al.  SHISS: Supervised hashing with informative set selection , 2017, Pattern Recognit. Lett..

[18]  Xiuyang Zhao,et al.  Multiple Spaces Deep Hashing for Image Retrieval , 2020, 2020 12th International Conference on Advanced Computational Intelligence (ICACI).

[19]  Li Liu,et al.  DHA: Supervised Deep Learning to Hash with an Adaptive Loss Function , 2019, 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW).

[20]  Aditi,et al.  Robust image hashing through DWT-SVD and spectral residual method , 2017, EURASIP Journal on Image and Video Processing.

[21]  Fumin Shen,et al.  Inductive Hashing on Manifolds , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[22]  Marc Sebban,et al.  A Survey on Metric Learning for Feature Vectors and Structured Data , 2013, ArXiv.

[23]  Philip S. Yu,et al.  Maximum-Margin Hamming Hashing , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[24]  Dacheng Tao,et al.  Joint Deep Multi-View Learning for Image Clustering , 2020, IEEE Transactions on Knowledge and Data Engineering.

[25]  Jianmin Wang,et al.  Deep Quantization Network for Efficient Image Retrieval , 2016, AAAI.

[26]  Jianmin Wang,et al.  Deep Hashing Network for Efficient Similarity Retrieval , 2016, AAAI.

[27]  Trevor Darrell,et al.  Learning to Hash with Binary Reconstructive Embeddings , 2009, NIPS.

[28]  Kilian Q. Weinberger,et al.  Deep Networks with Stochastic Depth , 2016, ECCV.

[29]  Dan Zhang,et al.  Learning to Hash with Partial Tags: Exploring Correlation between Tags and Hashing Bits for Large Scale Image Retrieval , 2014, ECCV.

[30]  Kilian Q. Weinberger,et al.  CondenseNet: An Efficient DenseNet Using Learned Group Convolutions , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[31]  Wei Liu,et al.  Self-Supervised Adversarial Hashing Networks for Cross-Modal Retrieval , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[32]  Hanjiang Lai,et al.  Simultaneous feature learning and hash coding with deep neural networks , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[33]  Zhiqiang Shen,et al.  Learning Efficient Convolutional Networks through Network Slimming , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[34]  Philip S. Yu,et al.  HashNet: Deep Learning to Hash by Continuation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[35]  Anirban Dasgupta,et al.  Fast locality-sensitive hashing , 2011, KDD.

[36]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[37]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[38]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[39]  Yixin Chen,et al.  Compressing Neural Networks with the Hashing Trick , 2015, ICML.

[40]  Hanjiang Lai,et al.  Supervised Hashing for Image Retrieval via Image Representation Learning , 2014, AAAI.

[41]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[42]  Rongrong Ji,et al.  Supervised hashing with kernels , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[43]  Jianmin Wang,et al.  Deep Visual-Semantic Quantization for Efficient Image Retrieval , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[44]  Antonio Torralba,et al.  Spectral Hashing , 2008, NIPS.

[45]  Hanan Samet,et al.  Pruning Filters for Efficient ConvNets , 2016, ICLR.

[46]  Song Han,et al.  Exploring the Regularity of Sparse Structure in Convolutional Neural Networks , 2017, ArXiv.

[47]  Svetlana Lazebnik,et al.  Iterative quantization: A procrustean approach to learning binary codes , 2011, CVPR 2011.

[48]  Bin Liu,et al.  Deep Triplet Quantization , 2018, ACM Multimedia.

[49]  Ram Kumar Karsh,et al.  Robust image hashing using ring partition-PGNMF and local features , 2016, SpringerPlus.

[50]  Wei Liu,et al.  Supervised Discrete Hashing , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[51]  Lucas C. Parra,et al.  Maximum Likelihood in Cost-Sensitive Learning: Model Specification, Approximations, and Upper Bounds , 2010, J. Mach. Learn. Res..

[52]  Nicu Sebe,et al.  A Survey on Learning to Hash , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.