A Statistical Approach to Mining Semantic Similarity for Deep Unsupervised Hashing

The majority of deep unsupervised hashing methods usually first construct pairwise semantic similarity information and then learn to map images into compact hash codes while preserving the similarity structure, which implies that the quality of hash codes highly depends on the constructed semantic similarity structure. However, since the features of images for each kind of semantics usually scatter in high-dimensional space with unknown distribution, previous methods could introduce a large number of false positives and negatives for boundary points of distributions in the local semantic structure based on pairwise cosine distances. Towards this limitation, we propose a general distribution-based metric to depict the pairwise distance between images. Specifically, each image is characterized by its random augmentations that can be viewed as samples from the corresponding latent semantic distribution. Then we estimate the distances between images by calculating the sample distribution divergence of their semantics. By applying this new metric to deep unsupervised hashing, we come up with Distribution-based similArity sTructure rEconstruction (DATE). DATE can generate more accurate semantic similarity information by using non-parametric ball divergence. Moreover, DATE explores both semantic-preserving learning and contrastive learning to obtain high-quality hash codes. Extensive experiments on several widely-used datasets validate the superiority of our DATE.

[1]  Svetlana Lazebnik,et al.  Iterative quantization: A procrustean approach to learning binary codes , 2011, CVPR 2011.

[2]  Rongrong Ji,et al.  Supervised Online Hashing via Hadamard Codebook Learning , 2018, ACM Multimedia.

[3]  Maksims Volkovs,et al.  Guided Similarity Separation for Image Retrieval , 2019, NeurIPS.

[4]  Ling Shao,et al.  Unsupervised Binary Representation Learning with Deep Variational Networks , 2019, International Journal of Computer Vision.

[5]  Heping Zhang,et al.  Ball Covariance: A Generic Measure of Dependence in Banach Space , 2019, Journal of the American Statistical Association.

[6]  Dan Meng,et al.  Deep Unsupervised Hybrid-similarity Hadamard Hashing , 2020, ACM Multimedia.

[7]  Le Song,et al.  Stochastic Generative Hashing , 2017, ICML.

[8]  Heping Zhang,et al.  BALL DIVERGENCE: NONPARAMETRIC TWO SAMPLE TEST. , 2018, Annals of Statistics.

[9]  Weiping Wang,et al.  Asymmetric Deep Hashing for Efficient Hash Code Compression , 2020, ACM Multimedia.

[10]  Tomasz Trzcinski,et al.  BinGAN: Learning Compact Binary Descriptors with a Regularized GAN , 2018, NeurIPS.

[11]  Alex Krizhevsky,et al.  Learning Multiple Layers of Features from Tiny Images , 2009 .

[12]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[13]  Hanjiang Lai,et al.  Simultaneous feature learning and hash coding with deep neural networks , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Philip S. Yu,et al.  HashNet: Deep Learning to Hash by Continuation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[15]  Xianling Mao,et al.  MLS3RDUH: Deep Unsupervised Hashing via Manifold based Local Semantic Similarity Structure Reconstructing , 2020, IJCAI.

[16]  Ling Shao,et al.  Auto-Encoding Twin-Bottleneck Hashing , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[18]  Jun Zhou,et al.  Deep Hashing by Discriminating Hard Examples , 2019, ACM Multimedia.

[19]  Wei Liu,et al.  Semantic Structure-based Unsupervised Deep Hashing , 2018, IJCAI.

[20]  Hanqing Lu,et al.  Pseudo Label based Unsupervised Deep Discriminative Hashing for Image Retrieval , 2017, ACM Multimedia.

[21]  Tat-Seng Chua,et al.  NUS-WIDE: a real-world web image database from National University of Singapore , 2009, CIVR '09.

[22]  Tieniu Tan,et al.  Deep Supervised Discrete Hashing , 2017, NIPS.

[23]  Jinwen Ma,et al.  Deep Unsupervised Hashing by Global and Local Consistency , 2021, 2021 IEEE International Conference on Multimedia and Expo (ICME).

[24]  Kai Han,et al.  Greedy Hash: Towards Fast Optimization for Accurate Hash Coding in CNN , 2018, NeurIPS.

[25]  Geoffrey E. Hinton,et al.  A Simple Framework for Contrastive Learning of Visual Representations , 2020, ICML.

[26]  Haofeng Zhang,et al.  Clustering-driven unsupervised deep hashing for image retrieval , 2019, Neurocomputing.

[27]  Dacheng Tao,et al.  DistillHash: Unsupervised Deep Hashing by Distilling Data Pairs , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Yann LeCun,et al.  Dimensionality Reduction by Learning an Invariant Mapping , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[29]  Jianmin Wang,et al.  Deep Cauchy Hashing for Hamming Space Retrieval , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[30]  Dezhong Peng,et al.  Contrastive Clustering , 2021, AAAI.

[31]  Jiwen Lu,et al.  Learning Compact Binary Descriptors with Unsupervised Deep Neural Networks , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[32]  Deng Cai,et al.  Density Sensitive Hashing , 2012, IEEE Transactions on Cybernetics.

[33]  Ling Shao,et al.  Unsupervised Deep Hashing With Pseudo Labels for Scalable Image Retrieval , 2018, IEEE Transactions on Image Processing.

[34]  Jingkuan Song,et al.  Binary Generative Adversarial Networks for Image Retrieval , 2017, AAAI.

[35]  Kaiming He,et al.  Momentum Contrast for Unsupervised Visual Representation Learning , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[36]  Mark J. Huiskes,et al.  The MIR flickr retrieval evaluation , 2008, MIR '08.

[37]  Phillip Isola,et al.  Understanding Contrastive Representation Learning through Alignment and Uniformity on the Hypersphere , 2020, ICML.

[38]  Heng Tao Shen,et al.  Hashing for Similarity Search: A Survey , 2014, ArXiv.

[39]  Heng Tao Shen,et al.  Unsupervised Deep Hashing with Similarity-Adaptive and Discrete Optimization , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[40]  Shih-Fu Chang,et al.  Spherical hashing , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[41]  Philip S. Yu,et al.  Deep Priority Hashing , 2018, ACM Multimedia.