Discriminative deep metric learning for asymmetric discrete hashing

Abstract Thanks to its storage and computation efficiency, hashing, as a nearest neighbor search method, facilitates large-scale data processing in recent vision and learning studies. In particular, deep supervised hashing methods have significantly improved retrieval performance compared with non-deep supervised hashing methods. However, most existing deep supervised hashing methods approximate the similarity between two images with the Hamming distance between the outputs of the same hash function, i.e., the symmetric strategy. Consequently, these symmetric hashing methods are typically time-consuming to train, and they can hardly take full advantage of the supervised information in a large-scale database. In this paper, we propose a novel discriminative deep metric learning approach for asymmetric discrete hashing (ADMH) for supervised hash learning. ADMH integrates an asymmetric strategy with a deep metric learning method to learn the hash function for the query images and the discrete hash codes for the database images directly. More specifically, we train a deep neural network to extract the features of the query images. Subsequently, we introduce a metric learning scheme for learning the discrete discriminative hash codes of the database images directly. Finally, the feature learning procedure for generating the hash codes of the query images and the discrete coding procedure for generating the hash codes of the database images are integrated into an end-to-end learning framework. Extensive experiments on various benchmark datasets show that the proposed asymmetric deep hashing method outperforms existing hashing methods.
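The asymmetric strategy described above can be illustrated with a minimal sketch. It is not the ADMH training procedure: the random linear projection `W` is only a stand-in for the deep network that hashes queries, the random matrix `db_codes` is only a stand-in for the learned discrete database codes, and all names (`hash_query`, `hamming_from_inner_product`, the sizes) are assumptions made for illustration. The sketch shows only the retrieval-time asymmetry: queries pass through a hash function, while database items are represented by stored binary codes, and the two are compared by Hamming distance.

```python
import numpy as np

def hamming_from_inner_product(query_codes, db_codes):
    """Hamming distance between {-1,+1} codes via d = (c - <u, v>) / 2."""
    c = query_codes.shape[1]
    # c - <u, v> is always even for {-1,+1} codes, so integer division is exact.
    return (c - query_codes @ db_codes.T) // 2

rng = np.random.default_rng(0)
n_db, dim, bits = 6, 8, 4  # hypothetical sizes

# Database side: codes are stored directly as a binary matrix.
# (Assumption: random codes stand in for the learned discrete codes.)
db_codes = rng.choice([-1, 1], size=(n_db, bits))

# Query side: codes come from a hash function applied to features.
# (Assumption: a random linear projection stands in for the deep network.)
W = rng.standard_normal((dim, bits))

def hash_query(features):
    """Binarize projected features into {-1,+1} codes."""
    return np.where(features @ W >= 0, 1, -1)

queries = rng.standard_normal((2, dim))
q_codes = hash_query(queries)

# Rank database items for each query by ascending Hamming distance.
dist = hamming_from_inner_product(q_codes, db_codes)
ranking = np.argsort(dist, axis=1, kind="stable")
```

Because only the query side requires a forward pass through a hash function, the database codes can be treated as free binary variables during training, which is the property the asymmetric strategy relies on.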
