Deep multilevel similarity hashing with fine-grained features for multi-label image retrieval

Abstract For multi-label image retrieval based on deep hashing, the ultimate challenge is to map from the original image to binary space while preserving high-level semantic similarity. Recently, many supervised deep hashing approaches for multi-label image retrieval have been proposed to generate high-quality binary codes. However, most such methods are only introduced to learn simple similarity based on these image characteristics, therein ignoring complex multilevel semantic similarity with fine-grained features. In this paper, we propose a framework named deep hashing with fine-grained feature learning (DH-FFL) to preserve complex multilevel semantic similarity between multi-label image pairs. In this proposed model, compact bilinear pooling convolutional neural networks (CNNs) with normalization are adopted to extract fine-grained feature descriptors. In addition, a novel multilevel contrastive loss is designed to preserve multilevel semantic similarity by introducing a zero-loss parameter. Moreover, a multi-label classification loss is used to maintain the unique semantic structure of each image and maximize the distinguishing ability of binary codes. Comprehensive experiments on three benchmark datasets show that the proposed DH-FFL achieves promising performance compared with other state-of-the-art multi-label image retrieval applications.

[1]  Cristian Sminchisescu,et al.  Semantic Segmentation with Second-Order Pooling , 2012, ECCV.

[2]  Zheng Lin,et al.  Deep Supervised Hashing for Multi-Label and Large-Scale Image Retrieval , 2017, ICMR.

[3]  Anastasios Tefas,et al.  Deep convolutional learning for Content Based Image Retrieval , 2018, Neurocomputing.

[4]  Svetlana Lazebnik,et al.  Iterative quantization: A procrustean approach to learning binary codes , 2011, CVPR 2011.

[5]  Shu Kong,et al.  Low-Rank Bilinear Pooling for Fine-Grained Classification , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Lei Zhang,et al.  Bit-Scalable Deep Hashing With Regularized Similarity Learning for Image Retrieval and Person Re-Identification , 2015, IEEE Transactions on Image Processing.

[7]  Mark J. Huiskes,et al.  The MIR flickr retrieval evaluation , 2008, MIR '08.

[8]  Kristen Grauman,et al.  Kernelized Locality-Sensitive Hashing , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Hanjiang Lai,et al.  Instance-Aware Hashing for Multi-Label Image Retrieval , 2016, IEEE Transactions on Image Processing.

[10]  Baoxin Li,et al.  Weakly Supervised Deep Image Hashing Through Tag Embeddings , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Jing Liu,et al.  Deep Incremental Hashing Network for Efficient Image Retrieval , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Jianmin Wang,et al.  Deep Hashing Network for Efficient Similarity Retrieval , 2016, AAAI.

[13]  Tat-Seng Chua,et al.  NUS-WIDE: a real-world web image database from National University of Singapore , 2009, CIVR '09.

[14]  Jen-Hao Hsiao,et al.  Deep learning of binary hash codes for fast image retrieval , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[15]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[16]  Yang Gao,et al.  Compact Bilinear Pooling , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Jianmin Wang,et al.  Deep Cauchy Hashing for Hamming Space Retrieval , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[18]  Chu-Song Chen,et al.  Supervised Learning of Semantics-Preserving Hash via Deep Convolutional Neural Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[20]  Wu-Jun Li,et al.  Feature Learning Based Deep Supervised Hashing with Pairwise Labels , 2015, IJCAI.

[21]  Nicu Sebe,et al.  A Survey on Learning to Hash , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Subhransu Maji,et al.  Bilinear CNN Models for Fine-Grained Visual Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[23]  David J. Fleet,et al.  Minimal Loss Hashing for Compact Binary Codes , 2011, ICML.

[24]  Dacheng Tao,et al.  DistillHash: Unsupervised Deep Hashing by Distilling Data Pairs , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Lei Zhu,et al.  Unsupervised Visual Hashing with Semantic Assistant for Content-Based Image Retrieval , 2017, IEEE Transactions on Knowledge and Data Engineering.

[26]  Ling-Yu Duan,et al.  DeepHash for Image Instance Retrieval: Getting Regularization, Depth and Fine-Tuning Right , 2017, ICMR.

[27]  Tao Mei,et al.  Deep Semantic-Preserving and Ranking-Based Hashing for Image Retrieval , 2016, IJCAI.

[28]  Jie Li,et al.  Unsupervised Semantic-Preserving Adversarial Hashing for Image Search , 2019, IEEE Transactions on Image Processing.

[29]  Hanjiang Lai,et al.  Supervised Hashing for Image Retrieval via Image Representation Learning , 2014, AAAI.

[30]  M. Goodale,et al.  Separate visual pathways for perception and action , 1992, Trends in Neurosciences.

[31]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[32]  Tieniu Tan,et al.  Deep semantic ranking based hashing for multi-label image retrieval , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[33]  Xuelong Li,et al.  Discrete Spectral Hashing for Efficient Similarity Retrieval , 2019, IEEE Transactions on Image Processing.

[34]  Abbes Amira,et al.  Content-based image retrieval with compact deep convolutional features , 2017, Neurocomputing.

[35]  Meng Wang,et al.  Neighborhood Discriminant Hashing for Large-Scale Image Retrieval , 2015, IEEE Transactions on Image Processing.

[36]  Jingkuan Song,et al.  Binary Generative Adversarial Networks for Image Retrieval , 2017, AAAI.

[37]  Zheng Lin,et al.  Deep Uniqueness-Aware Hashing for Fine-Grained Multi-Label Image Retrieval , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[38]  Hanjiang Lai,et al.  Simultaneous feature learning and hash coding with deep neural networks , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[39]  Philip S. Yu,et al.  HashNet: Deep Learning to Hash by Continuation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[40]  Wei Liu,et al.  Learning to Hash for Indexing Big Data—A Survey , 2015, Proceedings of the IEEE.

[41]  Jing-Ming Guo,et al.  Fusion of Deep Learning and Compressed Domain Features for Content-Based Image Retrieval , 2017, IEEE Transactions on Image Processing.

[42]  Xiaoyang Tan,et al.  Learning Multilevel Semantic Similarity for Large-Scale Multi-Label Image Retrieval , 2018, ICMR.

[43]  Peisong Wang,et al.  K-Nearest Neighbors Hashing , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[44]  Xinge You,et al.  Hierarchical Bilinear Pooling for Fine-Grained Visual Recognition , 2018, ECCV.

[45]  Yan Pan,et al.  Object-Location-Aware Hashing for Multi-Label Image Retrieval via Automatic Mask Learning , 2018, IEEE Transactions on Image Processing.

[46]  Tong Li,et al.  Deep Multi-Similarity Hashing for Multi-label Image Retrieval , 2017, CIKM.

[47]  Nicole Immorlica,et al.  Locality-sensitive hashing scheme based on p-stable distributions , 2004, SCG '04.

[48]  Wei Liu,et al.  Multimedia Hashing and Networking , 2016, IEEE MultiMedia.

[49]  Zhenan Sun,et al.  Local Semantic-Aware Deep Hashing With Hamming-Isometric Quantization , 2019, IEEE Transactions on Image Processing.

[50]  Wei Liu,et al.  Fast Structural Binary Coding , 2016, IJCAI.