Deep top similarity hashing with class-wise loss for multi-label image retrieval

Abstract One of the major challenges of learning to hash in large-scale image retrieval is the projective transformation from raw image to binary space with preserving semantic similarity. Recently, several deep hashing methods show many excellent properties compared with traditional hashing based on hand-designed representation. However, most of the existing hashing models only pay attention to the semantic similarity between image pairs, ignoring the ranking information of retrieval results, which limits its performance. In this paper, a novel deep hashing framework, named Deep Top Similarity Hashing with Class-wise loss (DTSH-CW), is proposed to preserve semantic similarity between top images of ranking list and query images. In this proposed framework, CNNs architecture with batch normalization module is adopted to extract deep semantic characteristics. With integrating the position information of images, a top similarity loss is carefully designed to ensure the similarities between top images of ranking list and query images. Unlike pair-wise or triplet-wise loss, by directly leveraging the class labels, a cubic constraint based on Gaussian distribution is introduced to optimize objective function so as to maintain semantic variations of different classes. Furthermore, in order to solve discrete optimization problem, Two-Stage strategy is developed to provide efficient model training. Quantities of comparison experiments on three multi-label benchmark datasets show that our proposed DTSH-CW achieves promising performance compared to several state-of-the-art hashing methods.

[1]  Lijun Zhang,et al.  Semi-Supervised Deep Hashing with a Bipartite Graph , 2017, IJCAI.

[2]  Zheng Lin,et al.  Deep Supervised Hashing for Multi-Label and Large-Scale Image Retrieval , 2017, ICMR.

[3]  Ling Shao,et al.  Learning to Hash With Optimized Anchor Embedding for Scalable Retrieval , 2017, IEEE Transactions on Image Processing.

[4]  Chu-Song Chen,et al.  Supervised Learning of Semantics-Preserving Hash via Deep Convolutional Neural Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Mark J. Huiskes,et al.  The MIR flickr retrieval evaluation , 2008, MIR '08.

[6]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[7]  Heng Tao Shen,et al.  Exploring Auxiliary Context: Discrete Semantic Transfer Hashing for Scalable Image Retrieval , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[8]  Jianmin Wang,et al.  Deep Hashing Network for Efficient Similarity Retrieval , 2016, AAAI.

[9]  Yi Shi,et al.  Deep Supervised Hashing with Triplet Labels , 2016, ACCV.

[10]  Rongrong Ji,et al.  Supervised hashing with kernels , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Ling Shao,et al.  Unsupervised Deep Hashing With Pseudo Labels for Scalable Image Retrieval , 2018, IEEE Transactions on Image Processing.

[12]  Jaana Kekäläinen,et al.  Cumulated gain-based evaluation of IR techniques , 2002, TOIS.

[13]  Jingkuan Song,et al.  Binary Generative Adversarial Networks for Image Retrieval , 2017, AAAI.

[14]  Yu Qiao,et al.  A Discriminative Feature Learning Approach for Deep Face Recognition , 2016, ECCV.

[15]  Chun Chen,et al.  Scalable Image Retrieval by Sparse Product Quantization , 2016, IEEE Transactions on Multimedia.

[16]  Jaana Kekäläinen,et al.  IR evaluation methods for retrieving highly relevant documents , 2000, SIGIR '00.

[17]  Wu-Jun Li,et al.  Feature Learning Based Deep Supervised Hashing with Pairwise Labels , 2015, IJCAI.

[18]  Xinbo Gao,et al.  Triplet-Based Deep Hashing Network for Cross-Modal Retrieval , 2018, IEEE Transactions on Image Processing.

[19]  Zhiqiang Wei,et al.  A Novel Deep Hashing Method with Top Similarity for Image Retrieval , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[20]  James Philbin,et al.  FaceNet: A unified embedding for face recognition and clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Yuxin Peng,et al.  Unsupervised Generative Adversarial Cross-modal Hashing , 2017, AAAI.

[22]  Xiaoyang Tan,et al.  Learning Multilevel Semantic Similarity for Large-Scale Multi-Label Image Retrieval , 2018, ICMR.

[23]  Bohyung Han,et al.  Large-Scale Image Retrieval with Attentive Deep Local Features , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[24]  Lei Zhang,et al.  Bit-Scalable Deep Hashing With Regularized Similarity Learning for Image Retrieval and Person Re-Identification , 2015, IEEE Transactions on Image Processing.

[25]  Svetlana Lazebnik,et al.  Iterative quantization: A procrustean approach to learning binary codes , 2011, CVPR 2011.

[26]  Silvio Savarese,et al.  Deep Metric Learning via Lifted Structured Feature Embedding , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Shiguang Shan,et al.  Deep Supervised Hashing for Fast Image Retrieval , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Jiwen Lu,et al.  Deep hashing for compact binary codes learning , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  Hong Yan,et al.  Deep Class-Wise Hashing: Semantics-Preserving Hashing via Class-Wise Loss , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[30]  Yan Pan,et al.  Object-Location-Aware Hashing for Multi-Label Image Retrieval via Automatic Mask Learning , 2018, IEEE Transactions on Image Processing.

[31]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[32]  Nicole Immorlica,et al.  Locality-sensitive hashing scheme based on p-stable distributions , 2004, SCG '04.

[33]  Tie-Yan Liu,et al.  Adapting ranking SVM to document retrieval , 2006, SIGIR.

[34]  Zhi-Hua Zhou,et al.  Column Sampling Based Discrete Supervised Hashing , 2016, AAAI.

[35]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[36]  Yu Liu,et al.  Learning Deep Features via Congenerous Cosine Loss for Person Recognition , 2017, ArXiv.

[37]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[38]  Wei Liu,et al.  Supervised Discrete Hashing , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[39]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[40]  Jianmin Wang,et al.  Deep Quantization Network for Efficient Image Retrieval , 2016, AAAI.

[41]  Ian D. Reid,et al.  Fast Training of Triplet-Based Deep Binary Embedding Networks , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[42]  Alberto Del Bimbo,et al.  Compact Hash Codes for Efficient Visual Descriptors Retrieval in Large Scale Databases , 2016, IEEE Transactions on Multimedia.

[43]  Tong Li,et al.  Deep Multi-Similarity Hashing for Multi-label Image Retrieval , 2017, CIKM.

[44]  Jing Liu,et al.  Deep Incremental Hashing Network for Efficient Image Retrieval , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[45]  Hanjiang Lai,et al.  Simultaneous feature learning and hash coding with deep neural networks , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[46]  Philip S. Yu,et al.  HashNet: Deep Learning to Hash by Continuation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[47]  Heng Tao Shen,et al.  Unsupervised Deep Hashing with Similarity-Adaptive and Discrete Optimization , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[48]  Antonio Torralba,et al.  Spectral Hashing , 2008, NIPS.

[49]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[50]  Liqiang Nie,et al.  Fast Scalable Supervised Hashing , 2018, SIGIR.

[51]  Dapeng Tao,et al.  Constrained Discriminative Projection Learning for Image Classification , 2020, IEEE Transactions on Image Processing.

[52]  Jen-Hao Hsiao,et al.  Deep learning of binary hash codes for fast image retrieval , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[53]  Fei Yang,et al.  Web scale photo hash clustering on a single machine , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[54]  Manohar Paluri,et al.  Metric Learning with Adaptive Density Discrimination , 2015, ICLR.

[55]  Song Wang,et al.  Improved Deep Hashing With Soft Pairwise Similarity for Multi-Label Image Retrieval , 2018, IEEE Transactions on Multimedia.

[56]  David J. Fleet,et al.  Hamming Distance Metric Learning , 2012, NIPS.

[57]  Albert Gordo,et al.  End-to-End Learning of Deep Visual Representations for Image Retrieval , 2016, International Journal of Computer Vision.

[58]  Qi Tian,et al.  Deep hashing with top similarity preserving for image retrieval , 2017, Multimedia Tools and Applications.

[59]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[60]  Giorgos Tolias,et al.  Fine-Tuning CNN Image Retrieval with No Human Annotation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[61]  Chen Huang,et al.  Learning Deep Representation for Imbalanced Classification , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[62]  Hanjiang Lai,et al.  Supervised Hashing for Image Retrieval via Image Representation Learning , 2014, AAAI.

[63]  Zheng Lin,et al.  Deep Uniqueness-Aware Hashing for Fine-Grained Multi-Label Image Retrieval , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[64]  Yiu-ming Cheung,et al.  Triplet Fusion Network Hashing for Unpaired Cross-Modal Retrieval , 2019, ICMR.

[65]  Tieniu Tan,et al.  Deep semantic ranking based hashing for multi-label image retrieval , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).