Fast Asymmetric and Discrete Cross-Modal Hashing With Semantic Consistency

Hashing has attracted widespread attention in the field of supervised cross-modal retrieval due to its advantages in search and storage. However, there are still some issues to be addressed, e.g.: 1) how to effectively combine sample and label semantics to learn hash codes; 2) how to reduce high computational requirements brought by computing a pairwise similarity matrix; and 3) how to effectively solve discrete optimization problems. To cope with them, a fast asymmetric and discrete cross-modal hashing (FADCH) method is proposed in this article. First, matrix factorization is leveraged to collaboratively construct a common semantic subspace between different modalities. Second, semantic consistency is preserved by aligning the common semantic subspace with the semantic representation constructed from labels, which effectively exploits the semantic complementarity of labels and samples. Third, we embed labels into hash codes and keep the correlation between different modal samples by using a pairwise similarity matrix. Fourth, we use an asymmetric strategy with relaxation to associate hash codes with semantic representation, which not only avoids the difficulty of symmetric frame optimization but also embeds more semantic information into the Hamming space. In addition, a strongly orthogonal constraint is introduced to optimize the hash codes. Finally, an effective optimization algorithm is developed to directly generate discrete hash codes while reducing the complexity from $O(n^{2})$ to $O(n)$ . The experimental results on three benchmark datasets illustrate the superiority of the FADCH method.

[1]  Shaohua Teng,et al.  Semantic-guided hashing learning for domain adaptive retrieval , 2022, World Wide Web.

[2]  Bin Wang,et al.  Bi-CMR: Bidirectional Reinforcement Guided Hashing for Effective Cross-Modal Retrieval , 2022, AAAI.

[3]  Young-joo Chung,et al.  Efficient Cross-Modal Retrieval via Deep Binary Hashing and Quantization , 2022, BMVC.

[4]  Xiao-Ming Wu,et al.  Discrete online cross-modal hashing , 2022, Pattern Recognit..

[5]  Jingkuan Song,et al.  BATCH: A Scalable Asymmetric Discrete Cross-Modal Hashing , 2021, IEEE Transactions on Knowledge and Data Engineering.

[6]  Zi Huang,et al.  Exploiting Subspace Relation in Semantic Labels for Cross-Modal Hashing , 2021, IEEE Transactions on Knowledge and Data Engineering.

[7]  Xiao-Jun Wu,et al.  Robust and discrete matrix factorization hashing for cross-modal retrieval , 2021, Pattern Recognit..

[8]  Yang Gao,et al.  Adaptive Label Correlation Based Asymmetric Discrete Hashing for Cross-Modal Retrieval , 2021, IEEE Transactions on Knowledge and Data Engineering.

[9]  Shaohua Teng,et al.  Discrete matrix factorization hashing for cross-modal retrieval , 2021, International Journal of Machine Learning and Cybernetics.

[10]  Yilong Yin,et al.  Deep Multiscale Fusion Hashing for Cross-Modal Retrieval , 2021, IEEE Transactions on Circuits and Systems for Video Technology.

[11]  Zhikui Chen,et al.  UCMH: Unpaired cross-modal hashing with matrix factorization , 2020, Neurocomputing.

[12]  Dongling Zhang,et al.  Scalable Discrete Matrix Factorization and Semantic Autoencoder for Cross-Media Retrieval , 2020, IEEE Transactions on Cybernetics.

[13]  Yilong Yin,et al.  Fast Unmediated Hashing for Cross-Modal Retrieval , 2020, IEEE Transactions on Circuits and Systems for Video Technology.

[14]  Min Meng,et al.  Asymmetric Supervised Consistent and Specific Hashing for Cross-Modal Retrieval , 2020, IEEE Transactions on Image Processing.

[15]  Long Lan,et al.  Semantic-consistent cross-modal hashing for large-scale image retrieval , 2020, Neurocomputing.

[16]  Jingjing Li,et al.  Fast Discrete Collaborative Multi-Modal Hashing for Large-Scale Multimedia Retrieval , 2020, IEEE Transactions on Knowledge and Data Engineering.

[17]  Yong Xu,et al.  Dynamic Double Classifiers Approximation for Cross-Domain Recognition , 2020, IEEE Transactions on Cybernetics.

[18]  Zhihai He,et al.  Semantic deep cross-modal hashing , 2020, Neurocomputing.

[19]  Xin Luo,et al.  SCRATCH: A Scalable Discrete Matrix Factorization Hashing Framework for Cross-Modal Retrieval , 2020, IEEE Transactions on Circuits and Systems for Video Technology.

[20]  Li Liu,et al.  Deep semantic cross modal hashing with correlation alignment , 2020, Neurocomputing.

[21]  Tao Yao,et al.  Fast discrete cross-modal hashing with semantic consistency , 2020, Neural Networks.

[22]  Jun Wang,et al.  Strongly Constrained Discrete Hashing , 2020, IEEE Transactions on Image Processing.

[23]  Qiang Ni,et al.  Joint Image-Text Hashing for Fast Large-Scale Cross-Media Retrieval Using Self-Supervised Deep Learning , 2019, IEEE Transactions on Industrial Electronics.

[24]  Jie Yang,et al.  Cluster-wise Unsupervised Hashing for Cross-Modal Similarity Search , 2019, Pattern Recognit..

[25]  Liqiang Nie,et al.  A Two-Step Cross-Modal Hashing by Exploiting Label Correlations and Preserving Similarity in Both Steps , 2019, ACM Multimedia.

[26]  Chunguang Li,et al.  Discrete Double-Bit Hashing , 2019, IEEE Transactions on Big Data.

[27]  Xinbo Gao,et al.  Label Consistent Matrix Factorization Hashing for Large-Scale Cross-Modal Similarity Search , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  Zi Huang,et al.  Scalable Supervised Asymmetric Hashing With Semantic and Latent Factor Embedding , 2019, IEEE Transactions on Image Processing.

[29]  Qi Tian,et al.  Efficient Discrete Supervised Hashing for Large-scale Cross-modal Retrieval , 2019, Neurocomputing.

[30]  Yilong Yin,et al.  Fast Discrete Cross-modal Hashing With Regressing From Semantic Labels , 2018, ACM Multimedia.

[31]  Jungong Han,et al.  Unsupervised Deep Hashing via Binary Latent Factor Models for Large-scale Cross-modal Retrieval , 2018, IJCAI.

[32]  Xin-Shun Xu,et al.  Scalable Supervised Discrete Hashing for Large-Scale Search , 2018, WWW.

[33]  Zhenan Sun,et al.  Fast Supervised Discrete Hashing , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Jungong Han,et al.  Cross-View Retrieval via Probability-Based Semantics-Preserving Hashing , 2017, IEEE Transactions on Cybernetics.

[35]  Pengfei Zhang,et al.  Semi-Relaxation Supervised Hashing for Cross-Modal Retrieval , 2017, ACM Multimedia.

[36]  Wu-Jun Li,et al.  Discrete Latent Factor Model for Cross-Modal Hashing , 2017, IEEE Transactions on Image Processing.

[37]  Rongrong Ji,et al.  Cross-Modality Binary Code Learning via Fusion Similarity Hashing , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[38]  Gaofeng Meng,et al.  AMVH: Asymmetric Multi-Valued hashing , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[39]  Xuelong Li,et al.  Learning Discriminative Binary Codes for Large-scale Cross-modal Retrieval , 2017, IEEE Transactions on Image Processing.

[40]  Guiguang Ding,et al.  Collective Matrix Factorization Hashing for Multimodal Data , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[41]  Dongqing Zhang,et al.  Large-Scale Supervised Multimodal Hashing with Semantic Correlation Maximization , 2014, AAAI.

[42]  Tat-Seng Chua,et al.  NUS-WIDE: a real-world web image database from National University of Singapore , 2009, CIVR '09.

[43]  Mark J. Huiskes,et al.  The MIR flickr retrieval evaluation , 2008, MIR '08.

[44]  Yimu Wang,et al.  Deep Unified Cross-Modality Hashing by Pairwise Data Alignment , 2021, IJCAI.

[45]  Huan Zhao,et al.  Learning a maximized shared latent factor for cross-modal hashing , 2021, Knowl. Based Syst..

[46]  Song Wang,et al.  Supervised Matrix Factorization Hashing With Quantitative Loss for Image-Text Search , 2020, IEEE Access.

[47]  Devraj Mandal,et al.  Generalized Semantic Preserving Hashing for Cross-Modal Retrieval , 2019, IEEE Transactions on Image Processing.