论文信息 - Efficient Parameter-Free Adaptive Multi-Modal Hashing

Efficient Parameter-Free Adaptive Multi-Modal Hashing

Unsupervised multi-modal hashing has recently attracted broad attention in research area of large-scale multimedia retrieval for its low storage cost, high retrieval speed, and independence on semantic labels. However, the model learning process of existing methods still suffer from the problem of low efficiency: 1) Many existing methods measure the contributions of different modalities using fixed modality weights. In order to avoid over-fitting, they need an inefficient hyper-parameter adjustment process. 2) Most existing methods adopt inefficient optimization strategies to learn hash codes. In this letter, we propose an unsupervised Efficient Parameter-free Adaptive Multi-modal Hashing (EPAMH) model to adaptively capture the modality variations and preserve the discriminative semantics of multi-modal features into the binary hash codes. Moreover, we directly learn the binary codes with simple and efficient operations, which prevents the relaxing quantization errors and improves the model learning efficiency. Experiments prove the superior performance of EPAMH on three public multimedia retrieval datasets. Our source codes and testing datasets can be obtained at https://github.com/ChaoqunZheng/EPAMH.

Huaxiang Zhang | Lei Zhu | Chaoqun Zheng | Shusen Zhang

[1] Wu-Jun Li,et al. Deep Cross-Modal Hashing , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2] Mark J. Huiskes,et al. The MIR flickr retrieval evaluation , 2008, MIR '08.

[3] Zi Huang,et al. Discrete Multimodal Hashing With Canonical Views for Robust Mobile Landmark Search , 2017, IEEE Transactions on Multimedia.

[4] Tat-Seng Chua,et al. NUS-WIDE: a real-world web image database from National University of Singapore , 2009, CIVR '09.

[5] Devraj Mandal,et al. Generalized Semantic Preserving Hashing for Cross-Modal Retrieval , 2019, IEEE Transactions on Image Processing.

[6] Svetlana Lazebnik,et al. Iterative quantization: A procrustean approach to learning binary codes , 2011, CVPR 2011.

[7] Jingjing Li,et al. Unsupervised Deep Cross-modal Hashing with Virtual Label Regression , 2020, Neurocomputing.

[8] Wei Liu,et al. Discrete Graph Hashing , 2014, NIPS.

[9] Weiwei Liu,et al. Multiview Discrete Hashing for Scalable Multimedia Search , 2018, ACM Trans. Intell. Syst. Technol..

[10] Xuelong Li,et al. Large Graph Hashing with Spectral Rotation , 2017, AAAI.

[11] Xuelong Li,et al. Learning Discriminative Binary Codes for Large-scale Cross-modal Retrieval , 2017, IEEE Transactions on Image Processing.

[12] Heng Tao Shen,et al. Efficient Supervised Discrete Multi-View Hashing for Large-Scale Multimedia Search , 2020, IEEE Transactions on Multimedia.

[13] Meng Wang,et al. Multi-View Object Retrieval via Multi-Scale Topic Models , 2016, IEEE Transactions on Image Processing.

[14] Meng Wang,et al. Coherent Semantic-Visual Indexing for Large-Scale Image Retrieval in the Cloud , 2017, IEEE Transactions on Image Processing.

[15] Rui Yang,et al. Discrete Multi-view Hashing for Effective Image Retrieval , 2017, ICMR.

[16] Pascal Fua,et al. LDAHash: Improved Matching with Smaller Descriptors , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17] Zi Huang,et al. Effective Multiple Feature Hashing for Large-Scale Near-Duplicate Video Retrieval , 2013, IEEE Transactions on Multimedia.

[18] Xianglong Liu,et al. Multiple feature kernel hashing for large-scale visual search , 2014, Pattern Recognit..

[19] Heng Tao Shen,et al. Exploring Auxiliary Context: Discrete Semantic Transfer Hashing for Scalable Image Retrieval , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[20] Heng Tao Shen,et al. Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[21] Huaxiang Zhang,et al. Flexible Multi-modal Hashing for Scalable Multimedia Retrieval , 2020, ACM Trans. Intell. Syst. Technol..

[22] Huaxiang Zhang,et al. Flexible Online Multi-modal Hashing for Large-scale Multimedia Retrieval , 2019, ACM Multimedia.

[23] Ling Shao,et al. Multiview Alignment Hashing for Efficient Image Search , 2015, IEEE Transactions on Image Processing.

[24] Lei Zhu,et al. Online Multi-modal Hashing with Dynamic Query-adaption , 2019, SIGIR.

[25] Raghavendra Udupa,et al. Learning Hash Functions for Cross-View Similarity Search , 2011, IJCAI.

[26] Wei Liu,et al. Supervised Discrete Hashing , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27] Fumin Shen,et al. Multi-view Latent Hashing for Efficient Multimedia Search , 2015, ACM Multimedia.