Unsupervised Cross-Modal Hashing with Soft Constraint

The booming demands for cross-modal retrieval tasks can often bring in its wake the development of retrieval technologies, as people turn to pursuing a more effective way to improve the performance of search results in both accuracy and efficiency, for example by unsupervised cross-modal hashing. It’s worth noting that most of the cross-modal hashing methods focus on utilizing merely one approach to generate hash codes. However, each approach has its own intrinsic drawback, which would inevitably diminish the quality of hash codes. In this paper, we propose a state-of-the-art model named Soft Constraint Hashing (SCH), using a special soft constraint term defined as an “information tunnel” to achieve the goal that conveys information from one approach to another. In particular, this “tunnel” can eliminate potential data noises to some extent and bridge the gap between two unsupervised discrete hashing allocation approaches to simultaneously reinforce the quality of hash codes. The empirical results on publicly available datasets illustrate that our proposed model outperforms all the existing unsupervised cross-model hashing methods.

[1]  Zi Huang,et al.  Linear cross-modal hashing for efficient multimedia search , 2013, ACM Multimedia.

[2]  Zi Huang,et al.  Multiple feature hashing for real-time large scale near-duplicate video retrieval , 2011, ACM Multimedia.

[3]  Wei Liu,et al.  Supervised Discrete Hashing , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Vijay V. Raghavan,et al.  Content-Based Image Retrieval Systems - Guest Editors' Introduction , 1995, Computer.

[5]  Mark J. Huiskes,et al.  The MIR flickr retrieval evaluation , 2008, MIR '08.

[6]  Zi Huang,et al.  Inter-media hashing for large-scale retrieval from heterogeneous data sources , 2013, SIGMOD '13.

[7]  Yue Gao,et al.  Large-Scale Cross-Modality Search via Collective Matrix Factorization Hashing , 2016, IEEE Transactions on Image Processing.

[8]  Antonio Torralba,et al.  LabelMe: A Database and Web-Based Tool for Image Annotation , 2008, International Journal of Computer Vision.

[9]  Jim Jing-Yan Wang,et al.  Supervised Cross-Modal Factor Analysis for Multiple Modal Data Classification , 2015, 2015 IEEE International Conference on Systems, Man, and Cybernetics.

[10]  Zhi-Hua Zhou,et al.  Column Sampling Based Discrete Supervised Hashing , 2016, AAAI.

[11]  Yizhou Wang,et al.  Quantized Correlation Hashing for Fast Cross-Modal Search , 2015, IJCAI.

[12]  Shuiwang Ji,et al.  SLEP: Sparse Learning with Efficient Projections , 2011 .

[13]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[14]  Shiming Xiang,et al.  Cross-Modal Hashing via Rank-Order Preserving , 2017, IEEE Transactions on Multimedia.

[15]  Ling Shao,et al.  Sequential Discrete Hashing for Scalable Cross-Modality Similarity Retrieval , 2017, IEEE Transactions on Image Processing.

[16]  Zhenan Sun,et al.  Fast Supervised Discrete Hashing , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Fei Wang,et al.  Composite hashing with multiple information sources , 2011, SIGIR.

[18]  Nikos Paragios,et al.  Data fusion through cross-modality metric learning using similarity-sensitive hashing , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[19]  Yang Yang,et al.  Zero-Shot Hashing via Transferring Supervised Knowledge , 2016, ACM Multimedia.