Double-Coding Density Sensitive Hashing

This paper proposes a double-coding density sensitive hashing (DCDSH) method. DCDSH accomplishes approximate nearest neighbor (ANN) search tasks based on its double coding scheme. First, DCDSH generates real-valued hash codes by projecting objects along the principle hyper-planes. These hyper-planes are determined by principle distributions and geometric structures of data set. Second, DCDSH derives binary hash codes based on these real-valued hash codes. Real-valued hash codes can avoid undesirable partition of objects in low density areas and effectively improve representation capability and discriminating power. Binary codes contribute to query speed owing to the low complexity for computing hamming distance. DCDSH integrates the advantages of these two kinds of hash codes. Experimental results on large scale high dimensional data show that the proposed DCDSH exhibits superior performance compared to several state-of-the-art hashing methods.

[1]  Ling Shao,et al.  Sequential Discrete Hashing for Scalable Cross-Modality Similarity Retrieval , 2017, IEEE Transactions on Image Processing.

[2]  Deng Cai,et al.  Density Sensitive Hashing , 2012, IEEE Transactions on Cybernetics.

[3]  Qiang Huang,et al.  Reverse Query-Aware Locality-Sensitive Hashing for High-Dimensional Furthest Neighbor Search , 2017, 2017 IEEE 33rd International Conference on Data Engineering (ICDE).

[4]  Xiangjian He,et al.  Unsupervised Video Hashing by Exploiting Spatio-Temporal Feature , 2016, ICONIP.

[5]  Hongtao Lu,et al.  Local Linear Spectral Hashing , 2013, ICONIP.

[6]  Anirban Dasgupta,et al.  Fast locality-sensitive hashing , 2011, KDD.