A novel locality-sensitive hashing algorithm for similarity searches on large-scale hyperspectral data

ABSTRACT Similarity search is a fundamental process in many hyperspectral remote sensing applications. In this article, we investigated the locality-sensitive hashing (LSH) algorithm for approximate similarity search on hyperspectral remote sensing data and proposed a new method that uses a regulated random hyperplane projection hash function family to index data and an adjacent graph-probing method for similarity queries. This method improves the performance of LSH over non-uniformly distributed hyperspectral data sets. Comparative experiments with three benchmark hyperspectral data sets showed that the proposed method is at least two times faster than the basic LSH and two other improved methods whilst achieving the same search accuracy. Moreover, results of the proposed method are proven equivalent to the accurate similarity search for real applications.

[1]  Philip S. Yu,et al.  Top-k Similarity Join in Heterogeneous Information Networks , 2015, IEEE Transactions on Knowledge and Data Engineering.

[2]  Xinpeng Zhang,et al.  A robust and fast video fingerprinting based on 3D-DCT and LSH , 2011, 2011 International Conference on Multimedia Technology.

[3]  Abdollah Homaifar,et al.  Satellite image retrieval using low memory locality sensitive hashing in Euclidean space , 2011, Earth Sci. Informatics.

[4]  Moses Charikar,et al.  Similarity estimation techniques from rounding algorithms , 2002, STOC '02.

[5]  Shiming Xiang,et al.  A Graph-Based Classification Method for Hyperspectral Images , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[6]  Alexandr Andoni,et al.  Near-Optimal Hashing Algorithms for Approximate Nearest Neighbor in High Dimensions , 2006, 2006 47th Annual IEEE Symposium on Foundations of Computer Science (FOCS'06).

[7]  Piotr Indyk,et al.  Similarity Search in High Dimensions via Hashing , 1999, VLDB.

[8]  Yang Gao,et al.  Dimensionality Reduction for Hyperspectral Data Based on Class-Aware Tensor Neighborhood Graph and Patch Alignment , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[9]  Nicole Immorlica,et al.  Locality-sensitive hashing scheme based on p-stable distributions , 2004, SCG '04.

[10]  Zhe Wang,et al.  Multi-Probe LSH: Efficient Indexing for High-Dimensional Similarity Search , 2007, VLDB.

[11]  Cédric Richard,et al.  A graph Laplacian regularization for hyperspectral data unmixing , 2014, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).