SiLi Index: Data Structure for Fast Vector Space Searching

Nearest neighbor queries in high-dimensional spaces are expensive. In this article, we propose a method of building and querying a stand-alone data structure, SiLi (Similarity List) Index, which supports approximating the results of k-NN queries in high-dimensional spaces, while using a significantly reduced amount of system memory and processor time compared to the usual brute-force search methods.