An efficient bitmap indexing method for similarity search in high dimensional multimedia databases

The paper proposes a new indexing mechanism for similarity search in high-dimensional multimedia database; it quickly filters out irrelevant objects using a bitmap index, in which the characteristic of each object is approximated as a bit-string. The bits in a bitstring that are set to '1' denote the representative dimensions of an object that their attribute values are a relatively larger value than others. Since two objects are dissimilar if their representative dimensions are so much different, the degree of dissimilarity can be computed easily by XORing the bit-strings of two objects and counting the number of '1' s in the resulting bit-string. Experimental results with more than 100,000 images show that a remarkable speed-up can be obtained with the proposed indexing method compared to the VA-file and linear scan method because of the simple XORing operation in the filtering process, although there is some loss in search accuracy

[1]  Xiaoming Zhu,et al.  An efficient indexing method for nearest neighbor searches in high-dirnensional image databases , 2002, IEEE Trans. Multim..

[2]  Peter N. Yianilos,et al.  Data structures and algorithms for nearest neighbor search in general metric spaces , 1993, SODA '93.

[3]  Z. Meral Özsoyoglu,et al.  Distance-based indexing for high-dimensional metric spaces , 1997, SIGMOD '97.

[4]  Hans-Peter Kriegel,et al.  The X-tree : An Index Structure for High-Dimensional Data , 2001, VLDB.

[5]  Hans-Peter Kriegel,et al.  The R*-tree: an efficient and robust access method for points and rectangles , 1990, SIGMOD '90.

[6]  Jonathan Goldstein,et al.  When Is ''Nearest Neighbor'' Meaningful? , 1999, ICDT.

[7]  Patrick E. O'Neil,et al.  Improved query performance with variant indexes , 1997, SIGMOD '97.

[8]  Masatoshi Yoshikawa,et al.  The A-tree: An Index Structure for High-Dimensional Spaces Using Relative Approximation , 2000, VLDB.

[9]  Changzhou Wang,et al.  Indexing very high-dimensional sparse and quasi-sparse vectors for similarity searches , 2001, The VLDB Journal.

[10]  Pavel Zezula,et al.  M-tree: An Efficient Access Method for Similarity Search in Metric Spaces , 1997, VLDB.

[11]  Jae Soo Yoo,et al.  The HS File: A New Dynamic Signature File Method for Efficient Information Retrieval , 1994, DEXA.

[12]  Hans-Jörg Schek,et al.  A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces , 1998, VLDB.

[13]  Antonin Guttman,et al.  R-trees: a dynamic index structure for spatial searching , 1984, SIGMOD '84.