Variable Granularity Space Filling Curve for Indexing Multidimensional Data

Efficiently accessing multidimensional data is a challenge for building modern database applications that involve many folds of data such as temporal, spatial, data warehousing, bio-informatics, etc. This problem stems from the fact that multidimensional data have no given order that preserves proximity. The majority of the existing solutions to this problem cannot be easily integrated into the current relational database systems since they require modifications to the kernel. A prominent class of methods that can use existing access structures are 'space filling curves'. In this study, we describe a method that is also based on the space filling curve approach, but in contrast to earlier methods, it connects regions of various sizes rather than points in multidimensional space. Our approach allows an efficient transformation of interval queries into regions of data that results in significant improvements when accessing the data. A detailed empirical study demonstrates that the proposed method outperforms the best available off-theshelf methods for accessing multidimensional data.

[1]  Christian Böhm,et al.  Implementation of Multidimensional Index Structures for Knowledge Discovery in Relational Databases , 1999, DaWaK.

[2]  Hans-Peter Kriegel,et al.  The X-tree : An Index Structure for High-Dimensional Data , 2001, VLDB.

[3]  D. B. Lomet,et al.  A robust multi-attribute search structure , 1989, [1989] Proceedings. Fifth International Conference on Data Engineering.

[4]  Christos Faloutsos,et al.  Multiattribute hashing using Gray codes , 1986, SIGMOD '86.

[5]  Oliver Günther,et al.  Multidimensional access methods , 1998, CSUR.

[6]  Christos H. Papadimitriou,et al.  On the analysis of indexing schemes , 1997, PODS '97.

[7]  Byunggu Yu,et al.  A retrieval technique for high-dimensional data and partially specified queries , 2002, Data Knowl. Eng..

[8]  Sharad Mehrotra,et al.  The hybrid tree: an index structure for high dimensional feature spaces , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[9]  Masao Sakauchi,et al.  The BD-Tree - A New N-Dimensional Data Structure with Highly Efficient Dynamic Characteristics , 1983, IFIP Congress.

[10]  Thomas Hamelryck,et al.  Efficient identification of side‐chain patterns using a multidimensional index tree , 2003, Proteins.

[11]  Volker Markl,et al.  The UB-Tree: Performance of Multidimensional Range Queries , 1998 .

[12]  I K Fodor,et al.  A Survey of Dimension Reduction Techniques , 2002 .

[13]  Hans-Jörg Schek,et al.  A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces , 1998, VLDB.

[14]  T. H. Merrett,et al.  A class of data structures for associative searching , 1984, PODS.

[15]  Abdul Sattar,et al.  Advanced indexing technique for temporal data , 2010, Comput. Sci. Inf. Syst..

[16]  Abdul Sattar,et al.  Indexing Temporal Data with Virtual Structure , 2010, ADBIS.

[17]  Christos Faloutsos,et al.  Fractals for secondary key retrieval , 1989, PODS.

[18]  Hanan Samet,et al.  The Quadtree and Related Hierarchical Data Structures , 1984, CSUR.