The hB-pi* Tree: An Optimized Comprehensive Access Method for Frequent-Update Multi-dimensional Point Data

The R-tree [7] family is the most popular family of multi-dimensional index methods. However, R-tree index entries overlap, and index page fanout decreases rapidly as the data dimension increases. Furthermore, the R-tree has poor concurrency performance. For frequent-update multi-dimensional point data sets, the hB-pi tree [5] is a better choice than the R*-tree. But the hB-pi tree, like all other kd-tree based access methods, indexes the whole space regardless of whether a given sub-space contains any data. Indexing empty space (i.e., space with no data inside) leads to unnecessary data page accesses, and the number of such accesses grows with the dimension. This paper addresses the problem by proposing the hB-pi* tree, which efficiently indicates empty space and improves range query performance while preserving the hB-pi tree's high fanout and good concurrency. Our methods can be applied to any kd-tree based access method, and our claims are supported by extensive experimental evaluation.
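
The cost of indexing empty space can be illustrated with a small sketch. It is not the hB-pi* tree's actual encoding, which the abstract does not describe; it only assumes that each leaf page of a kd-style partition could carry a tight bounding box of the points it stores. The names `LeafPage`, `bounding_box`, and `pages_read` are hypothetical and introduced here purely for illustration.

```python
from dataclasses import dataclass
from typing import List, Tuple

Point = Tuple[float, ...]
Box = Tuple[Point, Point]  # (low corner, high corner)


def intersects(a: Box, b: Box) -> bool:
    """True if two axis-aligned boxes overlap in every dimension."""
    return all(al <= bh and bl <= ah
               for al, ah, bl, bh in zip(a[0], a[1], b[0], b[1]))


def bounding_box(points: List[Point]) -> Box:
    """Tight axis-aligned box around a non-empty list of points."""
    lows = tuple(min(coords) for coords in zip(*points))
    highs = tuple(max(coords) for coords in zip(*points))
    return (lows, highs)


@dataclass
class LeafPage:
    cell: Box            # region assigned by the kd-style partition
    points: List[Point]  # data actually stored on the page


def pages_read(pages: List[LeafPage], query: Box, use_data_box: bool) -> int:
    """Count data pages a range query would have to read.

    With use_data_box=False the partition cell decides (whole space indexed).
    With use_data_box=True a tight data box decides (assumed indicator; a real
    index would store it in the index entry instead of recomputing it).
    """
    count = 0
    for page in pages:
        if use_data_box:
            if not page.points:
                continue  # nothing stored, so nothing to read
            region = bounding_box(page.points)
        else:
            region = page.cell
        if intersects(region, query):
            count += 1
    return count


# The space [0,10] x [0,10] is split at x = 5; the data sits in two corners.
pages = [
    LeafPage(cell=((0.0, 0.0), (5.0, 10.0)), points=[(1.0, 1.0), (2.0, 2.0)]),
    LeafPage(cell=((5.0, 0.0), (10.0, 10.0)), points=[(8.0, 8.0), (9.0, 9.0)]),
]
query = ((4.0, 4.0), (6.0, 6.0))  # falls entirely in empty space

print(pages_read(pages, query, use_data_box=False))  # 2
print(pages_read(pages, query, use_data_box=True))   # 0
```

In this toy case the query touches both partition cells but no data, so the cell-based search reads two pages while the search with tight data boxes reads none.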

[1] David B. Lomet, et al. Access method concurrency with recovery, 1992, SIGMOD '92.

[2] Tian Xia, et al. Improving the R*-tree with outlier handling techniques, 2005, GIS '05.

[3] Georgios Evangelidis, et al. The hB$^\Pi$-tree: a multi-attribute index supporting concurrency, recovery and node consolidation, 1997, The VLDB Journal.

[4] Jeffrey Scott Vitter, et al. Bkd-Tree: A Dynamic Scalable kd-Tree, 2003, SSTD.

[5] J. T. Robinson, et al. The K-D-B-tree: a search structure for large multidimensional dynamic indexes, 1981, SIGMOD '81.

[6] Oliver Günther, et al. Multidimensional access methods, 1998, CSUR.

[7] Hans-Peter Kriegel, et al. The X-tree: An Index Structure for High-Dimensional Data, 2001, VLDB.

[8] Peter Widmayer, et al. The LSD tree: spatial access to multidimensional and non-point objects, 1989, VLDB.

[9] Jon Louis Bentley, et al. Multidimensional Binary Search Trees in Database Applications, 1979, IEEE Transactions on Software Engineering.

[10] Antonin Guttman, et al. R-trees: a dynamic index structure for spatial searching, 1984, SIGMOD '84.

[11] Chengyang Zhang, et al. Advances in Spatial and Temporal Databases, 2015, Lecture Notes in Computer Science.

[12] Andreas Henrich, et al. The LSD$^h$-tree: an access structure for feature vectors, 1998, Proceedings of the 14th International Conference on Data Engineering.

[13] Hans-Peter Kriegel, et al. The R*-tree: an efficient and robust access method for points and rectangles, 1990, SIGMOD '90.

[14] D. B. Lomet, et al. A robust multi-attribute search structure, 1989, Proceedings of the Fifth International Conference on Data Engineering.

[15] Ning An, et al. Improving Performance with Bulk-Inserts in Oracle R-Trees, 2003, VLDB.