Enhancement of Data Aggregation Using A Novel Point Access Method

The B + -tree and its variants have been reported as the good index structures for retrieving data. Database systems frequently establish the B + -tree style indices for fast access to data records. However, traditional B + -tree index could be a performance bottleneck because of its inflatable hierarchy. Many works focus on improving indexing techniques. In fact, the optimization of data organization inside index nodes is the most critical factor to improve retrieval quality. Some handles like pre-partition of data space, node splitting by force, node splitting with unbalanced partition, and node splitting upon overflow loading always burden index structures with plenty of storage space and building overhead. In this paper, we propose a new index scheme to highly aggregate the external structure in a B + -tree. It also adopts a better splitting policy to completely remove the suffering from data insertion orders. Our new index technique can compress data records in leaves and in turn reduce index size to improve query performance. In addition, the entire index's space utilization is promoted to a higher level; thereby the index's space requirement become s smaller and easily resides in memory.

[1]  Kenneth A. Ross,et al.  Making B+-Trees Cache Conscious in Main Memory , 2000, SIGMOD Conference.

[2]  Todd C. Mowry,et al.  Improving index performance through prefetching , 2001, SIGMOD '01.

[3]  Hans-Peter Kriegel,et al.  The Buddy Effect: An efficient and robust access method for spatial data base systems , 1990 .

[4]  David J. DeWitt,et al.  DBMSs on a Modern Processor: Where Does Time Go? , 1999, VLDB.

[5]  Hans J. Berliner,et al.  The B* Tree Search Algorithm: A Best-First Proof Procedure , 1979, Artif. Intell..

[6]  Ratko Orlandic,et al.  On accessing data in high-dimensional spaces: A comparative study of three space partitioning strategies , 2004, J. Syst. Softw..

[7]  Byunggu Yu,et al.  KDB/sub KD/-tree: a compact KDB-tree structure for indexing multidimensional data , 2003, Proceedings ITCC 2003. International Conference on Information Technology: Coding and Computing.

[8]  Christos Faloutsos,et al.  The TV-tree: An index structure for high-dimensional data , 1994, The VLDB Journal.

[9]  Byunggu Yu,et al.  A retrieval technique for high-dimensional data and partially specified queries , 2002, Data Knowl. Eng..

[10]  Jignesh M. Patel,et al.  Effect of node size on the performance of cache-conscious B+-trees , 2003, SIGMETRICS '03.

[11]  Andrew Chi-Chih Yao,et al.  On random 2–3 trees , 1978, Acta Informatica.

[12]  Hans-Peter Kriegel,et al.  The R*-tree: an efficient and robust access method for points and rectangles , 1990, SIGMOD '90.

[13]  Kenneth A. Ross,et al.  Making B+- trees cache conscious in main memory , 2000, SIGMOD '00.

[14]  Peter Widmayer,et al.  The LSD tree: spatial access to multidimensional and non-point objects , 1989, VLDB 1989.

[15]  Myoung-Ho Kim,et al.  Restructuring the concurrent B+-tree with non-blocked search operations , 2002, Inf. Sci..

[16]  Beng Chin Ooi,et al.  Efficient Indexing of High-Dimensional Data Through Dimensionality Reduction , 2000, Data Knowl. Eng..

[17]  Michael J. Carey,et al.  Performance of B+ tree concurrency control algorithms , 1993, The VLDB Journal.

[18]  David J. DeWitt,et al.  Weaving Relations for Cache Performance , 2001, VLDB.

[19]  Gonzalo Navarro,et al.  A compact space decomposition for effective metric indexing , 2005, Pattern Recognit. Lett..

[20]  Martin L. Kersten,et al.  Database Architecture Optimized for the New Bottleneck: Memory Access , 1999, VLDB.

[21]  Beng Chin Ooi,et al.  iDistance: An adaptive B+-tree based indexing method for nearest neighbor search , 2005, TODS.