Cost models for overlapping and multi-version B-trees

Overlapping and multi-version techniques are two popular frameworks that transform an ephemeral index into a multiple logical-tree structure in order to support versioning databases. Although both frameworks have produced numerous efficient indexing methods, their performance analysis is rather limited; as a result, there is no clear understanding about the behavior of the alternative structures and the choice of the best one, given the data and query characteristics. Furthermore, query optimization based on these methods is currently impossible. These are serious problems due to the incorporation of overlapping and multi-version techniques in several traditional (e.g. banking) and emerging (e.g. spatio-temporal) applications. In this paper, we propose frameworks for reducing the performance analysis of overlapping and multi-version structures to that of the corresponding ephemeral structures, thus simplifying the problem significantly. The frameworks lead to accurate cost models that predict the sizes of the trees, the node accesses and query selectivity. Although we focus on B-tree-based structures, the proposed models can be employed with a variety of indexes.

[1]  Christos Faloutsos,et al.  Designing Access Methods for Bitemporal Databases , 1998, IEEE Trans. Knowl. Data Eng..

[2]  Vassilis J. Tsotras,et al.  Comparison of access methods for time-evolving data , 1999, CSUR.

[3]  Mario A. Nascimento,et al.  Towards historical R-trees , 1998, SAC '98.

[4]  Yannis Manolopoulos,et al.  Overlapping B+-Trees: An Implementation of a Transaction Time Access Method , 1999, Data Knowl. Eng..

[5]  Prashant Palvia,et al.  Approximating Block Accesses in Database Organizations , 1984, Inf. Process. Lett..

[6]  Bernd-Uwe Pagel,et al.  Towards an analysis of range query performance in spatial data structures , 1993, PODS '93.

[7]  Bernd-Uwe Pagel,et al.  Are window queries representative for arbitrary range queries? , 1996, PODS.

[8]  Bernhard Seeger,et al.  An asymptotically optimal multiversion B-tree , 1996, The VLDB Journal.

[9]  Timos K. Sellis,et al.  Efficient Cost Models for Spatial Queries Using R-Trees , 2000, IEEE Trans. Knowl. Data Eng..

[10]  Dimitrios Gunopulos,et al.  Efficient computation of temporal aggregates with range predicates , 2001, PODS '01.

[11]  Yannis Manolopoulos,et al.  Overlapping Linear Quadtrees and Spatio-Temporal Query Processing , 2000, Comput. J..

[12]  Bernhard Seeger,et al.  Query Processing Techniques for Multiversion Access Methods , 1996, VLDB.

[13]  Christos Faloutsos,et al.  On packing R-trees , 1993, CIKM '93.

[14]  David B. Lomet,et al.  The BT-tree: A Branched and Temporal Access Method , 2000, VLDB.

[15]  Rakesh M. Verma,et al.  An Efficient Multiversion Access STructure , 1997, IEEE Trans. Knowl. Data Eng..

[16]  F. Warren Burton,et al.  Multiple Generation Text Files Using Overlapping Tree Structures , 1985, Comput. J..

[17]  Dimitrios Gunopulos,et al.  Indexing Animated Objects Using Spatiotemporal Access Methods , 2001, IEEE Trans. Knowl. Data Eng..

[18]  F. Warren Burton,et al.  Implementation of Overlapping B-Trees for Time and Space Efficient Representation of Collections of Similar Files , 1990, Comput. J..

[19]  Yufei Tao,et al.  MV3R-Tree: A Spatio-Temporal Access Method for Timestamp and Interval Queries , 2001, VLDB.

[20]  Vassilis J. Tsotras,et al.  A Comparison of Access Methods for Temporal Data Title a Comparison of Access Methods for Temporal Data Individual Participants , 2022 .

[21]  Yannis Manolopoulos,et al.  Multiversion Linear Quadtree for Spatio-Temporal Data , 2000, ADBIS-DASFAA.