Comparison of access methods for time-evolving data

This paper compares different indexing techniques proposed for supporting efficient access to temporal data. The comparison is based on a collection of important performance criteria, including the space consumed, update processing, and query time for representative queries. The comparison is based on worst-case analysis, hence no assumptions on data distribution or query frequencies are made. When a number of methods have the same asymptotic worst-case behavior, features in the methods that affect average case behavior are discussed. Additional criteria examined are the pagination of an index, the ability to cluster related data together, and the ability to efficiently separate old from current data (so that larger archival storage media such as write-once optical disks can be used). The purpose of the paper is to identify the difficult problems in accessing temporal data and describe how the different methods aim to solve them. A general lower bound for answering basic temporal queries is also introduced.

[1]  llsoo Ahn,et al.  Temporal Databases , 1986, Computer.

[2]  Richard R. Muntz,et al.  Stream Processing: Temporal Query Processing and Optimization , 1993, Temporal Databases.

[3]  Malcolm C. Easton,et al.  Key-Sequence Data Sets on Inedible Storage , 1986, IBM J. Res. Dev..

[4]  Ramez Elmasri,et al.  The Time Index: An Access Structure for Temporal Data , 1990, VLDB.

[5]  Yannis Manolopoulos,et al.  Overlapping B+trees for temporal data , 1990, Proceedings of the 5th Jerusalem Conference on Information Technology, 1990. 'Next Decade in Information Technology'.

[6]  Shamkant B. Navathe,et al.  A Temporal Relational Model and a Query Language , 1989, Inf. Sci..

[7]  Hans-Peter Kriegel,et al.  The R*-tree: an efficient and robust access method for points and rectangles , 1990, SIGMOD '90.

[8]  Jacov Ben-Zvi,et al.  The time relational model , 1982 .

[9]  Arie Segev,et al.  Physical organization of temporal data , 1987, 1987 IEEE Third International Conference on Data Engineering.

[10]  Margaret H. Dunham,et al.  A Multiple Tree Mapping-Based Approach for Range Indexing , 1996 .

[11]  Ramez Elmasri,et al.  Efficient implementation techniques for the time index , 1991, [1991] Proceedings. Seventh International Conference on Data Engineering.

[12]  David B. Lomet,et al.  Exploiting A History Database for Backup , 1993, VLDB.

[13]  Alberto O. Mendelzon,et al.  Similarity-based queries , 1995, PODS '95.

[14]  Curtis P. Kolovson Indexing techniques for historical databases , 1989, [1989] Proceedings. Fifth International Conference on Data Engineering.

[15]  Christian S. Jensen,et al.  R-Tree Based Indexing of Now-Relative Bitemporal Data , 1998, VLDB.

[16]  Vassilis J. Tsotras,et al.  The Snapshot Index: An I/O-optimal access method for timeslice queries , 1995, Inf. Syst..

[17]  Michael Stonebraker,et al.  The Design of the POSTGRES Storage System , 1988, VLDB.

[18]  Jeffrey Scott Vitter,et al.  Optimal dynamic interval management in external memory , 1996, Proceedings of 37th Conference on Foundations of Computer Science.

[19]  Christos Faloutsos,et al.  Efficient Similarity Search In Sequence Databases , 1993, FODO.

[20]  Christos Faloutsos,et al.  Designing Access Methods for Bitemporal Databases , 1998, IEEE Trans. Knowl. Data Eng..

[21]  Arie Segev,et al.  Efficient Indexing Methods for Temporal Relations , 1993, IEEE Trans. Knowl. Data Eng..

[22]  Miron Livny,et al.  The Design and Implementation of a Sequence Database System , 1996, VLDB.

[23]  Jeffrey Scott Vitter,et al.  An efficient I/O interface for optical disks , 1985, TODS.

[24]  Vassilis J. Tsotras,et al.  Efficient Algorithms for Managing the History of Evolving Databases , 1990, ICDT.

[25]  Michael H. Böhlen,et al.  Temporal database system implementations , 1995, SGMD.

[26]  Andreas Reuter,et al.  Transaction Processing: Concepts and Techniques , 1992 .

[27]  Vassilis J. Tsotras,et al.  Efficient Management of Time-Evolving Databases , 1995, IEEE Trans. Knowl. Data Eng..

[28]  H. Edelsbrunner A new approach to rectangle intersections part I , 1983 .

[29]  Sridhar Ramaswamy,et al.  Path caching (extended abstract): a technique for optimal external searching , 1994, PODS '94.

[30]  Witold Litwin,et al.  Linear Hashing: A new Algorithm for Files and Tables Addressing , 1980, ICOD.

[31]  David J. DeWitt,et al.  Object and File Management in the EXODUS Extensible Database System , 1986, VLDB.

[32]  Christian S. Jensen,et al.  Incremental Implementation Model for Relational Databases with Transaction Time , 1991, IEEE Trans. Knowl. Data Eng..

[33]  Arie Shoshani,et al.  Temporal Data Management , 1986, VLDB.

[34]  Clu-istos Foutsos,et al.  Fast subsequence matching in time-series databases , 1994, SIGMOD '94.

[35]  Christian S. Jensen,et al.  Temporal Data Management , 1999, IEEE Trans. Knowl. Data Eng..

[36]  Ramez Elmasri,et al.  Declustering techniques for parallelizing temporal access structures , 1994, Proceedings of 1994 IEEE 10th International Conference on Data Engineering.

[37]  Michael Stonebraker,et al.  Segment indexes: dynamic indexing techniques for multi-dimensional interval data , 1991, SIGMOD '91.

[38]  Richard R. Muntz,et al.  Temporal Query Processing and Optimization in Multiprocessor Database Machines , 1992, VLDB.

[39]  Gerhard Weikum,et al.  LoT: Dynamic Declustering of TSB-Tree Nodes for Parallel Access to Temporal Data , 1996, EDBT.

[40]  Ramez Elmasri,et al.  A consensus glossary of temporal database concepts , 1994, SGMD.

[41]  Gerhard Weikum,et al.  A Log-Structured History Data Access Method (LHAM) , 1993, HPTS.

[42]  Christos Faloutsos,et al.  Hilbert R-tree: An Improved R-tree using Fractals , 1994, VLDB.

[43]  Rolf Klein,et al.  Priority Search Trees in Secondary Memory (Extended Abstract) , 1987, WG.

[44]  Friedhelm Meyer auf der Heide,et al.  Dynamic perfect hashing: upper and lower bounds , 1988, [Proceedings 1988] 29th Annual Symposium on Foundations of Computer Science.

[45]  Roger G. Johnson,et al.  Extending relational algebra to manipulate temporal data , 1988, Inf. Syst..

[46]  Christos Faloutsos,et al.  Access Methods for Bi-Temporal Databases , 1995, Temporal Databases.

[47]  David B. Lomet,et al.  Using timestamping to optimize two phase commit , 1993, [1993] Proceedings of the Second International Conference on Parallel and Distributed Information Systems.

[48]  Sridhar Ramaswamy,et al.  Indexing for data models with constraints and classes (extended abstract) , 1993, PODS '93.

[49]  共立出版株式会社 コンピュータ・サイエンス : ACM computing surveys , 1978 .

[50]  Sridhar Ramaswamy Efficient Indexing for Constraint and Temporal Databases , 1997, ICDT.

[51]  Arie Segev,et al.  A consensus glossary of temporal database concepts , 1994, SIGMOD 1994.

[52]  Bernhard Seeger,et al.  A Generic Approach to Bulk Loading Multidimensional Index Structures , 1997, VLDB.

[53]  Betty Salzberg,et al.  File Structures: An Analytic Approach , 1988 .

[54]  Robert E. Tarjan,et al.  Making data structures persistent , 1986, STOC '86.

[55]  J. T. Robinson,et al.  The K-D-B-tree: a search structure for large multidimensional dynamic indexes , 1981, SIGMOD '81.

[56]  David B. Lomet,et al.  The performance of a multiversion access method , 1990, SIGMOD '90.

[57]  Oliver Günther,et al.  The design of the cell tree: an object-oriented index structure for geometric databases , 1989, [1989] Proceedings. Fifth International Conference on Data Engineering.

[58]  Richard T. Snodgrass,et al.  Partitioned storage for temporal databases , 1988, Inf. Syst..

[59]  Rakesh M. Verma,et al.  Efficient Archivable Time Index: A Dynamic Indexing Scheme for Temporal Data , 1994 .

[60]  Betty Salzberg,et al.  Timestamping after commit , 1994, Proceedings of 3rd International Conference on Parallel and Distributed Information Systems.

[61]  David B. Lomet,et al.  The hB-tree: a multiattribute indexing method with good guaranteed performance , 1990, TODS.

[62]  Christian S. Jensen,et al.  An extensible notation for spatiotemporal index queries , 1998, SGMD.

[63]  Roberto Tamassia,et al.  Dynamic algorithms in computational geometry , 1992, Proc. IEEE.

[64]  Christos Faloutsos,et al.  The R+-Tree: A Dynamic Index for Multi-Dimensional Objects , 1987, VLDB.

[65]  Ralf Hartmut Güting,et al.  XP-Trees: External Priority Search Trees , 1990 .

[66]  J TsotrasVassilis,et al.  Comparison of access methods for time-evolving data , 1999 .

[67]  Christos Faloutsos,et al.  Fast subsequence matching in time-series databases , 1994, SIGMOD '94.

[68]  Kurt Mehlhorn,et al.  Data Structures and Algorithms 3: Multi-dimensional Searching and Computational Geometry , 2012, EATCS Monographs on Theoretical Computer Science.

[69]  Ronald L. Rivest,et al.  Partial-Match Retrieval Algorithms , 1976, SIAM J. Comput..

[70]  Richard T. Snodgrass,et al.  A taxonomy of time databases , 1985, SIGMOD Conference.

[71]  SellisTimos,et al.  Using differential techniques to efficiently support transaction time , 1993, VLDB 1993.

[72]  Ramez Elmasri,et al.  The time index+: an incremental access structure for temporal databases , 1994, CIKM '94.

[73]  Peter Dadam,et al.  Designing DBMS support for the temporal dimension , 1984, SIGMOD '84.

[74]  Christos H. Papadimitriou,et al.  On the analysis of indexing schemes , 1997, PODS '97.

[75]  David B. Lomet,et al.  Access methods for multiversion data , 1989, SIGMOD '89.

[76]  Gultekin Özsoyoglu,et al.  Temporal and Real-Time Databases: A Survey , 1995, IEEE Trans. Knowl. Data Eng..

[77]  Bernhard Seeger,et al.  An asymptotically optimal multiversion B-tree , 1996, The VLDB Journal.

[78]  Rakesh M. Verma,et al.  An Efficient Multiversion Access STructure , 1997, IEEE Trans. Knowl. Data Eng..

[79]  Carlo Zaniolo,et al.  Temporal aggregation in active database rules , 1997, SIGMOD '97.

[80]  Richard R. Muntz,et al.  Generalized data stream indexing and temporal query processing , 1992, [1992 Proceedings] Second International Workshop on Research Issues on Data Engineering: Transaction and Query Processing.

[81]  F. Warren Burton,et al.  Multiple Generation Text Files Using Overlapping Tree Structures , 1985, Comput. J..

[82]  Anil Kumar,et al.  Temporal Database Bibliography Update , 1996, SIGMOD Rec..

[83]  Edward M. McCreight,et al.  Priority Search Trees , 1985, SIAM J. Comput..

[84]  Eric Mays,et al.  Fully persistent B+-trees , 1991, SIGMOD '91.

[85]  Antonin Guttman,et al.  R-trees: a dynamic index structure for spatial searching , 1984, SIGMOD '84.

[86]  Ramez Elmasri,et al.  M-IVTT: An Index for Bitemporal Databases , 1996, DEXA.

[87]  Z. Meral Özsoyoglu,et al.  Indexing Transaction Time Databases , 1998, Inf. Sci..

[88]  Ramez Elmasri,et al.  The Time Index and the Monotonic B+-tree , 1993, Temporal Databases.

[89]  Arie Segev,et al.  Event-Join Optimization in Temporal Relational Databases , 1989, VLDB.

[90]  J. Ian Munro,et al.  Sorting and Searching in Multisets , 1976, SIAM J. Comput..

[91]  Bernard Chazelle,et al.  Filtering search: A new approach to query-answering , 1983, 24th Annual Symposium on Foundations of Computer Science (sfcs 1983).