Efficient Self-Join Algorithm in Interval-based Temporal Data Models

Interval-based temporal data model is a popular data model in temporal databases. It uses time intervals for representing the period of validity of a tuple, leading to unavoidable self-joins when combining tuples for objects. It requires k + 1-way self-join for k conjunctive conditions. Join operations are one of the most expensive operations in databases and they are even more serious in temporal databases because of growing data. There are many join algorithms for temporal databases. However, they focus on joining different inputs rather than an identical input, leading to multiple scans for the identical input. Advanced 2way join algorithms avoid a quadratic disk I/O complexity, but they are affected by the number of self-joins and partition sizes. In this paper, we address the problem of self-joins in the interval-based temporal data model and introduce a stream-based self-join algorithm. The proposed algorithm shows that it achieves a single relation scan for k-way self-join and its performance is not affected by partition sizes.

[1]  Raghu Ramakrishnan,et al.  Database Management Systems , 1976 .

[2]  Richard T. Snodgrass,et al.  The temporal query language TQuel , 1987, TODS.

[3]  Bernhard Seeger,et al.  Efficient temporal join processing using indices , 2002, Proceedings 18th International Conference on Data Engineering.

[4]  Goetz Graefe,et al.  Query evaluation techniques for large databases , 1993, CSUR.

[5]  Margaret H. Dunham,et al.  Join processing in relational databases , 1992, CSUR.

[6]  Thomas Seidl,et al.  Joining interval data in relational databases , 2004, SIGMOD '04.

[7]  Sushil Jajodia,et al.  Temporal Databases: Theory, Design, and Implementation , 1993 .

[8]  Christian S. Jensen,et al.  Join operations in temporal databases , 2005, The VLDB Journal.

[9]  Shamkant B. Navathe,et al.  Temporal Extensions to the Relational Model and SQL , 1993, Temporal Databases.

[10]  Shashi K. Gadia,et al.  Temporal Databases: A Prelude to Parametric Data , 1993, Temporal Databases.

[11]  Arie Segev,et al.  A glossary of temporal database concepts , 1992, SGMD.

[12]  R. Elmasri,et al.  Efficient temporal join processing using time index , 1996, Proceedings of 8th International Conference on Scientific and Statistical Data Base Management.

[13]  Christian S. Jensen,et al.  Efficient evaluation of the valid-time natural join , 1994, Proceedings of 1994 IEEE 10th International Conference on Data Engineering.