On computing temporal aggregates with range predicates

Computing temporal aggregates is an important but costly operation for applications that maintain time-evolving data (data warehouses, temporal databases, etc.) Due to the large volume of such data, performance improvements for temporal aggregate queries are critical. Previous approaches have aggregate predicates that involve only the time dimension. In this article we examine techniques to compute temporal aggregates that include key-range predicates as well (range-temporal aggregates). In particular we concentrate on the SUM aggregate, while COUNT is a special case. To handle arbitrary key ranges, previous methods would need to keep a separate index for every possible key range. We propose an approach based on a new index structure called the Multiversion SB-Tree, which incorporates features from both the SB-Tree and the Multiversion B+--tree, to handle arbitrary key-range temporal aggregate queries. We analyze the performance of our approach and present experimental results that show its efficiency. Furthermore, we address a novel and practical variation called functional range-temporal aggregates. Here, the value of any record is a function over time. The meaning of aggregates is altered such that the contribution of a record to the aggregate result is proportional to the size of the intersection between the record's time interval and the query time interval. Both analytical and experimental results show the efficiency of our result.

[1]  Christian S. Jensen,et al.  Temporal Data Management , 1999, IEEE Trans. Knowl. Data Eng..

[2]  Richard T. Snodgrass,et al.  Main Memory-Based Algorithms for Efficient Parallel Aggregation for Temporal Databases , 2004, Distributed and Parallel Databases.

[3]  Michael Ian Shamos,et al.  Computational geometry: an introduction , 1985 .

[4]  Dimitrios Gunopulos,et al.  Efficient aggregation over objects with extent , 2002, PODS '02.

[5]  Yufei Tao,et al.  Efficient temporal counting with bounded error , 2007, The VLDB Journal.

[6]  Vassilis J. Tsotras,et al.  Comparison of access methods for time-evolving data , 1999, CSUR.

[7]  Jennifer Widom,et al.  Incremental computation and maintenance of temporal aggregates , 2003, The VLDB Journal.

[8]  Yufei Tao,et al.  Range aggregate processing in spatial databases , 2004, IEEE Transactions on Knowledge and Data Engineering.

[9]  Christos Faloutsos,et al.  Approximate temporal aggregation , 2004, Proceedings. 20th International Conference on Data Engineering.

[10]  On computing temporal aggregates with range predicates , 2008 .

[11]  Richard T. Snodgrass,et al.  Computing temporal aggregates , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[12]  Dimitrios Gunopulos,et al.  Efficient computation of temporal aggregates with range predicates , 2001, PODS '01.

[13]  Richard T. Snodgrass,et al.  Parallel algorithms for computing temporal aggregates , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[14]  Gerhard Weikum,et al.  ACM Transactions on Database Systems , 2005 .

[15]  F. Frances Yao,et al.  Computational Geometry , 1991, Handbook of Theoretical Computer Science, Volume A: Algorithms and Complexity.

[16]  Bernard Chazelle,et al.  A Functional Approach to Data Structures and Its Use in Multidimensional Searching , 1988, SIAM J. Comput..

[17]  Bernhard Seeger,et al.  An asymptotically optimal multiversion B-tree , 1996, The VLDB Journal.

[18]  Myoung-Ho Kim,et al.  Effective Temporal Aggregation Using Point-Based Trees , 1999, DEXA.

[19]  Bongki Moon,et al.  Scalable algorithms for large temporal aggregation , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).

[20]  Yon Dohn Chung,et al.  An efficient method for temporal aggregation with range-condition attributes , 2004, Inf. Sci..

[21]  Xinfeng Ye,et al.  Processing temporal aggregates in parallel , 1997, 1997 IEEE International Conference on Systems, Man, and Cybernetics. Computational Cybernetics and Simulation.

[22]  Pankaj K. Agarwal,et al.  CRB-Tree: An Efficient Indexing Scheme for Range-Aggregate Queries , 2003, ICDT.