Variable Sized Partitions for Range Query Algorithms

A range query applies an aggregation operation over all selected cells of an OLAP data cube, where the selection is specified by a range of contiguous values for each dimension. Much prior work has focused on efficiently computing range-sum or range-max queries. Most of these algorithms use a uniform partitioning scheme for the data cube. In this paper, we improve on the query costs of some of these existing algorithms by exploiting two observations. First, end-user range queries usually follow repetitive query patterns, from which a variable-sized partitioning scheme for the data cube can be derived. Query costs are reduced because pre-computed results are retrieved for entire partitions, rather than computed for partial regions of many partitions, which requires a large number of cell accesses to the data cube. Second, data in the data cube can be arranged so that each partition is stored in as few physical storage blocks as possible, thus reducing the I/O cost of answering range queries.
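The first observation can be illustrated with a minimal one-dimensional sketch. It is not the paper's algorithm: the function names, the toy data, and the cost model (counting only cells read from partially covered partitions) are assumptions made for illustration. Each partition stores a pre-computed sum, and a range-sum query reuses that sum whenever the partition lies entirely inside the query range.

```python
from bisect import bisect_right

def build_partitions(data, boundaries):
    """Pre-compute one sum per partition.

    boundaries: sorted partition start indices, beginning with 0 (assumed format).
    Returns (ends, sums), where partition i covers data[boundaries[i]:ends[i]].
    """
    ends = boundaries[1:] + [len(data)]
    sums = [sum(data[s:e]) for s, e in zip(boundaries, ends)]
    return ends, sums

def range_sum(data, boundaries, ends, sums, lo, hi):
    """Sum data[lo:hi] (half-open). Returns (total, cell_accesses)."""
    total = 0
    accesses = 0
    i = bisect_right(boundaries, lo) - 1  # partition containing index lo
    while i < len(boundaries) and boundaries[i] < hi:
        s, e = boundaries[i], ends[i]
        if lo <= s and e <= hi:
            total += sums[i]              # fully covered: reuse the pre-computed sum
        else:
            a, b = max(s, lo), min(e, hi)
            total += sum(data[a:b])       # partially covered: read individual cells
            accesses += b - a
        i += 1
    return total, accesses

data = list(range(1, 13))                 # toy "cube" with 12 cells

# Uniform partitions of width 4.
uniform = [0, 4, 8]
u_ends, u_sums = build_partitions(data, uniform)

# Variable-sized partitions, assuming users repeatedly query the range [3, 9).
variable = [0, 3, 9]
v_ends, v_sums = build_partitions(data, variable)

uniform_result = range_sum(data, uniform, u_ends, u_sums, 3, 9)
variable_result = range_sum(data, variable, v_ends, v_sums, 3, 9)
```

Under these assumptions, both layouts return the same sum for the query [3, 9), but the uniform layout has to read cells from two partially covered partitions, while the query-aligned layout answers from a single pre-computed partition sum with no cell accesses.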
