Range sum queries in dynamic OLAP data cubes

Range sum queries play an important role in analyzing data in data cubes. Many application domains require that data cubes should be updated quite often to provide real time information. Previous techniques for range sum queries, however, can incur an update cost of O(n/sup d/) in the worst case, where d is the number of dimensions of the data cube and n is the size of each dimension. To address this dynamic data cube problem, a technique called double relative prefix sum (DRPS), was proposed which achieves a query cost of O(n/sup 1/3/) and an update cost of O(n/sup d/3/ ) in the worst case. The total cost of DRPS is the smallest compared with other techniques under two cost models. However, this technique causes considerable space overhead which is about n/sup d/+dn/sup d-1/3/. While low query cost and update cost are critical for analysis in dynamic OLAP data cubes, growing data collections increase the demand for space-efficient approaches. We propose a new technique which promises the same query cost and update cost as DRPS while the additional space requirement is only nd.

[1]  Sin Yeung Lee,et al.  Hierarchical Compact Cube for Range-Max Queries , 2000, VLDB.

[2]  Jeffrey D. Ullman,et al.  Index selection for OLAP , 1997, Proceedings 13th International Conference on Data Engineering.

[3]  Divyakant Agrawal,et al.  Using wavelet decomposition to support progressive and approximate range-sum queries over data cubes , 2000, CIKM '00.

[4]  D. Shasha,et al.  Hierarchically Split Cube Forests for Decision Support: description and tuned design , 1996 .

[5]  Jeffrey F. Naughton,et al.  On the Computation of Multidimensional Aggregates , 1996, VLDB.

[6]  Sin Yeung Lee,et al.  Range-Max/Min Query in OLAP Data Cube , 2000, DEXA.

[7]  Inderpal Singh Mumick,et al.  Maintenance of data cubes and summary tables in a warehouse , 1997, SIGMOD '97.

[8]  Nimrod Megiddo,et al.  Range queries in OLAP data cubes , 1997, SIGMOD '97.

[9]  Divyakant Agrawal,et al.  The Dynamic Data Cube , 2000, EDBT.

[10]  Kenneth A. Ross,et al.  Fast Computation of Sparse Datacubes , 1997, VLDB.

[11]  Divyakant Agrawal,et al.  Flexible Data Cubes for Online Aggregation , 2001, ICDT.

[12]  Divyakant Agrawal,et al.  pCube: Update-efficient online aggregation with progressive feedback and error bounds , 2000, Proceedings. 12th International Conference on Scientific and Statistica Database Management.

[13]  S. Muthukrishnan,et al.  Optimal and approximate computation of summary statistics for range aggregates , 2001, PODS '01.

[14]  Jeffrey F. Naughton,et al.  Storage Estimation for Multidimensional Aggregates in the Presence of Hierarchies , 1996, VLDB.

[15]  Jehoshua Bruck,et al.  Partial-Sum Queries in OLAP Data Cubes Using Covering Codes , 1998, IEEE Trans. Computers.

[16]  Sunita Sarawagi,et al.  Modeling multidimensional databases , 1997, Proceedings 13th International Conference on Data Engineering.

[17]  Dimitrios Gunopulos,et al.  Approximating multi-dimensional aggregate range queries over real attributes , 2000, SIGMOD '00.

[18]  Mirek Riedewald,et al.  The Iterative Data Cube , 2000 .

[19]  Jeffrey D. Ullman,et al.  Implementing data cubes efficiently , 1996, SIGMOD '96.

[20]  Viswanath Poosala,et al.  Fast approximate answers to aggregate queries on a data cube , 1999, Proceedings. Eleventh International Conference on Scientific and Statistical Database Management.

[21]  Hongjun Lu,et al.  Condensed cube: an effective approach to reducing data cube size , 2002, Proceedings 18th International Conference on Data Engineering.

[22]  Terence R. Smith,et al.  Relative prefix sums: an efficient approach for querying dynamic OLAP data cubes , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[23]  Jeffrey Scott Vitter,et al.  Approximate computation of multidimensional aggregates of sparse data using wavelets , 1999, SIGMOD '99.

[24]  Ju-Hong Lee,et al.  Dynamic Update Cube for Range-sum Queries , 2001, VLDB.

[25]  Jeffrey F. Naughton,et al.  An array-based algorithm for simultaneous multidimensional aggregates , 1997, SIGMOD '97.

[26]  Sharad Mehrotra,et al.  Progressive approximate aggregate queries with a multi-resolution tree structure , 2001, SIGMOD '01.

[27]  Cyrus Shahabi,et al.  How to evaluate multiple range-sum queries progressively , 2002, PODS '02.

[28]  Maria E. Orlowska,et al.  Range queries in dynamic OLAP data cubes , 2000, Data Knowl. Eng..

[29]  Divyakant Agrawal,et al.  Space-Efficient Data Cubes for Dynamic Environments , 2000, DaWaK.

[30]  Paul S. Bradley,et al.  Compressed data cubes for OLAP aggregate query approximation on continuous dimensions , 1999, KDD '99.

[31]  Yannis E. Ioannidis,et al.  Hierarchical Prefix Cubes for Range-Sum Queries , 1999, VLDB.