In an OLAP system, data cubes (precomputed multidimensional views of the data) can be used to support real-time queries. Maintenance cost grows with the number of materialized cubes, so some cubes can be merged to reduce it; however, the resulting larger cubes increase the response time of some queries. To satisfy the maintenance bound and the response-time bound specified by the user, we may have to sacrifice some queries and exclude them from consideration. The optimization problem in data cube system design is to refine an initial set of cubes so that the system answers the maximum number of queries while satisfying both bounds. This problem is NP-complete. We propose two approximate algorithms, Greedy Removing and 2-Greedy Merging. Experiments on a census database show that our approach is both effective and efficient.
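The trade-off described above can be illustrated with a minimal sketch. It is not the Greedy Removing or 2-Greedy Merging algorithm; it only models the problem under stated assumptions: a cube and a query are each a set of dimension names, the maintenance cost is the number of cubes, and the response time of a cube is approximated by its maximum cell count. The dimension cardinalities, the `CARD` table, and all function names are hypothetical.

```python
from math import prod

# Hypothetical dimension cardinalities (illustrative only, not from the source).
CARD = {"age": 10, "sex": 2, "state": 50, "income": 20}

def cube_size(dims):
    """Upper bound on the number of cells in a cube over `dims`."""
    return prod(CARD[d] for d in dims)

def merge(a, b):
    """Merging two cubes yields one cube over the union of their dimensions."""
    return a | b

def answerable(query, cubes, time_bound):
    """A query is answerable if some cube covers all of its dimensions and
    that cube's size (a proxy for response time) is within the bound."""
    return any(query <= c and cube_size(c) <= time_bound for c in cubes)

def count_answered(queries, cubes, time_bound):
    """Number of queries the given cube set can answer within the bound."""
    return sum(answerable(q, cubes, time_bound) for q in queries)

queries = [
    frozenset({"age", "sex"}),       # cube size 20
    frozenset({"sex", "state"}),     # cube size 100
    frozenset({"state", "income"}),  # cube size 1000
]

# Initial design: one cube per query (3 cubes to maintain).
initial = list(queries)

# Merged design: the first two cubes become one (2 cubes to maintain),
# but the merged cube over {age, sex, state} has size 1000.
merged = [merge(initial[0], initial[1]), initial[2]]

for bound in (500, 1000):
    print(bound,
          count_answered(queries, initial, bound),
          count_answered(queries, merged, bound))
```

With a response-time bound of 500 cells, merging lowers the maintenance cost from three cubes to two but leaves none of the queries answerable within the bound; with a bound of 1000, the merged design answers all three. Searching over such merge and removal choices to maximize the number of answered queries under both bounds is the optimization problem the abstract refers to.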