Using Semantics for Query Derivability in Data Warehouse Applications

Materialized summary tables and cached query results are frequently used to optimize aggregate queries in a data warehouse. Database systems incorporate query rewriting techniques to exploit these materialized views and thus avoid accessing the possibly huge raw data. A rewriting is possible only if the query is derivable from these views. Several approaches in the literature check derivability and find query rewritings. However, most algorithms find rewritings either only in very restricted cases or in complex cases that rarely occur in data warehouse environments. The specific application scenario of a data warehouse, with its multidimensional perspective, allows much more semantic information to be taken into account, e.g., structural dependencies within the dimension hierarchies and different characteristics of measures. This article uses this information to present simple conditions for derivability that cover a large number of relevant cases and go beyond previous approaches.
