Query optimization by using derivability in a data warehouse environment

Materialized summary tables and cached query results are frequently used for the optimization of aggregate queries in a data warehouse. Query rewriting techniques are incorporated into database systems to use those materialized views and thus avoid the access of the possibly huge raw data. A rewriting is only possible if the query is derivable from these views. Several approaches can be found in the literature to check the derivability and find query rewritings. The specific application scenario of a data warehouse with its multidimensional perspective allows the consideration of much more semantic information, e.g. structural dependencies within the dimension hierarchies and different characteristics of measures. The motivation of this article is to use this information to present conditions for derivability in a large number of relevant cases which go beyond previous approaches.

[1]  Werner Nutt,et al.  Rewriting aggregate queries using views , 1999, PODS.

[2]  Jeffrey D. Ullman,et al.  Index selection for OLAP , 1997, Proceedings 13th International Conference on Data Engineering.

[3]  Luca Cabibbo,et al.  From a procedural to a visual query language for OLAP , 1998, Proceedings. Tenth International Conference on Scientific and Statistical Database Management (Cat. No.98TB100243).

[4]  Wolfgang Lehner,et al.  Normal forms for multidimensional databases , 1998, Proceedings. Tenth International Conference on Scientific and Statistical Database Management (Cat. No.98TB100243).

[5]  Per-Åke Larson,et al.  Eager Aggregation and Lazy Aggregation , 1995, VLDB.

[6]  Elena Baralis,et al.  Materialized Views Selection in a Multidimensional Database , 1997, VLDB.

[7]  Wolfgang Lehner,et al.  Management of multidimensional aggregates for efficient online analytical processing , 1999, Proceedings. IDEAS'99. International Database Engineering and Applications Symposium (Cat. No.PR00265).

[8]  Arie Shoshani,et al.  Summarizability in OLAP and statistical data bases , 1997, Proceedings. Ninth International Conference on Scientific and Statistical Database Management (Cat. No.97TB100150).

[9]  Jeffrey D. Ullman,et al.  Principles Of Database And Knowledge-Base Systems , 1979 .

[10]  Wolfgang Lehner,et al.  Modelling Large Scale OLAP Scenarios , 1998, EDBT.

[11]  Wolfgang Lehner,et al.  Set-Derivability of Multidimensional Aggregates , 1999, DaWaK.

[12]  Clement T. Yu,et al.  Automatic Knowledge Acquisition and Maintenance for Semantic Query Optimization , 1989, IEEE Trans. Knowl. Data Eng..

[13]  Werner Nutt,et al.  Deciding equivalences among aggregate queries , 1998, PODS '98.

[14]  Lionel M. Ni,et al.  Solving implication problems in database applications , 1989, SIGMOD '89.

[15]  Jeffrey D. Ullman,et al.  Implementing data cubes efficiently , 1996, SIGMOD '96.

[16]  Jeffrey D. Uuman Principles of database and knowledge- base systems , 1989 .

[17]  Patricia G. Selinger,et al.  Access path selection in a relational database management system , 1979, SIGMOD '79.

[18]  守屋 悦朗,et al.  J.E.Hopcroft, J.D. Ullman 著, "Introduction to Automata Theory, Languages, and Computation", Addison-Wesley, A5変形版, X+418, \6,670, 1979 , 1980 .

[19]  Wolfgang Lehner,et al.  Using Semantics for Query Derivability in Data Warehouse Applications , 2000, FQAS.

[20]  Jeffrey D. Ullman,et al.  Principles of Database and Knowledge-Base Systems, Volume II , 1988, Principles of computer science series.

[21]  Kyuseok Shim,et al.  Optimizing Queries with Aggregate Views , 1996, EDBT.

[22]  Meng Chang Chen,et al.  A Model of Summary Data and its Applications in Statistical Databases , 1988, SSDBM.

[23]  Peter Scheuermann,et al.  Dynamic caching of query results for decision support systems , 1999, Proceedings. Eleventh International Conference on Scientific and Statistical Database Management.

[24]  H. Sato Handling summary information in a database: derivability , 1981, SIGMOD '81.

[25]  Divesh Srivastava,et al.  Answering Queries with Aggregation Using Views , 1996, VLDB.

[26]  Per-Åke Larson,et al.  Computing Queries from Derived Relations , 1985, VLDB.

[27]  Sheldon J. Finkelstein Common expression analysis in database applications , 1982, SIGMOD '82.

[28]  Werner Nutt,et al.  Algorithms for Rewriting Aggregate Queries Using Views , 2000, DMDW.

[29]  Divesh Srivastava,et al.  Answering Queries Using Views. , 1999, PODS 1995.

[30]  Jiawei Han,et al.  Intelligent Query Answering by Knowledge Discovery Techniques , 1996, IEEE Trans. Knowl. Data Eng..

[31]  Jeffrey D. Ullman,et al.  Introduction to Automata Theory, Languages and Computation , 1979 .