PartJoin: An Efficient Storage and Query Execution for Data Warehouses

The performance of OLAP queries can be improved drastically if the warehouse data is properly selected and indexed. The problems of selecting and materializing views and indexing data have been studied extensively in the data warehousing environment. On the other hand, data partitioning can also greatly increase the performance of queries. Data partitioning has advantage over data selection and indexing since the former one does not require additional storage requirement. In this paper, we show that it is beneficial to integrate the data partitioning and indexing (join indexes) techniques for improving the performance of data warehousing queries. We present a data warehouse tuning strategy, called PartJoin, that decomposes the fact and dimension tables of a star schema and then selects join indexes. This solution takes advantage of these two techniques, i.e., data partitioning and indexing. Finally, we present the results of an experimental evaluation that demonstrates the effectiveness of our strategy in reducing the query processing cost and providing an economical utilisation of the storage space.

[1]  Surajit Chaudhuri,et al.  An Efficient Cost-Driven Index Selection Tool for Microsoft SQL Server , 1997, VLDB.

[2]  Qing Li,et al.  Evaluation of Materialized View Indexing in Data Warehousing Environments , 2000, DaWaK.

[3]  Patrick Valduriez,et al.  Principles of Distributed Database Systems, Second Edition , 1999 .

[4]  Patrick E. O'Neil,et al.  Improved query performance with variant indexes , 1997, SIGMOD '97.

[5]  Mark Allen Weiss,et al.  On Satisfiability, Equivalence, and Impication Problems Involving Conjunctive Queries in Database Systems , 1996, IEEE Trans. Knowl. Data Eng..

[6]  Mukesh K. Mohania,et al.  What can partitioning do for your data warehouses and data marts? , 2000, Proceedings 2000 International Database Engineering and Applications Symposium (Cat. No.PR00789).

[7]  C. D. de Aguiar Ciferri,et al.  Materialized views in data warehousing environments , 2001, SCCC 2001. 21st International Conference of the Chilean Computer Science Society.