On efficient storage space distribution among materialized views and indices in data warehousing environments

In data warehousing environments (DWEs), two major techniques are used to reduce the OLAP query processing cost: materialized views and indices. Therefore, the DW administrator (DWA) should select views to be materialized and indices according to the available storage capacity. The DWA has to run an algorithm to select views to be materialized according to the storage space reserved for views. After that he/she runs another algorithm to select indices over base relations and materialized views according to storage space reserved for indices. In this paper, we consider the problem of distributing storage space among materialized views and indexes to minimize total query processing cost. We develop a cost model for executing a set of frequently asked queries using both materialized views and indices. Furthermore, we develop an iterative approach to automatically distribute the storage space among views and indices for a DW. We extend this approach to handle dynamic DWEs. Finally, we evaluate our algorithm to show its applicability with an example taken from the Informix white paper.

[1]  Nick Roussopoulos,et al.  DynaMat: a dynamic view management system for data warehouses , 1999, SIGMOD '99.

[2]  Howard J. Karloff,et al.  On the complexity of the view-selection problem , 1999, PODS '99.

[3]  C. D. de Aguiar Ciferri,et al.  Materialized views in data warehousing environments , 2001, SCCC 2001. 21st International Conference of the Chilean Computer Science Society.

[4]  Wilburt Labio,et al.  Physical database design for data warehouses , 1997, Proceedings 13th International Conference on Data Engineering.

[5]  Timos K. Sellis,et al.  Data Warehouse Configuration , 1997, VLDB.

[6]  Mukesh K. Mohania,et al.  Making Aggregate Views Self-maintainable , 2000, Data Knowl. Eng..

[7]  Krithi Ramamritham,et al.  Curio: A Novel Solution for Efficient Storage and Indexing in Data Warehouses , 1999, VLDB.

[8]  Patrick E. O'Neil,et al.  Improved query performance with variant indexes , 1997, SIGMOD '97.

[9]  Kenneth A. Ross,et al.  Materialized view maintenance and integrity constraint checking: trading space for time , 1996, SIGMOD '96.

[10]  Surajit Chaudhuri,et al.  AutoAdmin “what-if” index analysis utility , 1998, SIGMOD '98.

[11]  Jian Yang,et al.  Algorithms for Materialized View Design in Data Warehousing Environment , 1997, VLDB.

[12]  Philip A. Bernstein,et al.  Using Semi-Joins to Solve Relational Queries , 1981, JACM.

[13]  Alberto O. Mendelzon,et al.  Maintaining data cubes under dimension updates , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[14]  Arie Segev,et al.  A Framework for Join Pattern Indexing in Intelligent Database Systems , 1995, IEEE Trans. Knowl. Data Eng..

[15]  T. Sellis Data Warehouse Connguration , 1997 .

[16]  Surajit Chaudhuri,et al.  Index merging , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[17]  Timos K. Sellis,et al.  Multiple-query optimization , 1988, TODS.

[18]  Qing Li,et al.  Evaluation of Materialized View Indexing in Data Warehousing Environments , 2000, DaWaK.