A Query Answering Greedy Algorithm for Selecting Materialized Views

Materialized views aim to improve the response time of analytical queries posed on a data warehouse. This entails that they contain information that provides answers to most future queries. The selection of such information from the data warehouse is referred to as view selection. View selection deals with selection of appropriate sets of views to improve the query response time. Several view selection algorithms exist in literature, most of them being greedy based. The greedy algorithm HRUA, which selects top-k views from a multidimensional lattice, is considered the most fundamental greedy based algorithm. It selects views having the highest benefit, computed in terms of size, for materialization. Though the views selected using HRUA are beneficial with respect to size, they may not account for a large number of future queries and may hence become an unnecessary overhead. This problem is addressed by the Query Answering Greedy Algorithm (QAGA) proposed in this paper. QAGA uses both the size of the view, and the frequency of previously posed queries answered by each view, to compute the profits of all views in each iteration. Thereafter it selects, from among them, the most profitable view for materialization. QAGA is able to select views which are beneficial with respect to size and have a greater likelihood of answering future queries. Further, experimental results show that QAGA, as compared to HRUA, is able to select views capable of answering greater number of queries. Though HRUA incurs a lower total cost of evaluating all the views, QAGA has a lower total cost of answering all the queries leading to an improvement in the average query response time. This in turn facilitates decision making.

[1]  Wolfgang Lehner,et al.  Improving query response time in scientific databases using data aggregation -a case study , 1996, Proceedings of 7th International Conference and Workshop on Database and Expert Systems Applications: DEXA 96.

[2]  T. V. Vijay Kumar,et al.  A Reduced Lattice Greedy Algorithm for Selecting Materialized Views , 2009, ICISTM.

[3]  Elena Baralis,et al.  Materialized Views Selection in a Multidimensional Database , 1997, VLDB.

[4]  W. H. Inmon,et al.  Building the data warehouse , 1992 .

[5]  Kamalakar Karlapalem,et al.  View Relevance Driven Materialized View Selection in Data Warehousing Environment , 2002, Australasian Database Conference.

[6]  Frada Burstein,et al.  Australian Journal of Information Systems , 2001 .

[7]  Salwani Abdullah,et al.  Great Deluge Algorithm for Rough Set Attribute Reduction , 2010, FGIT-DTA/BSBT.

[8]  T. V. Vijay Kumar,et al.  Greedy Selection of Materialized Views , 2009 .

[9]  Jérôme Darmont,et al.  Data mining-based materialized view and index selection in data warehouses , 2007, Journal of Intelligent Information Systems.

[10]  Surajit Chaudhuri,et al.  Automated Selection of Materialized Views and Indexes in SQL Databases , 2000, VLDB.

[11]  T. V. Vijay Kumar,et al.  Proposing Candidate Views for Materialization , 2010, ICISTM.

[12]  Xin Yao,et al.  Evolving materialized views in data warehouse , 1999, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406).

[13]  John F. Roddick,et al.  Advances and Research Directions in Data-Warehousing Technology , 1999, Australas. J. Inf. Syst..

[14]  Dimitri Theodoratos,et al.  A general framework for the view selection problem for data warehouse design and evolution , 2000, DOLAP '00.

[15]  M. T. Serna-Encinas,et al.  Algorithm for selection of materialized views: based on a costs model , 2007 .

[16]  Rada Chirkova,et al.  A formal perspective on the view selection problem , 2002, The VLDB Journal.

[17]  Kyuseok Shim,et al.  Including Group-By in Query Optimization , 1994, VLDB.

[18]  Nick Roussopoulos,et al.  Materialized views and data warehouses , 1998, SGMD.

[19]  Karthik Ramachandran,et al.  A Hybrid Approach for Data Warehouse View Selection , 2006, Int. J. Data Warehous. Min..

[20]  Jeffrey D. Ullman,et al.  Implementing data cubes efficiently , 1996, SIGMOD '96.

[21]  Toby J. Teorey,et al.  Achieving scalability in OLAP materialized view selection , 2002, DOLAP '02.

[22]  Goetz Graefe,et al.  Multi-table joins through bitmapped join indices , 1995, SGMD.

[23]  Jennifer Widom,et al.  Research problems in data warehousing , 1995, CIKM '95.

[24]  Ashish Gupta,et al.  Generalized Projections: A Powerful Approach To Aggregation , 1995 .

[25]  Inderpal Singh Mumick,et al.  Selection of Views to Materialize in a Data Warehouse , 2005, IEEE Trans. Knowl. Data Eng..

[26]  W. H. Inmon,et al.  Building the Data Warehouse,3rd Edition , 2002 .