Computing closest common subexpressions for view selection problems

Selecting a set of views for materialization is a required task in many current database and data warehousing applications, including the design of a data warehouse and the maintenance of multiple materialized views. The selected views can be materialized permanently or transiently, depending on the specific view selection problem. View selection algorithms are expensive because of the size of the problem's search space. In this paper, we propose an approach that generates candidate views for materialization for view selection problems, based on the definitions of the input queries. We also provide rewritings of the input queries using the generated candidate views. In generating candidate views, we do not apply cost-based techniques; instead, we try to maximize the number of operations in the views. Subsequently, view selection algorithms can exploit problem-dependent cost functions to choose among the generated candidate views. Our approach is not restricted to a specific view selection problem. Compared to a previous approach, it generates views that involve more relation occurrences (or operations) and can reduce the size of the search space, which can be very large. We implement our approach and report an experimental evaluation that compares it with previous work.
