论文信息 - Caching all plans with just one optimizer call

Caching all plans with just one optimizer call

Modern database management systems (DBMS) answer a multitude of complex queries on increasingly larger datasets. Given the complexities of the queries and the numerous design features, manual design is no longer an option. Instead, automatically designing the database is vital to maximize its performance and to reduce the total cost of ownership. For this purpose, commercial DBMS feature automated physical designers suggesting an efficient DB design by using the optimizer as a cost model. Unfortunately, consulting the optimizer is time-consuming, an effect which is typically counter-acted by drastically pruning the search space, thereby potentially missing the optimal solution. Recently techniques cache the optimizer's output and evaluate some plans with the cached results, reducing the number of calls to the optimizer. Still, however, the cost of invoking the optimizer to fill the cache is nontrivial, undermining scalability when running workloads with thousands of queries. In this paper, we use the intermediate optimization results in a dynamic programming based optimizer to reduce the cache initialization overhead. We demonstrate the accuracy and efficiency of our techniques by implementing them on the PostgreSQL open source query optimizer. For a star-schema workload, our techniques build the cost model 5 to 10 times faster than the conventional approach, while preserving accuracy.

Anastasia Ailamaki | Ioannis Alagiannis | Debabrata Dash | Cristina Maier

[1] Nicolas Bruno,et al. Configuration-parametric query optimization for physical design tuning , 2008, SIGMOD Conference.

[2] Kai-Uwe Sattler,et al. An Integrated Approach to Performance Monitoring for Autonomous Tuning , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[3] Patricia G. Selinger,et al. Access path selection in a relational database management system , 1979, SIGMOD '79.

[4] I-En Liao,et al. An index selection method without repeated optimizer estimations , 2009, Inf. Sci..

[5] Sam Lightstone,et al. DB2 Design Advisor: Integrated Automatic Physical Database Design , 2004, VLDB.

[6] Surajit Chaudhuri,et al. An Efficient Cost-Driven Index Selection Tool for Microsoft SQL Server , 1997, VLDB.

[7] Anastasia Ailamaki,et al. Efficient use of the query optimizer for automated physical design , 2007, VLDB 2007.

[8] Surajit Chaudhuri,et al. Automatic physical database tuning: a relaxation-based approach , 2005, SIGMOD '05.