Self-adaptive Statistics Management for Efficient Query Processing

Consistently good performance required by mission-critical information systems has made it a pressing demand for self-tuning technologies in DBMSs. Automated Statistics management is an important step towards a self-tuning DBMS and plays a key role in improving the quality of execution plans generated by the optimizer, and hence leads to shorter query processing times. In this paper, we present SASM, a framework for Self-Adaptive Statistics Management where, using query feedback information, an appropriate set of histograms is recommended and refined, and through histogram refining and reconstruction, fixed amount of memory is dynamically distributed to histograms which are most useful to the current workload. Extensive experiments showed the effectiveness of our techniques.

[1]  Gerhard Weikum,et al.  Self-tuning Database Technology and Information Services: from Wishful Thinking to Viable Engineering , 2002, VLDB.

[2]  Volker Markl,et al.  LEO - DB2's LEarning Optimizer , 2001, VLDB.

[3]  Surajit Chaudhuri,et al.  Self-tuning histograms: building histograms without looking at data , 1999, SIGMOD '99.

[4]  Luis Gravano,et al.  STHoles: a multidimensional workload-aware histogram , 2001, SIGMOD '01.

[5]  Surajit Chaudhuri,et al.  Automated Selection of Materialized Views and Indexes in SQL Databases , 2000, VLDB.

[6]  George Kingsley Zipf,et al.  Human behavior and the principle of least effort , 1949 .

[7]  Paul Brown,et al.  CORDS: automatic discovery of correlations and soft functional dependencies , 2004, SIGMOD '04.

[8]  Surajit Chaudhuri,et al.  Index selection for databases: a hardness study and a principled heuristic solution , 2004, IEEE Transactions on Knowledge and Data Engineering.

[9]  Surajit Chaudhuri,et al.  Exploiting statistics on query expressions for optimization , 2002, SIGMOD '02.

[10]  Peter J. Haas,et al.  Automatic relationship discovery in self-managing database systems , 2004 .

[11]  Surajit Chaudhuri,et al.  An Efficient Cost-Driven Index Selection Tool for Microsoft SQL Server , 1997, VLDB.

[12]  Vivek R. Narasayya,et al.  Integrating vertical and horizontal partitioning into automated physical database design , 2004, SIGMOD '04.

[13]  Peter J. Haas,et al.  CORDS: Automatic Generation of Correlation Statistics in DB2 , 2004, VLDB.

[14]  Kai-Uwe Sattler,et al.  Autonomous query-driven index mining , 2004, Proceedings. International Database Engineering and Applications Symposium, 2004. IDEAS '04..

[15]  Jeffrey Scott Vitter,et al.  SASH: A Self-Adaptive Histogram Set for Dynamically Changing Workloads , 2003, VLDB.

[16]  Beng Chin Ooi,et al.  Global optimization of histograms , 2001, SIGMOD '01.

[17]  Yannis E. Ioannidis,et al.  The History of Histograms (abridged) , 2003, VLDB.

[18]  Peter J. Haas,et al.  Automated Statistics Collection in DB2 UDB , 2004, VLDB.

[19]  Xiaojing Li,et al.  Self-learning histograms for changing workloads , 2005, 9th International Database Engineering & Application Symposium (IDEAS'05).

[20]  Surajit Chaudhuri,et al.  Automating statistics management for query optimizers , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).