ResTune: Resource Oriented Tuning Boosted by Meta-Learning for Cloud Databases

Modern database management systems (DBMSs) expose tens to hundreds of critical performance tuning knobs that determine their runtime behavior. To reduce the total cost of ownership, cloud database providers invest substantial effort in automatically optimizing resource utilization by tuning these knobs. There are two challenges. First, the tuning system must always abide by the service level agreement (SLA) while optimizing resource utilization, which imposes strict constraints on the tuning process. Second, the tuning time must remain acceptable, since time-consuming tuning is impractical for production deployment and online troubleshooting. In this paper, we design ResTune to automatically optimize resource utilization without violating SLA constraints on throughput and latency. ResTune leverages tuning experience from historical tasks and transfers the accumulated knowledge to accelerate the tuning of new tasks. The prior knowledge from historical tuning tasks is represented by an ensemble model that learns the similarity between the historical workloads and the target workload, which significantly reduces the tuning time through a meta-learning based approach. ResTune efficiently handles different workloads and diverse hardware environments. We evaluate ResTune using benchmarks and real-world workloads on different types of resources. The results show that, compared with manually tuned configurations, ResTune reduces CPU utilization, I/O, and memory consumption by 65%, 87%, and 39% on average, respectively. Compared with state-of-the-art methods, ResTune finds better configurations with up to ~18x speedup.
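As a rough illustration of the approach the abstract describes (a sketch, not the authors' implementation), the snippet below combines per-task Gaussian-process surrogates into a similarity-weighted ensemble and scores candidate knob configurations with a constrained Expected Improvement that multiplies in the probability of meeting the throughput and latency SLAs. All names (`rank_weights`, `ensemble_predict`, `constrained_ei`), the scikit-learn surrogates, and the concrete weighting scheme are assumptions made for illustration only.

```python
# Minimal sketch of a similarity-weighted, SLA-constrained surrogate for knob tuning.
# Assumptions (not from the paper): scikit-learn GPs as base surrogates, per-task GPs
# already fitted on historical tuning data, and ranking-based similarity weights.

import numpy as np
from scipy.stats import norm
from sklearn.gaussian_process import GaussianProcessRegressor


def rank_weights(history_models, X_target, y_target):
    """Weight each historical surrogate by how well it preserves the ranking of the
    target task's observed resource-utilization values (hypothetical scheme)."""
    weights = []
    for gp in history_models:
        pred = gp.predict(X_target)
        # Count pairs whose predicted ordering agrees with the observed ordering.
        concordant = sum(
            (pred[i] < pred[j]) == (y_target[i] < y_target[j])
            for i in range(len(y_target)) for j in range(i + 1, len(y_target))
        )
        weights.append(concordant)
    w = np.asarray(weights, dtype=float)
    return w / w.sum() if w.sum() > 0 else np.ones(len(w)) / len(w)


def ensemble_predict(history_models, target_gp, weights, X, target_weight=1.0):
    """Combine historical surrogates and the target surrogate into one prediction.
    The std devs are averaged directly, a simplification of a proper mixture variance."""
    mus, sigmas = [], []
    for gp in list(history_models) + [target_gp]:
        mu, sigma = gp.predict(X, return_std=True)
        mus.append(mu)
        sigmas.append(sigma)
    w = np.append(weights, target_weight)
    w = w / w.sum()
    return np.average(mus, axis=0, weights=w), np.average(sigmas, axis=0, weights=w)


def constrained_ei(X, obj_mu, obj_sigma, best, tps_gp, lat_gp, tps_sla, lat_sla):
    """Expected improvement on resource utilization (to be minimized), multiplied by
    the probability that predicted throughput and latency satisfy the SLA."""
    z = (best - obj_mu) / np.maximum(obj_sigma, 1e-9)
    ei = obj_sigma * (z * norm.cdf(z) + norm.pdf(z))
    tps_mu, tps_sigma = tps_gp.predict(X, return_std=True)
    lat_mu, lat_sigma = lat_gp.predict(X, return_std=True)
    p_tps = 1.0 - norm.cdf((tps_sla - tps_mu) / np.maximum(tps_sigma, 1e-9))
    p_lat = norm.cdf((lat_sla - lat_mu) / np.maximum(lat_sigma, 1e-9))
    return ei * p_tps * p_lat


if __name__ == "__main__":
    # Toy demo with synthetic data: 3 knobs, 2 historical tasks, 5 target observations.
    rng = np.random.default_rng(0)
    X_hist = rng.uniform(size=(20, 3))
    history_models = [
        GaussianProcessRegressor().fit(X_hist, rng.uniform(size=20)) for _ in range(2)
    ]
    X_obs = rng.uniform(size=(5, 3))
    y_obs = rng.uniform(size=5)          # e.g., observed CPU utilization on the target
    target_gp = GaussianProcessRegressor().fit(X_obs, y_obs)
    w = rank_weights(history_models, X_obs, y_obs)
    X_cand = rng.uniform(size=(100, 3))  # candidate knob configurations
    mu, sigma = ensemble_predict(history_models, target_gp, w, X_cand)
    print("best candidate by ensemble mean:", X_cand[np.argmin(mu)])
```

Ranking-based weights are used here because they are insensitive to the absolute scale of the resource metric, which varies across workloads and hardware; this mirrors the similarity-learning idea in the abstract, though the paper's actual weighting scheme may differ.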
