论文信息 - Automated Vertical Partitioning with Deep Reinforcement Learning

Automated Vertical Partitioning with Deep Reinforcement Learning

Finding the right vertical partitioning scheme to match a workload is one of the essential database optimization problems. With the proper partitioning, queries and management tasks can skip unnecessary data, improving their performance. Algorithmic approaches are common for determining a partitioning scheme, with solutions being shaped by their choice of cost models and pruning heuristics. In spite of their advantages, these can be inefficient since they don’t improve with experience (e.g., learning from errors in cost estimates or heuristics employed). In this paper we consider the feasibility of a general machine learning solution to overcome such drawbacks. Specifically, we extend the work in GridFormation, mapping the partitioning task to a reinforcement learning (RL) task. We validate our proposal experimentally using a TPC-H database and workload, HDD cost models and the Google Dopamine framework for deep RL. We report early evaluations using 3 standard DQN agents, establishing that agents can match the results of state-of-the-art algorithms. We find that convergence is easily achievable for single table-workload pairs, but that generalizing to random workloads requires further work. We also report competitive runtimes for our agents on both GPU and CPU inference, outperforming some state-of-the-art algorithms, as the number of attributes in a table increases.

[1] Carsten Binnig,et al. Towards learning a partitioning advisor with deep reinforcement learning , 2019, aiDM@SIGMOD.

[2] Olga Papaemmanouil,et al. Deep Reinforcement Learning for Join Order Enumeration , 2018, aiDM@SIGMOD.

[3] Olga Papaemmanouil,et al. Towards a Hands-Free Query Optimizer through Deep Learning , 2018, CIDR.

[4] Jens Dittrich,et al. The Case for Automatic Database Administration using Deep Reinforcement Learning , 2018, ArXiv.

[5] Rémi Munos,et al. Implicit Quantile Networks for Distributional Reinforcement Learning , 2018, ICML.

[6] Gunter Saake,et al. GridFormation: Towards Self-Driven Online Data Partitioning using Reinforcement Learning , 2018, aiDM@SIGMOD.

[7] Tom Schaul,et al. Rainbow: Combining Improvements in Deep Reinforcement Learning , 2017, AAAI.

[8] Vivek R. Narasayya,et al. Integrating vertical and horizontal partitioning into automated physical database design , 2004, SIGMOD '04.

[9] Marc G. Bellemare,et al. Dopamine: A Research Framework for Deep Reinforcement Learning , 2018, ArXiv.

[10] Alekh Jindal,et al. A Comparison of Knives for Bread Slicing , 2013, Proc. VLDB Endow..

[11] Marc G. Bellemare,et al. A Distributional Perspective on Reinforcement Learning , 2017, ICML.

[12] Ion Stoica,et al. Learning to Optimize Join Queries With Deep Reinforcement Learning , 2018, ArXiv.