MetricStore repository: on the leveraging of performance metrics in databases

The database performance is one of the most important quality indicators that companies are looking for to choose their appropriate database management systems. Quantifying this performance is usually performed by the means of mathematical cost models. Due to the importance of these models, for each evolution of the database technology pushes researchers to revisit or to propose cost models in order to integrate new dimensions brought by that evolution. As a consequence, a huge number of cost models exists. To exploit them, we need to find their respective scientific papers. This situation is in contradiction with Era Sharing, because it reduces reuse of these cost models by researchers, students (from third world countries), etc. and even more it penalizes the reproduction of experiments that intensively use these cost models. In this paper, we propose a framework for cost models dedicated to query processing and optimization. We first propose a common repository, called, MetricStore, to store metrics of cost model units. Secondly, thanks to model-driven engineering facilities, the repository offers capabilities aiming at publishing, searching and reusing cost models through a suitable user interface. Tool support is fully available.

[1]  Martin Bichler,et al.  More than bin packing: Dynamic resource allocation strategies in cloud data centers , 2015, Inf. Syst..

[2]  Patricia G. Selinger,et al.  Access path selection in a relational database management system , 1979, SIGMOD '79.

[3]  Jignesh M. Patel,et al.  Towards Cost-Effective Storage Provisioning for DBMSs , 2011, Proc. VLDB Endow..

[4]  Surajit Chaudhuri,et al.  AutoAdmin “what-if” index analysis utility , 1998, SIGMOD '98.

[5]  Viktor Leis,et al.  How Good Are Query Optimizers, Really? , 2015, Proc. VLDB Endow..

[6]  Gunter Saake,et al.  A Framework for Optimal Selection of a Storage Architecture in RDBMS , 2010 .

[7]  Xiaorui Wang,et al.  PET: Reducing Database Energy Cost via Query Optimization , 2012, Proc. VLDB Endow..

[8]  Ilia Petrov,et al.  Making cost-based query optimization asymmetry-aware , 2012, DaMoN '12.

[9]  Martin L. Kersten,et al.  Generic Database Cost Models for Hierarchical Memory Systems , 2002, VLDB.

[10]  Alfredo Cuzzocrea,et al.  Verification of Partitioning and Allocation Techniques on Teradata DBMS , 2011, ICA3PP.

[11]  Surajit Chaudhuri,et al.  Table of Contents (pdf) , 2007, VLDB.

[12]  Ramakrishna Varadarajan,et al.  DBDesigner: A customizable physical design tool for Vertica Analytic Database , 2014, 2014 IEEE 30th International Conference on Data Engineering.

[13]  Elizabeth N. Fong,et al.  Reference model for DBMS standardization , 1986, SGMD.

[14]  Stuart Kent Model Driven Language Engineering , 2003, Electron. Notes Theor. Comput. Sci..

[15]  Amine Roukh,et al.  A Meta-advisor Repository for Database Physical Design , 2016, MEDI.

[16]  ZhaoHui Tang,et al.  Calibrating the Query Optimizer Cost Model of IRO-DB, an Object-Oriented Federated Database System , 1996, VLDB.

[17]  Ladjel Bellatreche,et al.  How to exploit the device diversity and database interaction to propose a generic cost model? , 2013, IDEAS '13.

[18]  Thomas Heinis,et al.  PARINDA: an interactive physical designer for PostgreSQL , 2010, EDBT '10.

[19]  Ladjel Bellatreche,et al.  CostDL: A Cost Models Description Language for Performance Metrics in Database , 2016, 2016 21st International Conference on Engineering of Complex Computer Systems (ICECCS).

[20]  Amine Roukh,et al.  Eco-DMW: Eco-Design Methodology for Data warehouses , 2015, DOLAP.

[21]  Martin Bichler,et al.  Reproducible experiments on dynamic resource allocation in cloud data centers , 2016, Inf. Syst..

[22]  Gunter Saake,et al.  A framework for cost based optimization of hybrid CPU/GPU query plans in database systems , 2012 .

[23]  Barzan Mozafari,et al.  CliffGuard: A Principled Framework for Finding Robust Database Designs , 2015, SIGMOD Conference.