Dynamic Energy Estimation of Query Plans in Database Systems

Data centers are well known to consume large amounts of energy. Since database is one of the major applications in a typical data center, building energy-aware database systems has become an active research topic recently. The quantification of the energy cost of database systems is an important task in designing such systems. In this paper, we report our recent efforts on this topic, with a focus on the energy cost estimation of query plans during query optimization. We start from building a series of physical models for energy estimation of individual relational operators based on their resource consumption patterns. Since the execution of individual queries is a combination of relational operators, we use the physical models as a basis for a comprehensive energy cost estimation model for entire query plans. To further improve model accuracy under system dynamics and the variations of workload characteristics, we develop an online model estimation scheme that dynamically corrects the static model based on advanced modeling techniques adopted from control engineering. The models are implemented in a real database and evaluated on a physical test bed with a comprehensive set of experimental workloads. The results show that our solution achieves a high accuracy (above 90%) in energy estimation despite noises from the system and workloads.

[1]  Ricardo Bianchini,et al.  Energy conservation in heterogeneous server clusters , 2005, PPoPP.

[2]  Surajit Chaudhuri,et al.  Rethinking Query Processing for Energy Efficiency: Slowing Down to Win the Race. , 2011 .

[3]  Raghunath Othayoth Nambiar,et al.  Tuning servers, storage and database for energy efficient data warehouses , 2010, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010).

[4]  Karthick Rajamani,et al.  Designing Energy-Efficient Servers and Data Centers , 2010, Computer.

[5]  Michael Kistler,et al.  The case for power management in web servers , 2002 .

[6]  Raghunath Othayoth Nambiar,et al.  Energy cost, the key challenge of today's data centers: a power consumption analysis of TPC-C results , 2008, Proc. VLDB Endow..

[7]  Patricia G. Selinger,et al.  Access path selection in a relational database management system , 1979, SIGMOD '79.

[8]  Anna G. Stefanopoulou,et al.  Recursive least squares with forgetting for online estimation of vehicle mass and road grade: theory and experiments , 2005 .

[9]  Robert Kooi,et al.  The Optimization of Queries in Relational Databases , 1980 .

[10]  T. N. Vijaykumar,et al.  Joint optimization of idle and cooling power in data centers while maintaining response time , 2010, ASPLOS XV.

[11]  Parthasarathy Ranganathan,et al.  Energy Efficiency: The New Holy Grail of Data Management Systems Research , 2009, CIDR.

[12]  I. David Abrahams,et al.  A brief historical perspective of the Wiener–Hopf technique , 2007 .

[13]  Xiaorui Wang,et al.  Exploring power-performance tradeoffs in database systems , 2010, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010).

[14]  Feng Zhao,et al.  Virtual machine power metering and provisioning , 2010, SoCC '10.

[15]  Amin Vahdat,et al.  Managing energy and server resources in hosting centers , 2001, SOSP.

[16]  Mehul A. Shah,et al.  Analyzing the energy efficiency of a database server , 2010, SIGMOD Conference.

[17]  Jignesh M. Patel,et al.  Rethinking Query Processing for Energy Efficiency: Slowing Down to Win the Race , 2011, IEEE Data Eng. Bull..

[18]  Rami Melhem,et al.  Power Aware Computing , 2002, Series in Computer Science.

[19]  Erol Gelenbe,et al.  Energy-Efficient Cloud Computing , 2010, Comput. J..

[20]  Sandeep K. S. Gupta,et al.  TACOMA: Server and workload management in internet data centers considering cooling-computing power trade-off and energy proportionality , 2012, TACO.

[21]  Gargi Dasgupta,et al.  Server Workload Analysis for Power Minimization using Consolidation , 2009, USENIX Annual Technical Conference.

[22]  Thomas F. Wenisch,et al.  Power management of online data-intensive services , 2011, 2011 38th Annual International Symposium on Computer Architecture (ISCA).

[23]  Jignesh M. Patel,et al.  Towards Eco-friendly Database Management Systems , 2009, CIDR.

[24]  Kai Ma,et al.  Adaptive Power Control with Online Model Estimation for Chip Multiprocessors , 2011, IEEE Transactions on Parallel and Distributed Systems.

[25]  Amin Vahdat,et al.  ECOSystem: managing energy as a first class operating system resource , 2002, ASPLOS X.

[26]  Ricardo Bianchini,et al.  Power and energy management for server systems , 2004, Computer.

[27]  Wenfei Fan,et al.  Power Based Performance and Capacity Estimation Models for Enterprise Information Systems. , 2011 .

[28]  Surajit Chaudhuri,et al.  An overview of query optimization in relational systems , 1998, PODS.

[29]  Luca Benini,et al.  A survey of design techniques for system-level dynamic power management , 2000, IEEE Trans. Very Large Scale Integr. Syst..

[30]  Sally A. McKee,et al.  A Cost Model For Integrated Restructuring Optimizations , 2003, J. Instr. Level Parallelism.

[31]  Sujata Banerjee,et al.  A Power Benchmarking Framework for Network Devices , 2009, Networking.

[32]  Beng Chin Ooi,et al.  The Claremont report on database research , 2008, SGMD.

[33]  Kushagra Vaid,et al.  Energy benchmarks: a detailed analysis , 2010, e-Energy.

[34]  Xiaoyun Zhu,et al.  Power-Efficient Response Time Guarantees for Virtualized Enterprise Servers , 2008, 2008 Real-Time Systems Symposium.

[35]  Jayant R. Haritsa,et al.  Peak power plays in database engines , 2012, EDBT '12.

[36]  Raghunath Othayoth Nambiar,et al.  Power Based Performance and Capacity Estimation Models for Enterprise Information Systems , 2011, IEEE Data Eng. Bull..

[37]  Kai Ma,et al.  Temperature-constrained power control for chip multiprocessors with online model estimation , 2009, ISCA '09.

[38]  Margaret Martonosi,et al.  Runtime Power Monitoring in High-End Processors: Methodology and Empirical Data , 2003, MICRO.

[39]  Guy M. Lohman,et al.  Optimizer Validation and Performance Evaluation for Distributed Queries , 1998 .