GreenDB: Energy-Efficient Prefetching and Caching in Database Clusters

In this study, we propose an energy-efficient database system called GreenDB running on clusters. GreenDB applies a workload-skewness strategy by managing hot nodes coupled with a set of cold nodes in a database cluster. GreenDB fetches popular data tables to hot nodes, aiming to keep cold nodes in the low-power mode in increased time periods. GreenDB is conducive to reducing the number of power-state transitions, thereby lowering energy-saving overhead. A prefetching model and an energy saving model are seamlessly integrated into GreenDB to facilitate the power management in database clusters. We quantitatively evaluate GreenDB’s energy efficiency in terms of managing, fetching, and storing data. We compare GreenDB’s prefetching strategy with the one implemented in Postgresql. Experimental results indicate that GreenDB conserves the energy consumption of the existing solution by up to 98.4 percent. The findings show that the energy efficiency of GreenDB can be optimized by tuning system parameters, including table size, hit rates, number of nodes, number of disks, and inter-arrival delays.

[1]  David Atienza,et al.  Free cooling-aware dynamic power management for green datacenters , 2012, 2012 International Conference on High Performance Computing & Simulation (HPCS).

[2]  Rajkumar Buyya,et al.  Energy-aware resource allocation heuristics for efficient management of data centers for Cloud computing , 2012, Future Gener. Comput. Syst..

[3]  Gustavo Alonso,et al.  Scalable Replication in Database Clusters , 2000, DISC.

[4]  Hamed Mohsenian Rad,et al.  Energy and Performance Management of Green Data Centers: A Profit Maximization Approach , 2013, IEEE Transactions on Smart Grid.

[5]  Xiao Qin,et al.  PRE-BUD: Prefetching for energy-efficient parallel I/O systems with buffer disks , 2011, TOS.

[6]  Christopher Stewart,et al.  Blending on-demand and spot instances to lower costs for in-memory storage , 2016, IEEE INFOCOM 2016 - The 35th Annual IEEE International Conference on Computer Communications.

[7]  Arvind Krishnamurthy,et al.  Modeling Hard-Disk Power Consumption , 2003, FAST.

[8]  Lachlan L. H. Andrew,et al.  Dynamic Right-Sizing for Power-Proportional Data Centers , 2011, IEEE/ACM Transactions on Networking.

[9]  Chunming Qiao,et al.  Availability-aware energy-efficient virtual machine placement , 2015, 2015 IEEE International Conference on Communications (ICC).

[10]  Hewlett Packard,et al.  Enterprise Power and Cooling: a Chip-to-data Center Perspective , .

[11]  Xiao Qin,et al.  EAD and PEBD: Two Energy-Aware Duplication Scheduling Algorithms for Parallel Tasks on Homogeneous Clusters , 2011, IEEE Transactions on Computers.

[12]  Rajiv Ranjan,et al.  Survey of Techniques and Architectures for Designing Energy-Efficient Data Centers , 2016, IEEE Systems Journal.

[13]  Goetz Graefe,et al.  New algorithms for join and grouping operations , 2012, Computer Science - Research and Development.

[14]  Masaru Kitsuregawa,et al.  Energy Efficient Storage Management Cooperated with Large Data Intensive Applications , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[15]  Kenneth Salem,et al.  EdgeX: Edge Replication for Web Applications , 2015, 2015 IEEE 8th International Conference on Cloud Computing.

[16]  Alfons Kemper,et al.  Locality-sensitive operators for parallel main-memory database clusters , 2014, 2014 IEEE 30th International Conference on Data Engineering.

[17]  Luís E. T. Rodrigues,et al.  AutoPlacer: Scalable Self-Tuning Data Placement in Distributed Key-Value Stores , 2015, TAAS.

[18]  Xiao Qin,et al.  EDOM: Improving energy efficiency of database operations on multicore servers , 2020, Future Gener. Comput. Syst..

[19]  Athanasios V. Vasilakos,et al.  GreenDCN: A General Framework for Achieving Energy Efficiency in Data Center Networks , 2013, IEEE Journal on Selected Areas in Communications.

[20]  César A. F. De Rose,et al.  Towards better manageability of database clusters on cloud computing platforms , 2014, SAC.

[21]  サートリ・フィリップ,et al.  Data processing system and method for the , 1999 .

[22]  Xiaorui Wang,et al.  PET: Reducing Database Energy Cost via Query Optimization , 2012, Proc. VLDB Endow..

[23]  Ali R. Hurson,et al.  Energy-efficient indexing on a broadcast channel in a mobile database access system , 2000, Proceedings International Conference on Information Technology: Coding and Computing (Cat. No.PR00540).

[24]  Daniel Schall,et al.  Towards an Energy-Proportional Storage System using a Cluster of Wimpy Nodes , 2013, BTW.

[25]  Rajkumar Buyya,et al.  OpenStack Neat: a framework for dynamic and energy‐efficient consolidation of virtual machines in OpenStack clouds , 2015, Concurr. Comput. Pract. Exp..

[26]  Marta Mattoso,et al.  Parallel OLAP query processing in database clusters with data replication , 2009, Distributed and Parallel Databases.

[27]  Nuno Diegues,et al.  Self-Tuning Intel Transactional Synchronization Extensions , 2014, ICAC.

[28]  Xiao Qin,et al.  An Energy-Efficient Scheduling Algorithm Using Dynamic Voltage Scaling for Parallel Applications on Clusters , 2007, 2007 16th International Conference on Computer Communications and Networks.

[29]  Sujata Banerjee,et al.  On energy efficiency for enterprise and data center networks , 2011, IEEE Communications Magazine.

[30]  Mehul A. Shah,et al.  Analyzing the energy efficiency of a database server , 2010, SIGMOD Conference.

[31]  Albert Y. Zomaya,et al.  A note on energy efficient data, services and memory management in Big Data Information Systems , 2015, Inf. Sci..