SPATIALLY-AWARE OPTIMIZATION OF ENERGY CONSUMPTION IN CONSOLIDATED DATA CENTER SYSTEMS

Energy efficiency in data center operation depends on many factors, including power distribution, thermal load and consequent cooling costs, and IT management in terms of how and where IT load is placed and moved under changing request loads. Current methods provided by vendors consolidate IT loads onto the smallest number of machines needed to meet application requirements. This paper’s goal is to gain further improvements in energy efficiency by also making such methods ’spatially aware’, so that load is placed onto machines in ways that respect the efficiency of both cooling and power usage, across and within racks. To help implement spatially aware load placement, we propose a model-based reinforcement learning method to learn and then predict the thermal distribution of different placements for incoming workloads. The method is trained with actual data captured in a fully instrumented data center facility. Experimental results showing notable differences in total power consumption for representative application loads indicate the utility of a two-level spatially-aware workload management (SpAWM) technique in which (i) load is distributed across racks in ways that recognize differences in cooling efficiencies and (ii) within racks, load is distributed so as to take into account cooling effectiveness due to local air flow. The technique is being im

[1]  Robert P. Goldberg,et al.  Survey of virtual machine research , 1974, Computer.

[2]  Ayan Banerjee,et al.  Cooling-aware and thermal-aware workload placement for green HPC data centers , 2010, International Conference on Green Computing.

[3]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[4]  Ayan Banerjee,et al.  Energy Efficiency of Thermal-Aware Job Scheduling Algorithms under Various Cooling Models , 2009, IC3.

[5]  Madhusudan K. Iyengar,et al.  Challenges of data center thermal management , 2005, IBM J. Res. Dev..

[6]  Karsten Schwan,et al.  CoolIT: coordinating facility and it management for efficient datacenters , 2008, CLUSTER 2008.

[7]  Jeffrey S. Chase,et al.  Weatherman: Automated, Online and Predictive Thermal Mapping and Management for Data Centers , 2006, 2006 IEEE International Conference on Autonomic Computing.

[8]  Madhusudan K. Iyengar,et al.  Thermodynamics of information technology data centers , 2009, IBM J. Res. Dev..

[9]  Qinghui Tang,et al.  Sensor-Based Fast Thermal Evaluation Model For Energy Efficient High-Performance Datacenters , 2006, 2006 Fourth International Conference on Intelligent Sensing and Information Processing.

[10]  Karsten Schwan,et al.  Providing platform heterogeneity-awareness for data center power management , 2008, Cluster Computing.

[11]  Dutch T. Meyer,et al.  Remus: High Availability via Asynchronous Virtual Machine Replication. (Best Paper) , 2008, NSDI.

[12]  Werner Vogels,et al.  Beyond Server Consolidation , 2008, ACM Queue.

[13]  Cullen E. Bash,et al.  Smart cooling of data centers , 2003 .

[14]  Le Yi Wang,et al.  VCONF: a reinforcement learning approach to virtual machines auto-configuration , 2009, ICAC '09.

[15]  Abhijit Gosavi,et al.  Reinforcement Learning: A Tutorial Survey and Recent Advances , 2009, INFORMS J. Comput..

[16]  Ada Gavrilovska,et al.  VM power metering: feasibility and challenges , 2011, PERV.

[17]  Andrew Warfield,et al.  Live migration of virtual machines , 2005, NSDI.

[18]  Richard E. Brown,et al.  Report to Congress on Server and Data Center Energy Efficiency: Public Law 109-431 , 2008 .

[19]  Rajarshi Das,et al.  On the use of hybrid reinforcement learning for autonomic resource allocation , 2007, Cluster Computing.

[20]  Massoud Pedram,et al.  Minimizing data center cooling and server power costs , 2009, ISLPED.

[21]  Bahgat Sammakia,et al.  Data Center Cooling Prediction Using Artificial Neural Network , 2007 .

[22]  Karsten Schwan,et al.  Coordinated Optimization of Cooling and IT Power in Data Centers , 2010 .