GentleCool: Cooling aware proactive workload scheduling in multi-machine systems

In state of the art systems, workload scheduling and server fan speed operate independently leading to cooling inefficiencies. We propose GentleCool, a proactive multi-tier approach for significantly lowering the fan cooling costs without compromising the performance. Our technique manages the fan speed through intelligently allocating the workload across different machines. The experimental results show our approach delivers average cooling energy savings of 72% and improves the mean time between failures (MTBF) of the fans by 2.3X compared to the state of the art.

[1]  R.H. Lyon,et al.  Noise and cooling in electronics packages , 2004, Twentieth Annual IEEE Semiconductor Thermal Measurement and Management Symposium (IEEE Cat. No.04CH37545).

[2]  Kevin Skadron,et al.  Temperature-aware microarchitecture: Modeling and implementation , 2004, TACO.

[3]  Krste Asanovic,et al.  Reducing power density through activity migration , 2003, ISLPED '03.

[4]  S. Gupta,et al.  Thermal-aware task scheduling for data centers through minimizing heat recirculation , 2007, 2007 IEEE International Conference on Cluster Computing.

[5]  Akshat Verma,et al.  pMapper: Power and Migration Cost Aware Application Placement in Virtualized Systems , 2008, Middleware.

[6]  Cullen E. Bash,et al.  Smart cooling of data centers , 2003 .

[7]  Andrew Warfield,et al.  Live migration of virtual machines , 2005, NSDI.

[8]  Karthick Rajamani,et al.  Energy Management for Commercial Servers , 2003, Computer.

[9]  Manish Marwah,et al.  Optimal Fan Speed Control for Thermal Management of Servers , 2009 .

[10]  Jeffrey S. Chase,et al.  Making Scheduling "Cool": Temperature-Aware Workload Placement in Data Centers , 2005, USENIX Annual Technical Conference, General Track.

[11]  Herming Chiueh,et al.  A novel fully integrated fan controller for advanced computer systems , 2000, 2000 Southwest Symposium on Mixed-Signal Design (Cat. No.00EX390).

[12]  Shahin Nazarian,et al.  Thermal Modeling, Analysis, and Management in VLSI Circuits: Principles and Methods , 2006, Proceedings of the IEEE.

[13]  M.K. Patterson,et al.  The effect of data center temperature on energy efficiency , 2008, 2008 11th Intersociety Conference on Thermal and Thermomechanical Phenomena in Electronic Systems.

[14]  George Paparrizos An Integrated Fan Speed Control Solution Can Lower System Costs, Reduce Acoustic Noise, Power Consumption and Enhance System Reliability , 2003 .

[15]  Tajana Simunic,et al.  vGreen: a system for energy efficient computing in virtualized environments , 2009, ISLPED.