Chip Temperature-Based Workload Allocation for Holistic Power Minimization in Air-Cooled Data Center

Minimizing the energy consumption is a dominant problem in data center design and operation. To cope with this issue, the common approach is to optimize the data center layout and the workload distribution among servers. Previous works have mainly adopted the temperature at the server inlet as the optimization constraint. However, the inlet temperature does not properly characterize the server’s thermal state. In this paper, a chip temperature-based workload allocation strategy (CTWA-MTP) is proposed to reduce the holistic power consumption in data centers. Our method adopts an abstract heat-flow model to describe the thermal environment in data centers and uses a thermal resistance model to describe the convective heat transfer of the server. The core optimizes the workload allocation with respect to the chip temperature threshold. In addition, the temperature-dependent leakage power of the server has been considered in our model. The proposed method is described as a constrained nonlinear optimization problem to find the optimal solution by a genetic algorithm (GA). We applied the method to a sample data center constructed with computational fluid dynamics (CFD) software. By comparing the simulation results with other different workload allocation strategies, the proposed method prevents the servers from overcooling and achieves a substantial energy saving by optimizing the workload allocation in an air-cooled data center.

[1]  Mengxuan Song,et al.  Thermal-Aware Energy Management of an HPC Data Center via Two-Time-Scale Control , 2017, IEEE Transactions on Industrial Informatics.

[2]  Bruce Nordman,et al.  ERNEST ORLANDO LAWRENCE BERKELEY NATIONAL LABORATORY , 2011 .

[3]  T. Wintgens,et al.  B and Li isotopes as intrinsic tracers for injection tests in aquifer storage and recovery systems , 2009 .

[4]  Christof Vömel,et al.  Neural Network-Based Prediction and Control of Air Flow in a Data Center , 2012 .

[5]  Yogendra Joshi,et al.  Reduced Order Thermal Models of Multiscale Microsystems , 2012 .

[6]  Ayan Banerjee,et al.  Integrating cooling awareness with thermal aware workload placement for HPC data centers , 2011, Sustain. Comput. Informatics Syst..

[7]  Masud Behnia,et al.  A Comparison of Parametric and Multivariable Optimization Techniques in a Raised-Floor Data Center , 2013 .

[8]  Yogendra Joshi,et al.  Rapid Temperature Predictions in Data Centers using Multi-Parameter Proper Orthogonal Decomposition , 2014 .

[9]  Michael D. Sohn,et al.  Tracer gas transport under mixed convection conditions in an experimental atrium: Comparison between experiments and CFD predictions , 2006 .

[10]  Alfonso Ortega,et al.  Optimization of Data Center Cooling Efficiency Using Reduced Order Flow Modeling Within a Flow Network Modeling Approach , 2014 .

[11]  Saman K. Halgamuge,et al.  Minimizing the thermal impact of computing equipment upgrades in data centers , 2012 .

[12]  S. A. Nada,et al.  Numerical investigation and parametric study for thermal and energy management enhancements in data centers' buildings , 2016 .

[13]  Saman K. Halgamuge,et al.  Multi-objective efficiency enhancement using workload spreading in an operational data center , 2015 .

[14]  Anand Sivasubramaniam,et al.  Fast and Accurate Evaluation of Cooling in Data Centers , 2015 .

[15]  Dustin W. Demetriou,et al.  Expanded Assessment of a Practical Thermally Aware Energy-Optimized Load Placement Strategy for Open-Aisle, Air-Cooled Data Centers , 2013 .

[16]  Yogendra Joshi,et al.  Proper Orthogonal Decomposition for Reduced Order Thermal Modeling of Air Cooled Data Centers , 2010 .

[17]  David W. Coit,et al.  Multi-objective optimization using genetic algorithms: A tutorial , 2006, Reliab. Eng. Syst. Saf..

[18]  Ran Zhang,et al.  Joint Cooling and Server Control in Data Centers: A Cross-Layer Framework for Holistic Energy Minimization , 2018, IEEE Systems Journal.

[19]  Saman K. Halgamuge,et al.  Potential of air-side economizers for data center cooling: A case study for key Australian cities , 2013 .

[20]  Bahgat Sammakia,et al.  Airflow and temperature distribution optimization in data centers using artificial neural networks , 2013 .

[21]  Yogendra Joshi Reduced Order Thermal Models of Multi-Scale Microsystems , 2010 .

[22]  Jae-Weon Jeong,et al.  Simplified server model to simulate data center cooling energy consumption , 2015 .

[23]  Katzalin Olcoz,et al.  On the leakage-power modeling for optimal server operation , 2013 .

[24]  Karsten Schwan,et al.  Coordinated Optimization of Cooling and IT Power in Data Centers , 2010 .

[25]  Ricardo Bianchini,et al.  Mercury and freon: temperature emulation and management for server systems , 2006, ASPLOS XII.

[26]  Jie Meng,et al.  Simulation and optimization of HPC job allocation for jointly reducing communication and cooling costs , 2015, Sustain. Comput. Informatics Syst..

[27]  Steve Greenberg,et al.  Best Practices for Data Centers: Lessons Learned from Benchmarking 22 Data Centers , 2006 .

[28]  Dimos Poulikakos,et al.  Aquasar: A hot water cooled data center with direct energy reuse , 2012 .

[29]  Sandeep K. S. Gupta,et al.  Energy-Efficient Thermal-Aware Task Scheduling for Homogeneous High-Performance Computing Data Centers: A Cyber-Physical Approach , 2008, IEEE Transactions on Parallel and Distributed Systems.

[30]  Jeffrey S. Chase,et al.  Making Scheduling "Cool": Temperature-Aware Workload Placement in Data Centers , 2005, USENIX Annual Technical Conference, General Track.

[31]  G. I. Meijer,et al.  Cooling Energy-Hungry Data Centers , 2010, Science.

[32]  Farrokh Mistree,et al.  Adaptable Robust Design of Multi-Scale Convective Systems Applied to Energy Efficient Data Centers , 2010 .

[33]  José Manuel Moya,et al.  Leakage-Aware Cooling Management for Improving Server Energy Efficiency , 2015, IEEE Transactions on Parallel and Distributed Systems.

[34]  M. Behnia,et al.  Thermal Performance of an Air-Cooled Data Center With Raised-Floor and Non-Raised-Floor Configurations , 2014 .

[35]  Huazhong Yang,et al.  Accurate temperature-dependent integrated circuit leakage power estimation is easy , 2007 .

[36]  Sandeep K. S. Gupta,et al.  Thermal-Aware Task Scheduling to Minimize Energy Usage of Blade Server Based Datacenters , 2006, 2006 2nd IEEE International Symposium on Dependable, Autonomic and Secure Computing.

[37]  Daniel Schall,et al.  Energy-proportional query execution using a cluster of wimpy nodes , 2013, DaMoN '13.

[38]  H. Ezzat Khalifa,et al.  Thermally Aware, Energy-Based Load Placement in Open-Aisle, Air-Cooled Data Centers , 2013 .