Thermal influence indices: Causality metrics for efficient exploration of data center cooling

Cooling is an important issue in data center design and operation. Accurate evaluation of a design or operational parameter choice for cooling is difficult as it requires several runs of computationally intensive Computational Fluid Dynamics (CFD) based models. Therefore there is need for an exploration method that does not incur enormous computation. In addition, the exploration should also provide insights that enable informed decision making. Given these twin goals of reduced computation and improved insights, we present a novel approach to data center cooling exploration. The key idea is to do a local search around the current design/operation of a data center to obtain better design/operation parameters subject to the desired constraints. To do this, all the microscopic information about airflow and temperature in data center available from a single run of CFD computation is converted into macroscopic metrics called influence indices. The influence indices, which characterize the causal relationship between heat sources and sinks, are used to refine the design/operation of the data center either manually or programmatically. New designs are evaluated with further CFD runs to compute new influence indices and the process is repeated to yield improved designs as per the computation budget available. We have carried out design exploration of a realistic data center using this methodology. Specifically, we considered maximization of the heat load in the data center subject to the constraints that: 1) servers are kept at appropriate temperatures and 2) overloading of CRACs is avoided. Our evaluation shows that the use of influence indices cuts down the exploration time by 80 % for a 1500 sq. ft. data center.

[1]  S. Patankar,et al.  Use of Computational Fluid Dynamics for Calculating Flow Rates Through Perforated Tiles in Raised-Floor Data Centers , 2003 .

[2]  C.D. Patel,et al.  Dynamic thermal management of air cooled data centers , 2006, Thermal and Thermomechanical Proceedings 10th Intersociety Conference on Phenomena in Electronics Systems, 2006. ITHERM 2006..

[3]  Joonwon Lee,et al.  A CFD-Based Tool for Studying Temperature in Rack-Mounted Servers , 2008, IEEE Transactions on Computers.

[4]  Roger R. Schmidt,et al.  Cluster of High-Powered Racks Within a Raised-Floor Computer Data Center: Effect of Perforated Tile Flow Distribution on Rack Inlet Air Temperatures , 2004 .

[5]  Cullen E. Bash,et al.  Thermal considerations in cooling large scale high compute density data centers , 2002, ITherm 2002. Eighth Intersociety Conference on Thermal and Thermomechanical Phenomena in Electronic Systems (Cat. No.02CH37258).

[6]  Michael D. Sohn,et al.  Tracer gas transport under mixed convection conditions in an experimental atrium: Comparison between experiments and CFD predictions , 2006 .

[7]  Qinghui Tang,et al.  Sensor-Based Fast Thermal Evaluation Model For Energy Efficient High-Performance Datacenters , 2006, 2006 Fourth International Conference on Intelligent Sensing and Information Processing.

[8]  Jeffrey Rambo,et al.  Thermal Performance Metrics for Arranging Forced Air Cooled Servers in a Data Processing Cabinet , 2005 .

[9]  Jeffrey S. Chase,et al.  Weatherman: Automated, Online and Predictive Thermal Mapping and Management for Data Centers , 2006, 2006 IEEE International Conference on Autonomic Computing.

[10]  Cullen E. Bash,et al.  DIMENSIONLESS PARAMETERS FOR EVALUATION OF THERMAL DESIGN AND PERFORMANCE OF LARGE-SCALE DATA CENTERS , 2002 .

[11]  J. Rambo,et al.  Reduced-Order Modeling of Multiscale Turbulent Convection: Application to Data Center Thermal Management , 2006 .

[12]  Saurabh K. Shrivastava,et al.  A Flow-Network Model for Predicting Rack Cooling in Containment Systems , 2009 .

[13]  Ricardo Bianchini,et al.  Mercury and freon: temperature emulation and management for server systems , 2006, ASPLOS XII.

[14]  Bahgat Sammakia,et al.  Optimization of data center room layout to minimize rack inlet air temperature , 2006 .

[15]  Tullie Circle,et al.  AMERICAN SOCIETY OF HEATING, REFRIGERATING AND AIR-CONDITIONING , 2013 .

[16]  Jeffrey S. Chase,et al.  Balance of power: dynamic thermal management for Internet data centers , 2005, IEEE Internet Computing.

[17]  EnergyInformationAdministration Annual Energy Outlook 2008 With Projections to 2030 , 2008 .

[18]  Jeffrey S. Chase,et al.  Making Scheduling "Cool": Temperature-Aware Workload Placement in Data Centers , 2005, USENIX Annual Technical Conference, General Track.

[19]  M.J. Ellsworth,et al.  Review of cooling technologies for computer products , 2004, IEEE Transactions on Device and Materials Reliability.

[20]  Umesh Singh,et al.  CFD-Based Operational Thermal Efficiency Improvement of a Production Data Center , 2010, SustainIT.

[21]  R. Schmidt,et al.  Experimental-Numerical Comparison for a High-Density Data Center: Hot Spot Heat Fluxes in Excess of 500 W/FT2 , 2006, Thermal and Thermomechanical Proceedings 10th Intersociety Conference on Phenomena in Electronics Systems, 2006. ITHERM 2006..