Making Scheduling "Cool": Temperature-Aware Workload Placement in Data Centers

Trends toward consolidation and higher-density computing configurations make heat management one of the critical challenges in emerging data centers. Conventional approaches to this problem have focused on the facilities level, developing new cooling technologies or optimizing the delivery of cooling. In contrast, our paper explores an alternative dimension: a systems-level solution that controls heat generation through temperature-aware workload placement. We first examine a theoretical thermodynamic formulation that uses information about steady-state hot spots and cold spots in the data center, and from it develop real-world scheduling algorithms. Based on the insights from these results, we develop an alternative approach. This new approach leverages the non-intuitive observation that the source of cooling inefficiencies can often be in locations spatially uncorrelated with its manifested consequences; this enables additional energy savings. Overall, our results demonstrate up to a factor-of-two reduction in annual data center cooling costs over location-agnostic workload distribution, purely through software optimizations and without any costly capital investment.
