VMAP: Proactive thermal-aware virtual machine allocation in HPC cloud datacenters

Clouds provide the abstraction of nearly-unlimited computing resources through the elastic use of federated resource pools (virtualized datacenters). They are being increasingly considered for HPC applications, which have traditionally targeted grids and supercomputing clusters. However, maximizing energy efficiency and utilization of cloud datacenter resources, avoiding undesired thermal hotspots (due to overheating of over-utilized computing equipment), and ensuring quality of service guaran-tees for HPC applications are all conflicting objectives, which require joint consideration of multiple pairwise tradeoffs. The novel concept of heat imbalance, which captures the unevenness in heat generation and extraction, at different regions inside a HPC cloud datacenter is introduced. This thermal awareness enables proactive datacenter management through prediction of future temperature trends as opposed to the state-of-the-art reactive management based on current temperature measurements. VMAP, an innovative proactive thermal-aware virtual machine consolidation technique is proposed to maximize computing resource utilization, to minimize datacenter energy consumption for computing, and to improve the efficiency of heat extraction. The effectiveness of the proposed technique is verified through experimental evaluations with HPC workload traces under single-as well as federated-datacenter scenarios (in the machine rooms at Rutgers University and University of Florida).

[1]  Jie Liu,et al.  Towards Discovering Data Center Genome Using Sensor Nets , 2008 .

[2]  Ricardo Bianchini,et al.  Mercury and freon: temperature emulation and management for server systems , 2006, ASPLOS XII.

[3]  Manish Parashar,et al.  Energy-efficient application-aware online provisioning for virtualized clouds and data centers , 2010, International Conference on Green Computing.

[4]  Cullen E. Bash,et al.  Computational Fluid Dynamics Modeling of High Compute Density Data Centers to Assure System Inlet Air Specifications , 2001 .

[5]  Richard E. Brown,et al.  Report to Congress on Server and Data Center Energy Efficiency: Public Law 109-431 , 2008 .

[6]  Dario Pompili,et al.  Proactive thermal management in green datacenters , 2012, The Journal of Supercomputing.

[7]  Steve Greenberg,et al.  Best Practices for Data Centers: Lessons Learned from Benchmarking 22 Data Centers , 2006 .

[8]  Jeffrey S. Chase,et al.  Making Scheduling "Cool": Temperature-Aware Workload Placement in Data Centers , 2005, USENIX Annual Technical Conference, General Track.

[9]  Ricardo Bianchini,et al.  C-Oracle: Predictive thermal management for data centers , 2008, 2008 IEEE 14th International Symposium on High Performance Computer Architecture.

[10]  Sandeep K. S. Gupta,et al.  Energy-Efficient Thermal-Aware Task Scheduling for Homogeneous High-Performance Computing Data Centers: A Cyber-Physical Approach , 2008, IEEE Transactions on Parallel and Distributed Systems.

[11]  Madhusudan K. Iyengar,et al.  Challenges of data center thermal management , 2005, IBM J. Res. Dev..

[12]  Krishna C. Saraswat,et al.  Scaling trends for the on chip power dissipation , 2002, Proceedings of the IEEE 2002 International Interconnect Technology Conference (Cat. No.02EX519).

[13]  Jeffrey S. Chase,et al.  Weatherman: Automated, Online and Predictive Thermal Mapping and Management for Data Centers , 2006, 2006 IEEE International Conference on Autonomic Computing.

[14]  Dror G. Feitelson,et al.  The workload on parallel supercomputers: modeling the characteristics of rigid jobs , 2003, J. Parallel Distributed Comput..

[15]  Richard M. Karp,et al.  A probabilistic analysis of multidimensional bin packing problems , 1984, STOC '84.

[16]  Chandrakant D. Patel,et al.  Thermo-Fluids Provisioning of a High Performance High Density Data Center , 2007, Distributed and Parallel Databases.

[17]  Ayan Banerjee,et al.  Spatio-temporal thermal-aware job scheduling to minimize energy consumption in virtualized heterogeneous data centers , 2009, Comput. Networks.

[18]  Andrew Chi-Chih Yao,et al.  Resource Constrained Scheduling as Generalized Bin Packing , 1976, J. Comb. Theory A.

[19]  S. K. Chang,et al.  A general packing algorithm for multidimensional resource requirements , 1977, International Journal of Computer & Information Sciences.

[20]  George Forman,et al.  Cool Job Allocation: Measuring the Power Savings of Placing Jobs at Cooling-Efficient Locations in the Data Center , 2007, USENIX Annual Technical Conference.

[21]  Dario Pompili,et al.  Self-organizing sensing infrastructure for autonomic management of green datacenters , 2011, IEEE Network.

[22]  Jeffrey Rambo,et al.  Modeling of data center airflow and heat transfer: State of the art and future trends , 2007, Distributed and Parallel Databases.

[23]  Cullen E. Bash,et al.  DIMENSIONLESS PARAMETERS FOR EVALUATION OF THERMAL DESIGN AND PERFORMANCE OF LARGE-SCALE DATA CENTERS , 2002 .