Workload Management for Power Efficiency in Heterogeneous Data Centers

The cloud computing paradigm has recently emerged as a convenient solution for running different workloads on highly parallel and scalable infrastructures. One major appeal of cloud computing is its capability of abstracting hardware resources and making them easy to use. Conversely, one of the major challenges for cloud providers is the energy efficiency improvement of their infrastructures. Aimed at overcoming this challenge, heterogeneous architectures have started to become part of the standard equipment used in data centers. Despite this effort, heterogeneous systems remain difficult to program and manage, while their effectiveness has been proven only in the HPC domain. Cloud workloads are different in nature and a way to exploit heterogeneity effectively is still lacking. This paper takes a first step towards an effective use of heterogeneous architectures in cloud infrastructures. It presents an in-depth analysis of cloud workloads, highlighting where energy efficiency can be obtained. The microservices paradigm is then presented as a way of intelligently partitioning applications in such a way that different components can take advantage of the heterogeneous hardware, thus providing energy efficiency. Finally, the integration of microservices and heterogeneous architectures, as well as the challenge of managing legacy applications, is presented in the context of the OPERA project.

[1]  Karsten Schwan,et al.  VirtualPower: coordinated power management in virtualized enterprise systems , 2007, SOSP.

[2]  Rajkumar Buyya,et al.  Energy-aware resource allocation heuristics for efficient management of data centers for Cloud computing , 2012, Future Gener. Comput. Syst..

[3]  Charlene O'Hanlon,et al.  A Conversation with Werner Vogels , 2006, ACM Queue.

[4]  Christina Delimitrou,et al.  Paragon: QoS-aware scheduling for heterogeneous datacenters , 2013, ASPLOS '13.

[5]  José L. Núñez-Yáñez,et al.  Energy efficient Reconfigurable Computing with Adaptive Voltage and Logic scaling , 2014, CARN.

[6]  Thomas F. Wenisch,et al.  PowerNap: eliminating server idle power , 2009, ASPLOS.

[7]  Marco Guazzone,et al.  Mining the Workload of Real Grid Computing Systems , 2014, ArXiv.

[8]  Christian Plessl,et al.  SAVE: Towards Efficient Resource Management in Heterogeneous System Architectures , 2014, ARC.

[9]  Amin Vahdat,et al.  Managing energy and server resources in hosting centers , 2001, SOSP.

[10]  Yongmei Xu,et al.  Analyzing the Cooling Behavior of Hot and Cold Aisle Containment in Data Centers , 2013, 2013 Fourth International Conference on Emerging Intelligent Data and Web Technologies.

[11]  Urs Hölzle,et al.  Brawny cores still beat wimpy cores, most of the time , 2010 .

[12]  Amar Phanishayee,et al.  FAWN: a fast array of wimpy nodes , 2009, SOSP '09.

[13]  Christoforos E. Kozyrakis,et al.  Towards energy proportionality for large-scale latency-critical workloads , 2014, 2014 ACM/IEEE 41st International Symposium on Computer Architecture (ISCA).

[14]  Bruno Schulze,et al.  An Analysis of Public Clouds Elasticity in the Execution of Scientific Applications: a Survey , 2016, Journal of Grid Computing.

[15]  Randy H. Katz,et al.  A view of cloud computing , 2010, CACM.

[16]  Kenli Li,et al.  Energy-Aware Scheduling Algorithm with Duplication on Heterogeneous Computing Systems , 2012, 2012 ACM/IEEE 13th International Conference on Grid Computing.

[17]  Daniel Mossé,et al.  Octopus-Man: QoS-driven task management for heterogeneous multicores in warehouse-scale computers , 2015, 2015 IEEE 21st International Symposium on High Performance Computer Architecture (HPCA).

[18]  Stephen P. Crago,et al.  Heterogeneous Cloud Computing: The Way Forward , 2015, Computer.

[19]  Marco D. Santambrogio,et al.  Reconfigurable computing : architectures, tools, and applications : 10th International Symposium, ARC 2014, Vilamoura, Portugal, April 14-16, 2014, proceedings , 2014 .

[20]  Thomas F. Wenisch,et al.  Power management of online data-intensive services , 2011, 2011 38th Annual International Symposium on Computer Architecture (ISCA).

[21]  Sam Newman,et al.  Building microservices - designing fine-grained systems, 1st Edition , 2015 .

[22]  Christian Engelmann,et al.  Proactive fault tolerance for HPC with Xen virtualization , 2007, ICS '07.

[23]  Luiz André Barroso,et al.  The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines, Second Edition , 2013, The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines, Second Edition.

[24]  Kevin Skadron,et al.  Bubble-up: Increasing utilization in modern warehouse scale computers via sensible co-locations , 2011, 2011 44th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO).

[25]  Susanne Albers,et al.  Race to idle: New algorithms for speed scaling with a sleep state , 2012, TALG.

[26]  Lingjia Tang,et al.  Bubble-flux: precise online QoS management for increased utilization in warehouse scale computers , 2013, ISCA.