WASP: Workload Adaptive Energy-Latency Optimization in Server Farms Using Server Low-Power States

With the growing energy demands from server farms, it becomes necessary to understand the tradeoffs between energy consumption and application performance. Typically, server farms are provisioned for peak load even when they are mostly operating at low utilization levels. This results in wasteful energy consumption. At the same time, application workloads have Quality of Service (QoS) constraints that need to be satisfied. Optimizing server farm energy consumption with QoS constraints is a challenging task since the workload can have variabilities in job sizes, job arrival patterns and system utilization levels. In this paper, we present WASP, where we explore techniques that make smart use of the processor and system low-power states, and orchestrate their use with workload adaptivity for more effective energy management. We perform an extensive study of Energy-Latency tradeoffs with simulations, and evaluate WASP on a testbed with a cluster of servers. Our experiments on real systems show that WASP achieves up to 57% energy reduction over a naive policy that uses a shallow processor sleep state when there are no jobs to execute, and 39% over a delay timer based approach while maintaining the 90th percentile job service latency to be under 2x job execution time.

[1]  Christina Delimitrou,et al.  Quasar: resource-efficient and QoS-aware cluster management , 2014, ASPLOS.

[2]  Erik-Jan van Baaren,et al.  WikiBench: A distributed, Wikipedia based web application benchmark , 2009 .

[3]  Fan Yao,et al.  Watts-inside: A hardware-software cooperative approach for Multicore Power Debugging , 2013, 2013 IEEE 31st International Conference on Computer Design (ICCD).

[4]  Lieven Eeckhout,et al.  Trends in Server Energy Proportionality , 2011, Computer.

[5]  Guru Venkataramani,et al.  The Need for Power Debugging in the Multi-Core Environment , 2012, IEEE Computer Architecture Letters.

[6]  Christoforos E. Kozyrakis,et al.  Towards energy proportionality for large-scale latency-critical workloads , 2014, 2014 ACM/IEEE 41st International Symposium on Computer Architecture (ISCA).

[7]  Thomas F. Wenisch,et al.  DreamWeaver: architectural support for deep sleep , 2012, ASPLOS XVII.

[8]  Nam Sung Kim,et al.  SleepScale: Runtime joint speed scaling and sleep states management for power efficient data centers , 2014, 2014 ACM/IEEE 41st International Symposium on Computer Architecture (ISCA).

[9]  Daniel Wong,et al.  Implications of high energy proportional servers on cluster-wide energy proportionality , 2014, 2014 IEEE 20th International Symposium on High Performance Computer Architecture (HPCA).

[10]  Daniel Wong,et al.  KnightShift: Scaling the Energy Proportionality Wall through Server-Level Heterogeneity , 2012, 2012 45th Annual IEEE/ACM International Symposium on Microarchitecture.

[11]  Wolf-Dietrich Weber,et al.  Power provisioning for a warehouse-sized computer , 2007, ISCA '07.

[12]  Guru Venkataramani,et al.  enDebug: A hardware-software framework for automated energy debugging , 2016, J. Parallel Distributed Comput..

[13]  Guru Venkataramani,et al.  A Hardware-Software Cooperative Approach for Application Energy Profiling , 2015, IEEE Computer Architecture Letters.

[14]  Gu-Yeon Wei,et al.  Tradeoffs between power management and tail latency in warehouse-scale applications , 2014, 2014 IEEE International Symposium on Workload Characterization (IISWC).

[15]  Thomas F. Wenisch,et al.  PowerNap: eliminating server idle power , 2009, ASPLOS.

[16]  Fan Yao,et al.  A Dual Delay Timer Strategy for Optimizing Server Farm Energy , 2015, 2015 IEEE 7th International Conference on Cloud Computing Technology and Science (CloudCom).

[17]  Fan Yao,et al.  A comparative analysis of data center network architectures , 2014, 2014 IEEE International Conference on Communications (ICC).

[18]  Mor Harchol-Balter,et al.  AutoScale: Dynamic, Robust Capacity Management for Multi-Tier Data Centers , 2012, TOCS.

[19]  Diana Marculescu,et al.  Analysis of dynamic voltage/frequency scaling in chip-multiprocessors , 2007, Proceedings of the 2007 international symposium on Low power electronics and design (ISLPED '07).

[20]  Mor Harchol-Balter,et al.  How data center size impacts the effectiveness of dynamic power management , 2011, 2011 49th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[21]  Junjie Wu,et al.  BigHouse: A simulation infrastructure for data center systems , 2012, 2012 IEEE International Symposium on Performance Analysis of Systems & Software.

[22]  Richard E. Brown,et al.  Report to Congress on Server and Data Center Energy Efficiency: Public Law 109-431 , 2008 .

[23]  David C. Snowdon,et al.  Power Management and Dynamic Voltage Scaling: Myths and Facts , 2005 .