TS-Bat: Leveraging Temporal-Spatial Batching for Data Center Energy Optimization

Data centers that run latency-critical workloads are typically provisioned for peak load even when they are operating at low levels of system utilization. Optimizing energy in data centers with Quality of Service (QoS) constraints is challenging since variabilities exist in job sizes, system utilization, and server configurations. Therefore, it is impractical to have a single configuration for energy management that works well across various scenarios. In this paper, we propose TS-Bat, a new data center energy optimization framework that judiciously integrates spatial and temporal job batching while meeting QoS constraints. TS-Bat works on commodity server platforms and comprises two major components: a temporal batching engine that batches the incoming jobs and creates opportunities for the processor to enter low power modes, and a spatial batching engine that schedules the batched jobs on to a server that is estimated to be idle. We implement a prototype of TS-Bat on a testbed with a cluster of servers, and evaluate TS-Bat on a variety of workloads. Our results show that pure temporal batching achieves 49% savings in CPU energy compared to a baseline configuration without batching. Through combining temporal and spatial batching, TS-Bat increases the energy savings by up to 68%.

[1]  Fan Yao,et al.  A Dual Delay Timer Strategy for Optimizing Server Farm Energy , 2015, 2015 IEEE 7th International Conference on Cloud Computing Technology and Science (CloudCom).

[2]  David Mosberger,et al.  httperf—a tool for measuring web server performance , 1998, PERV.

[3]  Wolf-Dietrich Weber,et al.  Power provisioning for a warehouse-sized computer , 2007, ISCA '07.

[4]  J. Koomey Worldwide electricity used in data centers , 2008 .

[5]  Fan Yao,et al.  WASP: Workload Adaptive Energy-Latency Optimization in Server Farms Using Server Low-Power States , 2017, 2017 IEEE 10th International Conference on Cloud Computing (CLOUD).

[6]  Kai Li,et al.  The PARSEC benchmark suite: Characterization and architectural implications , 2008, 2008 International Conference on Parallel Architectures and Compilation Techniques (PACT).

[7]  Fan Yao,et al.  A comparative analysis of data center network architectures , 2014, 2014 IEEE International Conference on Communications (ICC).

[8]  Christoforos E. Kozyrakis,et al.  Towards energy proportionality for large-scale latency-critical workloads , 2014, 2014 ACM/IEEE 41st International Symposium on Computer Architecture (ISCA).

[9]  Thomas F. Wenisch,et al.  DreamWeaver: architectural support for deep sleep , 2012, ASPLOS XVII.

[10]  James Norris,et al.  Agile, efficient virtualization power management with low-latency server power states , 2013, ISCA.

[11]  David M. Brooks,et al.  CARB: A C-State Power Management Arbiter for Latency-Critical Workloads , 2017, IEEE Computer Architecture Letters.

[12]  Mor Harchol-Balter,et al.  AutoScale: Dynamic, Robust Capacity Management for Multi-Tier Data Centers , 2012, TOCS.