Data centers consume tremendous amounts of energy in terms of power distribution and cooling. Dynamic capacity provisioning is a promising approach for reducing energy consumption by dynamically adjusting the number of active machines to match resource demands. However, despite extensive studies of the problem, existing solutions have not fully considered the heterogeneity of both workload and machine hardware found in production environments. In particular, production data centers often comprise heterogeneous machines with different capacities and energy consumption characteristics. Meanwhile, the production cloud workloads typically consist of diverse applications with different priorities, performance and resource requirements. Failure to consider the heterogeneity of both machines and workloads will lead to both sub-optimal energy-savings and long scheduling delays, due to incompatibility between workload requirements and the resources offered by the provisioned machines. To address this limitation, we present Harmony, a Heterogeneity-Aware dynamic capacity provisioning scheme for cloud data centers. Specifically, we first use the K-means clustering algorithm to divide workload into distinct task classes with similar characteristics in terms of resource and performance requirements. Then we present a technique that dynamically adjusting the number of machines to minimize total energy consumption and scheduling delay. Simulations using traces from a Google’s compute cluster demonstrate Harmony can reduce energy by 28 percent compared to heterogeneity-oblivious solutions.
[1]
Jordi Petit,et al.
A Guide to Concentration Bounds
,
2001
.
[2]
Raouf Boutaba,et al.
On Cloud computational models and the heterogeneity challenge
,
2011,
Journal of Internet Services and Applications.
[3]
Sanjeev Khanna,et al.
On Multidimensional Packing Problems
,
2004,
SIAM J. Comput..
[4]
Anand Sivasubramaniam,et al.
Managing server energy and operational costs in hosting centers
,
2005,
SIGMETRICS '05.
[5]
Gwilym M. Jenkins,et al.
Time series analysis, forecasting and control
,
1972
.
[6]
Stephen P. Boyd,et al.
Convex Optimization
,
2004,
Algorithms and Theory of Computation Handbook.
[7]
Archana Ganapathi,et al.
Analysis and Lessons from a Publicly Available Google Cluster Trace
,
2010
.
[8]
Scott Shenker,et al.
Usenix Association 10th Usenix Symposium on Networked Systems Design and Implementation (nsdi '13) 185 Effective Straggler Mitigation: Attack of the Clones
,
2022
.