Optimal Algorithms for Right-Sizing Data Centers

Electricity cost is a dominant and rapidly growing expense in data centers. Unfortunately, much of the consumed energy is wasted because servers are idle for extended periods of time. We study a capacity management problem that dynamically right-sizes a data center, matching the number of active servers with the varying demand for computing capacity. We resort to a data-center optimization problem introduced by Lin, Wierman, Andrew and Thereska~\citeW1a,W1 that, over a time horizon, minimizes a combined objective function consisting of operating cost, modeled by a sequence of convex functions, and server switching cost. All prior work addresses a continuous setting in which the number of active servers, at any time, may take a fractional value. In this paper, we investigate for the first time the discrete data-center optimization problem where the number of active servers, at any time, must be integer valued. Thereby we seek truly feasible solutions. First, we show that the offline problem can be solved in polynomial time. Our algorithm relies on a new, yet intuitive graph theoretic model of the optimization problem and performs binary search in a layered graph. Second, we study the online problem and extend the algorithm \em Lazy Capacity Provisioning (LCP) by Lin et al. \citeW1a,W1 to the discrete setting. We prove that LCP is 3-competitive. Moreover, we show that no deterministic online algorithm can achieve a competitive ratio smaller than~3. Hence, while LCP does not attain an optimal competitiveness in the continuous setting, it does so in the discrete problem examined here. We prove that the lower bound of~3 also holds in a problem variant with more restricted operating cost functions, introduced by Lin et al. \citeW1a. Finally, we address the continuous setting and give a lower bound of~2 on the best competitiveness of online algorithms. This matches an upper bound by Bansal et al. \citeB+. A lower bound of~2 was also recently shown by Antoniadis and Schewior~\citeA2. We develop an independent proof that extends to the scenario with more restricted operating cost.

[1]  Yonggang Wen,et al.  Data Center Energy Consumption Modeling: A Survey , 2016, IEEE Communications Surveys & Tutorials.

[2]  Lachlan L. H. Andrew,et al.  A tale of two metrics: simultaneous bounds on competitiveness and regret , 2013, SIGMETRICS '13.

[3]  Luiz André Barroso,et al.  The Case for Energy-Proportional Computing , 2007, Computer.

[4]  Lachlan L. H. Andrew,et al.  Dynamic Right-Sizing for Power-Proportional Data Centers , 2011, IEEE/ACM Transactions on Networking.

[5]  Lachlan L. H. Andrew,et al.  Greening Geographical Load Balancing , 2015, IEEE/ACM Transactions on Networking.

[6]  Zygmunt J. Haas,et al.  On Power Management Policies for Data Centers , 2015, 2015 IEEE International Conference on Data Science and Data Intensive Systems.

[7]  Kirk Pruhs,et al.  A 2-Competitive Algorithm For Online Convex Optimization With Switching Costs , 2015, APPROX-RANDOM.

[8]  Samir Khuller,et al.  Energy efficient scheduling via partial shutdown , 2009, SODA '10.

[9]  Randy H. Katz,et al.  Above the Clouds: A Berkeley View of Cloud Computing , 2009 .

[10]  Kevin Schewior,et al.  A Tight Lower Bound for Online Convex Optimization with Switching Costs , 2017, WAOA.

[11]  Lachlan L. H. Andrew,et al.  Online algorithms for geographical load balancing , 2012, 2012 International Green Computing Conference (IGCC).

[12]  Mor Harchol-Balter,et al.  How data center size impacts the effectiveness of dynamic power management , 2011, 2011 49th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[13]  Lachlan L. H. Andrew,et al.  Online dynamic capacity provisioning in data centers , 2011, 2011 49th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[14]  Richard E. Brown,et al.  United States Data Center Energy Usage Report , 2016 .

[15]  Kirk Pruhs,et al.  Chasing Convex Bodies and Functions , 2016, LATIN.

[16]  Samir Khuller,et al.  Generalized machine activation problems , 2011, SODA '11.

[17]  Mor Harchol-Balter,et al.  Server farms with setup costs , 2010, Perform. Evaluation.