Optimal Threshold Policies for Robust Data Center Control

With the simultaneous rise of energy costs and demand for cloud computing, efficient control of data centers becomes crucial. In the data center control problem, one needs to plan at every time step how many servers to switch on or off in order to meet stochastic job arrivals while trying to minimize electricity consumption. This problem becomes particularly challenging when servers can be of various types and jobs from different classes can only be served by certain types of server, as it is often the case in real data centers. We model this problem as a robust Markov decision process (i.e., the transition function is not assumed to be known precisely). We give sufficient conditions (which seem to be reasonable and satisfied in practice) guaranteeing that an optimal threshold policy exists. This property can then be exploited in the design of an efficient solving method, which we provide. Finally, we present some experimental results demonstrating the practicability of our approach and compare with a previous related approach based on model predictive control.

[1]  A. F. Veinott Optimal Policy for a Multi-product, Dynamic Non-Stationary Inventory Problem , 1965 .

[2]  Dieter Kalin,et al.  On the Optimality of (σ, S) Policies , 1980, Math. Oper. Res..

[3]  A. Hordijk,et al.  On the optimality of ( s , S )-policies in continuous review inventory models , 1986 .

[4]  Martin L. Puterman,et al.  Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[5]  Leslie Pack Kaelbling,et al.  On the Complexity of Solving Markov Decision Problems , 1995, UAI.

[6]  Robert Givan,et al.  Bounded-parameter Markov decision processes , 2000, Artif. Intell..

[7]  Laurent El Ghaoui,et al.  Robustness in Markov Decision Problems with Uncertain Transition Matrices , 2003, NIPS.

[8]  Christoforos E. Kozyrakis,et al.  Automatic power management schemes for Internet servers and data centers , 2005, GLOBECOM '05. IEEE Global Telecommunications Conference, 2005..

[9]  Ger Koole,et al.  Monotonicity in Markov Reward and Decision Chains: Theory and Applications , 2007, Found. Trends Stoch. Syst..

[10]  Bruno Sinopoli,et al.  Reducing data center energy consumption via coordinated cooling and load management , 2008, CLUSTER 2008.

[11]  Rajarshi Das,et al.  Expressive Power-Based Resource Allocation for Data Centers , 2009, IJCAI.

[12]  Michael I. Jordan,et al.  Automating Datacenter Operations Using Machine Learning , 2010 .

[13]  Maria Fox,et al.  Automatic Construction of Efficient Multiple Battery Usage Policies , 2011, IJCAI.

[14]  Bruno Sinopoli,et al.  A Cyber–Physical Systems Approach to Data Center Modeling and Control for Energy Efficiency , 2012, Proceedings of the IEEE.

[15]  Marco Wiering,et al.  Reinforcement Learning , 2014, Adaptation, Learning, and Optimization.

[16]  Alain Jean-Marie,et al.  Scheduling Services in a Queuing System with Impatience and Setup Costs , 2012, Comput. J..

[17]  Patrice Perny,et al.  Approximation of Lorenz-Optimal Solutions in Multiobjective Markov Decision Processes , 2013, AAAI.

[18]  Andrea Zanella,et al.  Markov Decision Processes with Threshold Based Piecewise Linear Optimal Policies , 2013, IEEE Wireless Communications Letters.

[19]  Patrice Perny,et al.  A Compromise Programming Approach to multiobjective Markov Decision Processes , 2011, Int. J. Inf. Technol. Decis. Mak..

[20]  Paul Weng,et al.  Solving Hidden-Semi-Markov-Mode Markov Decision Problems , 2014, SUM.

[21]  Bruno Sinopoli,et al.  Adaptive robust optimization for coordinated capacity and load control in data centers , 2014, 53rd IEEE Conference on Decision and Control.

[22]  Eugene A. Feinberg,et al.  Optimizing cloud utilization via switching decisions , 2014, PERV.

[23]  Jim Gao,et al.  Machine Learning Applications for Data Center Optimization , 2014 .

[24]  Christos Faloutsos,et al.  Beyond Poisson: Modeling Inter-Arrival Time of Requests in a Datacenter , 2014, PAKDD.

[25]  Benjamin Lubin,et al.  Modeling Multi-Attribute Demand for Sustainable Cloud Computing with Copulae , 2015, IJCAI.

[26]  Marek Petrik,et al.  Optimal Threshold Control for Energy Arbitrage with Degradable Battery Storage , 2015, UAI.

[27]  Daniel J. Scheeres,et al.  Heuristic Search and Receding-Horizon Planning in Complex Spacecraft Orbit Domains , 2015, SOCS.