Toward Efficient Compute-Intensive Job Allocation for Green Data Centers: A Deep Reinforcement Learning Approach

Reducing the energy consumption of the servers in a data center via proper job allocation is desirable. Existing advanced job allocation algorithms, based on constrained optimization formulations capturing servers' complex power consumption and thermal dynamics, often scale poorly with the data center size and optimization horizon. This paper applies deep reinforcement learning (DRL) to build an allocation algorithm for long-lasting and compute-intensive jobs that are increasingly seen among today's computation demands. Specifically, a deep Q-network is trained to allocate jobs, aiming to maximize a cumulative reward over long horizons. The training is performed offline using a computational model based on long short-term memory networks that capture the servers' power and thermal dynamics. This offline training approach avoids slow online convergence, low energy efficiency, and potential server overheating during the DRL's extensive state-action space exploration if it directly interacts with the physical data center in the usually adopted online learning scheme. At run time, the trained Q-network is forward-propagated with little computation to allocate jobs. Evaluation based on 8 months' physical state and job arrival records from a national supercomputing data center hosting 1,152 processors shows that our solution reduces computing power consumption by nearly 10% and processor temperature by more than 3°C without sacrificing job processing throughput.

[1]  Srikanth Kandula,et al.  Resource Management with Deep Reinforcement Learning , 2016, HotNets.

[2]  Weiwei Lin,et al.  Random task scheduling scheme based on reinforcement learning in cloud computing , 2015, Cluster Computing.

[3]  Steve Greenberg,et al.  Best Practices for Data Centers: Lessons Learned from Benchmarking 22 Data Centers , 2006 .

[4]  Mohsen Guizani,et al.  Efficient datacenter resource utilization through cloud resource overcommitment , 2015, 2015 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS).

[5]  Athanasios V. Vasilakos,et al.  Thermal-Aware Scheduling of Batch Jobs in Geographically Distributed Data Centers , 2014, IEEE Transactions on Cloud Computing.

[6]  Bianca Schroeder,et al.  Temperature management in data centers: why some (might) like it hot , 2012, SIGMETRICS '12.

[7]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[8]  Jaume Salom,et al.  Energy-efficient, thermal-aware modeling and simulation of data centers: The CoolEmAll approach and evaluation results , 2015, Ad Hoc Networks.

[9]  Pasi Liljeberg,et al.  Energy-Efficient Virtual Machines Consolidation in Cloud Data Centers Using Reinforcement Learning , 2014, 2014 22nd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing.

[10]  Richard S. Sutton,et al.  A Deeper Look at Experience Replay , 2017, ArXiv.

[11]  Richard E. Brown,et al.  United States Data Center Energy Usage Report , 2016 .

[12]  Teck Chaw Ling,et al.  Thermal-Aware Scheduling in Green Data Centers , 2015, ACM Comput. Surv..

[13]  Xiaohua Jia,et al.  Green scheduling for cloud data centers using renewable resources , 2015, 2015 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS).

[14]  Li-Der Chou,et al.  A novel VM workload prediction using Grey Forecasting model in cloud data center , 2014, The International Conference on Information Networking 2014 (ICOIN2014).

[15]  Ching-Hsien Hsu,et al.  GEODIS: towards the optimization of data locality-aware job scheduling in geo-distributed data centers , 2017, Computing.

[16]  J. Christopher Beck,et al.  Multi-stage resource-aware scheduling for data centers with heterogeneous servers , 2017, J. Sched..

[17]  Jürgen Schmidhuber,et al.  Learning to forget: continual prediction with LSTM , 1999 .

[18]  Lizhe Wang,et al.  Thermal aware workload placement with task-temperature profiles in a data center , 2011, The Journal of Supercomputing.

[19]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[20]  Guoliang Xing,et al.  A High-Fidelity Temperature Distribution Forecasting System for Data Centers , 2012, 2012 IEEE 33rd Real-Time Systems Symposium.

[21]  Qinru Qiu,et al.  A Hierarchical Framework of Cloud Resource Allocation and Power Management Using Deep Reinforcement Learning , 2017, 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS).

[22]  Sandeep K. S. Gupta,et al.  Energy-Efficient Thermal-Aware Task Scheduling for Homogeneous High-Performance Computing Data Centers: A Cyber-Physical Approach , 2008, IEEE Transactions on Parallel and Distributed Systems.

[23]  S. Gupta,et al.  Thermal-aware task scheduling for data centers through minimizing heat recirculation , 2007, 2007 IEEE International Conference on Cluster Computing.

[24]  Yoshua Bengio,et al.  Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.

[25]  Marko Bacic,et al.  Model predictive control , 2003 .

[26]  Sandeep K. S. Gupta,et al.  Thermal-Aware Task Scheduling to Minimize Energy Usage of Blade Server Based Datacenters , 2006, 2006 2nd IEEE International Symposium on Dependable, Autonomic and Secure Computing.

[27]  Ayan Banerjee,et al.  Spatio-temporal thermal-aware job scheduling to minimize energy consumption in virtualized heterogeneous data centers , 2009, Comput. Networks.

[28]  Eric O'Shaughnessy,et al.  Status and Trends in the U.S. Voluntary Green Power Market (2015 Data) , 2016 .