DRL-cloud: Deep reinforcement learning-based resource provisioning and task scheduling for cloud service providers

Cloud computing has become an attractive computing paradigm in both academia and industry. Through virtualization technology, Cloud Service Providers (CSPs) that own data centers can structure physical servers into Virtual Machines (VMs) to provide services, resources, and infrastructures to users. Profit-driven CSPs charge users for service access and VM rental, and reduce power consumption and electric bills so as to increase profit margin. The key challenge faced by CSPs is data center energy cost minimization. Prior works proposed various algorithms to reduce energy cost through Resource Provisioning (RP) and/or Task Scheduling (TS). However, they have scalability issues or do not consider TS with task dependencies, which is a crucial factor that ensures correct parallel execution of tasks. This paper presents DRL-Cloud, a novel Deep Reinforcement Learning (DRL)-based RP and TS system, to minimize energy cost for large-scale CSPs with very large number of servers that receive enormous numbers of user requests per day. A deep Q-learning-based two-stage RP-TS processor is designed to automatically generate the best long-term decisions by learning from the changing environment such as user request patterns and realistic electric price. With training techniques such as target network, experience replay, and exploration and exploitation, the proposed DRL-Cloud achieves remarkably high energy cost efficiency, low reject rate as well as low runtime with fast convergence. Compared with one of the state-of-the-art energy efficient algorithms, the proposed DRL-Cloud achieves up to 320% energy cost efficiency improvement while maintaining lower reject rate on average. For an example CSP setup with 5,000 servers and 200,000 tasks, compared to a fast round-robin baseline, the proposed DRL-Cloud achieves up to 144% runtime reduction.

[1]  Odej Kao,et al.  Nephele: efficient parallel data processing in the cloud , 2009, MTAGS '09.

[2]  Ji Li,et al.  Softmax Regression Design for Stochastic Computing Based Deep Convolutional Neural Networks , 2017, ACM Great Lakes Symposium on VLSI.

[3]  Demis Hassabis,et al.  Mastering the game of Go with deep neural networks and tree search , 2016, Nature.

[4]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[5]  Ji Li,et al.  Negotiation-based task scheduling to minimize user’s electricity bills under dynamic energy prices , 2014, 2014 IEEE Online Conference on Green Communications (OnlineGreenComm).

[6]  Ji Li,et al.  Negotiation-based resource provisioning and task scheduling algorithm for cloud systems , 2016, 2016 17th International Symposium on Quality Electronic Design (ISQED).

[7]  Yuan Yu,et al.  Dryad: distributed data-parallel programs from sequential building blocks , 2007, EuroSys '07.

[8]  Qinru Qiu,et al.  A Hierarchical Framework of Cloud Resource Allocation and Power Management Using Deep Reinforcement Learning , 2017, 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS).

[9]  Ji Li,et al.  Negotiation-based task scheduling and storage control algorithm to minimize user's electric bills under dynamic prices , 2015, The 20th Asia and South Pacific Design Automation Conference.

[10]  A. Faruqui The Ethics of Dynamic Pricing , 2010 .

[11]  Ramin Yahyapour,et al.  Service Level Agreements for Cloud Computing , 2011 .

[12]  Lihong Li,et al.  PAC model-free reinforcement learning , 2006, ICML.

[13]  Yang Xiaoguang,et al.  Research on cloud computing schedule based on improved hybrid PSO , 2013, Proceedings of 2013 3rd International Conference on Computer Science and Network Technology.

[14]  Chong Li,et al.  Model-Free Reinforcement Learning , 2019, Reinforcement Learning for Cyber-Physical Systems.

[15]  Ji Li,et al.  DSCNN: Hardware-oriented optimization for Stochastic Computing based Deep Convolutional Neural Networks , 2016, 2016 IEEE 34th International Conference on Computer Design (ICCD).

[16]  Qinru Qiu,et al.  SC-DCNN: Highly-Scalable Deep Convolutional Neural Network using Stochastic Computing , 2016, ASPLOS.

[17]  Hamed Mohsenian Rad,et al.  Optimal Residential Load Control With Price Prediction in Real-Time Electricity Pricing Environments , 2010, IEEE Transactions on Smart Grid.

[18]  Ji Li,et al.  Fundamental Challenges Toward Making the IoT a Reachable Reality , 2017, ACM Trans. Design Autom. Electr. Syst..

[19]  Ji Li,et al.  CTS2M: concurrent task scheduling and storage management for residential energy consumers under dynamic energy pricing , 2017, IET Cyper-Phys. Syst.: Theory & Appl..

[20]  Alexander Rassau,et al.  Impact of dynamic energy pricing schemes on a novel multi-user home energy management system , 2015 .

[21]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[22]  John W. Rittinghouse,et al.  Cloud Computing: Implementation, Management, and Security , 2009 .

[23]  Michael O. Duff,et al.  Reinforcement Learning Methods for Continuous-Time Markov Decision Problems , 1994, NIPS.

[24]  Yue Gao,et al.  An energy and deadline aware resource provisioning, scheduling and optimization framework for cloud systems , 2013, 2013 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS).

[25]  Ji Li,et al.  Fast and energy-aware resource provisioning and task scheduling for cloud systems , 2017, 2017 18th International Symposium on Quality Electronic Design (ISQED).

[26]  Luiz André Barroso,et al.  The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines , 2009, The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines.

[27]  Ji Li,et al.  Structural design optimization for deep convolutional neural networks using stochastic computing , 2017, Design, Automation & Test in Europe Conference & Exhibition (DATE), 2017.

[28]  Ji Li,et al.  Towards acceleration of deep convolutional neural networks using stochastic computing , 2017, 2017 22nd Asia and South Pacific Design Automation Conference (ASP-DAC).

[29]  Giancarlo Zaccone Getting Started with TensorFlow , 2016 .