Learning Resource Allocation and Pricing for Cloud Profit Maximization

Cloud computing has been widely adopted to support various computation services. A fundamental problem faced by cloud providers is how to efficiently allocate resources upon user requests and price the resource usage, in order to maximize resource efficiency and hence provider profit. Existing studies establish detailed performance models of cloud resource usage, and propose offline or online algorithms to decide allocation and pricing. Differently, we adopt a blackbox approach, and leverage model-free Deep Reinforcement Learning (DRL) to capture dynamics of cloud users and better characterize inherent connections between an optimal allocation/pricing policy and the states of the dynamic cloud system. The goal is to learn a policy that maximizes net profit of the cloud provider through trial and error, which is better than decisions made on explicit performance models. We combine long short-term memory (LSTM) units with fully-connected neural networks in our DRL to deal with online user arrivals, and adjust the output and update methods of basic DRL algorithms to address both resource allocation and pricing. Evaluation based on real-world datasets shows that our DRL approach outperforms basic DRL algorithms and state-of-theart white-box online cloud resource allocation/pricing algorithms significantly, in terms of both profit and the number of accepted users.

[1]  Feng Zhao,et al.  Virtual machine power metering and provisioning , 2010, SoCC '10.

[2]  Besmir Sejdiu,et al.  Pricing Schemes in Cloud Computing: An Overview , 2016 .

[3]  Zongpeng Li,et al.  An online auction framework for dynamic resource provisioning in cloud computing , 2014, SIGMETRICS '14.

[4]  Yan Hong,et al.  Reinforcement Mechanism Design, with Applications to Dynamic Pricing in Sponsored Search Auctions , 2017, ArXiv.

[5]  Muli Ben-Yehuda,et al.  Deconstructing Amazon EC2 Spot Instance Pricing , 2011, 2011 IEEE Third International Conference on Cloud Computing Technology and Science.

[6]  Demis Hassabis,et al.  Mastering the game of Go with deep neural networks and tree search , 2016, Nature.

[7]  David Silver,et al.  Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.

[8]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[9]  Christopher Leckie,et al.  Analysing Virtual Machine Usage in Cloud Computing , 2013, 2013 IEEE Ninth World Congress on Services.

[10]  Peter Stone,et al.  Deep Reinforcement Learning in Parameterized Action Space , 2015, ICLR.

[11]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[12]  Kristina Lerman,et al.  Resource allocation in the grid using reinforcement learning , 2004, Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, 2004. AAMAS 2004..

[13]  Mihaela van der Schaar,et al.  Dynamic pricing for smart grid with reinforcement learning , 2014, 2014 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS).

[14]  Yishay Mansour,et al.  Welfare and Profit Maximization with Production Costs , 2011, 2011 IEEE 52nd Annual Symposium on Foundations of Computer Science.

[15]  Tim Oates,et al.  Automated Cloud Provisioning on AWS using Deep Reinforcement Learning , 2017, ArXiv.

[16]  W. Souma Physics of Personal Income , 2002, cond-mat/0202388.

[17]  Stuart J. Russell,et al.  Automated Pricing Agents in the On-Demand Economy , 2016 .

[18]  Rajarshi Das,et al.  A Hybrid Reinforcement Learning Approach to Autonomic Resource Allocation , 2006, 2006 IEEE International Conference on Autonomic Computing.

[19]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[20]  Marco Gramaglia,et al.  Optimising 5G infrastructure markets: The business of network slicing , 2017, IEEE INFOCOM 2017 - IEEE Conference on Computer Communications.

[21]  Ricardo Bianchini,et al.  Resource Central: Understanding and Predicting Workloads for Improved Resource Management in Large Cloud Platforms , 2017, SOSP.

[22]  Zhen Li,et al.  Understanding Hidden Memories of Recurrent Neural Networks , 2017, 2017 IEEE Conference on Visual Analytics Science and Technology (VAST).

[23]  Zongpeng Li,et al.  Online Auctions in IaaS Clouds: Welfare and Profit Maximization With Server Costs , 2015, IEEE/ACM Transactions on Networking.