Cloud Resource Allocation from the User Perspective: A Bare-Bones Reinforcement Learning Approach

Cloud computing enables effortless access to a seemingly infinite shared pool of resources, on a pay-per-use basis. As a result, a new challenge has emerged: designing control mechanisms to precisely meet the actual workload requirements of cloud applications in an online manner. To this end, a variety of complex resource management issues have to be addressed, because workloads in the cloud are of a dynamic and heterogeneous nature, and traditional algorithms do not cope well within this context. In this work, we adopt the point of view of the user of a cloud infrastructure and focus on the task of controlling leased resources. We formulate this task as a Reinforcement Learning problem and we simulate the decision-making process of a controller implementing the Q-learning algorithm. We conduct an experimental study, the outcomes of which offer valuable insight into the advantages and shortcomings of using Reinforcement Learning to implement such adaptive cloud resource controllers.

[1]  Ioannis Konstantinou,et al.  Automated, Elastic Resource Provisioning for NoSQL Clusters Using TIRAMOLA , 2013, 2013 13th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing.

[2]  Xiaohui Gu,et al.  AGILE: Elastic Distributed Resource Scaling for Infrastructure-as-a-Service , 2013, ICAC.

[3]  Isis Truck,et al.  Using Reinforcement Learning for Autonomic Resource Allocation in Clouds: towards a fully automated workflow , 2011 .

[4]  Ronald A. Howard,et al.  Dynamic Programming and Markov Processes , 1960 .

[5]  Francisco Facchinei,et al.  Resource management in multi-cloud scenarios via reinforcement learning , 2015, 2015 34th Chinese Control Conference (CCC).

[6]  Paul Marshall,et al.  Elastic Site: Using Clouds to Elastically Extend Site Resources , 2010, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing.

[7]  Cheng-Zhong Xu,et al.  Coordinated Self-Configuration of Virtual Machines and Appliances Using a Model-Free Learning Approach , 2013, IEEE Transactions on Parallel and Distributed Systems.

[8]  Muli Ben-Yehuda,et al.  The rise of RaaS: the resource-as-a-service cloud , 2014, CACM.

[9]  Ioannis Konstantinou,et al.  Automated workload-aware elasticity of NoSQL clusters in the cloud , 2014, 2014 IEEE International Conference on Big Data (Big Data).

[10]  Daniel A. Menascé,et al.  Near-Optimal Allocation of VMs from IaaS Providers by SaaS Providers , 2015, 2015 International Conference on Cloud and Autonomic Computing.

[11]  Moustafa Ghanem,et al.  Future Generation Computer Systems ( ) – Future Generation Computer Systems Enabling Cost-aware and Adaptive Elasticity of Multi-tier Cloud Applications , 2022 .

[12]  Wouter Joosen,et al.  Middleware for efficient and confidentiality-aware federation of access control policies , 2013, Journal of Internet Services and Applications.

[13]  Stuart Dreyfus,et al.  Richard Bellman on the Birth of Dynamic Programming , 2002, Oper. Res..

[14]  Yue Tan,et al.  An Adaptive Learning Approach for Efficient Resource Provisioning in Cloud Services , 2015, PERV.

[15]  Claus Pahl,et al.  Self-Learning Cloud Controllers: Fuzzy Q-Learning for Knowledge Evolution , 2015, 2015 International Conference on Cloud and Autonomic Computing.

[16]  Rolf Stadler,et al.  Resource Management in Clouds: Survey and Research Challenges , 2015, Journal of Network and Systems Management.

[17]  Alessandro Maria Rizzi,et al.  Optimal Map Reduce Job Capacity Allocation in Cloud Systems , 2015, PERV.

[18]  Yudi Wei,et al.  DynaQoS: Model-free self-tuning fuzzy control of virtualized resources for QoS provisioning , 2011, 2011 IEEE Nineteenth IEEE International Workshop on Quality of Service.

[19]  Kevin Lee,et al.  Empirical prediction models for adaptive resource provisioning in the cloud , 2012, Future Gener. Comput. Syst..

[20]  J. Doyle,et al.  Survey of Time Preference, Delay Discounting Models , 2012, Judgment and Decision Making.

[21]  Danilo Ardagna,et al.  Quality-of-service in cloud computing: modeling techniques and their applications , 2014, Journal of Internet Services and Applications.

[22]  Chris Watkins,et al.  Learning from delayed rewards , 1989 .

[23]  Martin L. Puterman,et al.  Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[24]  Jie Yang,et al.  A Profile-Based Approach to Just-in-Time Scalability for Cloud Applications , 2009, 2009 IEEE International Conference on Cloud Computing.

[25]  R. Bellman A Markovian Decision Process , 1957 .