论文信息 - Cloud Resource Allocation from the User Perspective: A Bare-Bones Reinforcement Learning Approach

Cloud Resource Allocation from the User Perspective: A Bare-Bones Reinforcement Learning Approach

Cloud computing enables effortless access to a seemingly infinite shared pool of resources, on a pay-per-use basis. As a result, a new challenge has emerged: designing control mechanisms to precisely meet the actual workload requirements of cloud applications in an online manner. To this end, a variety of complex resource management issues have to be addressed, because workloads in the cloud are of a dynamic and heterogeneous nature, and traditional algorithms do not cope well within this context. In this work, we adopt the point of view of the user of a cloud infrastructure and focus on the task of controlling leased resources. We formulate this task as a Reinforcement Learning problem and we simulate the decision-making process of a controller implementing the Q-learning algorithm. We conduct an experimental study, the outcomes of which offer valuable insight into the advantages and shortcomings of using Reinforcement Learning to implement such adaptive cloud resource controllers.

[1] Ioannis Konstantinou,et al. Automated, Elastic Resource Provisioning for NoSQL Clusters Using TIRAMOLA , 2013, 2013 13th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing.

[2] Xiaohui Gu,et al. AGILE: Elastic Distributed Resource Scaling for Infrastructure-as-a-Service , 2013, ICAC.

[3] Isis Truck,et al. Using Reinforcement Learning for Autonomic Resource Allocation in Clouds: towards a fully automated workflow , 2011 .

[4] Ronald A. Howard,et al. Dynamic Programming and Markov Processes , 1960 .

[5] Francisco Facchinei,et al. Resource management in multi-cloud scenarios via reinforcement learning , 2015, 2015 34th Chinese Control Conference (CCC).

[6] Paul Marshall,et al. Elastic Site: Using Clouds to Elastically Extend Site Resources , 2010, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing.

[7] Cheng-Zhong Xu,et al. Coordinated Self-Configuration of Virtual Machines and Appliances Using a Model-Free Learning Approach , 2013, IEEE Transactions on Parallel and Distributed Systems.

[8] Muli Ben-Yehuda,et al. The rise of RaaS: the resource-as-a-service cloud , 2014, CACM.

[9] Ioannis Konstantinou,et al. Automated workload-aware elasticity of NoSQL clusters in the cloud , 2014, 2014 IEEE International Conference on Big Data (Big Data).

[10] Daniel A. Menascé,et al. Near-Optimal Allocation of VMs from IaaS Providers by SaaS Providers , 2015, 2015 International Conference on Cloud and Autonomic Computing.

[11] Moustafa Ghanem,et al. Future Generation Computer Systems ( ) – Future Generation Computer Systems Enabling Cost-aware and Adaptive Elasticity of Multi-tier Cloud Applications , 2022 .

[12] Wouter Joosen,et al. Middleware for efficient and confidentiality-aware federation of access control policies , 2013, Journal of Internet Services and Applications.

[13] Stuart Dreyfus,et al. Richard Bellman on the Birth of Dynamic Programming , 2002, Oper. Res..

[14] Yue Tan,et al. An Adaptive Learning Approach for Efficient Resource Provisioning in Cloud Services , 2015, PERV.

[15] Claus Pahl,et al. Self-Learning Cloud Controllers: Fuzzy Q-Learning for Knowledge Evolution , 2015, 2015 International Conference on Cloud and Autonomic Computing.

[16] Rolf Stadler,et al. Resource Management in Clouds: Survey and Research Challenges , 2015, Journal of Network and Systems Management.

[17] Alessandro Maria Rizzi,et al. Optimal Map Reduce Job Capacity Allocation in Cloud Systems , 2015, PERV.

[18] Yudi Wei,et al. DynaQoS: Model-free self-tuning fuzzy control of virtualized resources for QoS provisioning , 2011, 2011 IEEE Nineteenth IEEE International Workshop on Quality of Service.

[19] Kevin Lee,et al. Empirical prediction models for adaptive resource provisioning in the cloud , 2012, Future Gener. Comput. Syst..

[20] J. Doyle,et al. Survey of Time Preference, Delay Discounting Models , 2012, Judgment and Decision Making.

[21] Danilo Ardagna,et al. Quality-of-service in cloud computing: modeling techniques and their applications , 2014, Journal of Internet Services and Applications.

[22] Chris Watkins,et al. Learning from delayed rewards , 1989 .

[23] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[24] Jie Yang,et al. A Profile-Based Approach to Just-in-Time Scalability for Cloud Applications , 2009, 2009 IEEE International Conference on Cloud Computing.

[25] R. Bellman. A Markovian Decision Process , 1957 .