A Preliminary Study of Machine Learning Workload Prediction Techniques for Cloud Applications

Cloud computing has transformed the means of computing in recent years with several benefits over traditional systems, like scalability and high availability. However, there are still some opportunities, especially in the area of resource provisioning and scaling [13]. Since workload may fluctuate a lot in certain environments, over-provisioning is a common practice to avoid abrupt Quality of Service (QoS) drops that may result in Service Level Agreement (SLA) violations, but at the price of an increase in provisioning costs and energy consumption. Workload prediction is one of the strategies by which efficiency and operational cost of a cloud can be improved [13]. Knowing demand in advance allows the previous allocation of sufficient resources to maintain QoS and avoid SLA violations [1]. This paper presents the advantages and disadvantages of three workload prediction techniques when applied in the context of cloud computing. Our preliminary results compare ARIMA, MLP, and GRU under different cloud configurations to help administrators choose the more appropriate and efficient predictive model for their specific problem.

[1]  Tao Li,et al.  Cloud Analytics for Capacity Planning and Instant VM Provisioning , 2013, IEEE Transactions on Network and Service Management.

[2]  Jitendra Kumar,et al.  Workload prediction in cloud using artificial neural network and adaptive differential evolution , 2018, Future Gener. Comput. Syst..

[3]  Rajkumar Buyya,et al.  SLA-based virtual machine management for heterogeneous workloads in a cloud datacenter , 2014, J. Netw. Comput. Appl..

[4]  Yoshua Bengio,et al.  Learning long-term dependencies with gradient descent is difficult , 1994, IEEE Trans. Neural Networks.

[5]  Kranthimanoj Nagothu,et al.  Prediction of cloud data center networks loads using stochastic and neural models , 2011, 2011 6th International Conference on System of Systems Engineering.

[6]  Jing Zhang,et al.  The placement method of resources and applications based on request prediction in cloud data center , 2014, Inf. Sci..

[7]  Asser N. Tantawi,et al.  Analytic modeling of multitier Internet applications , 2007, TWEB.

[8]  Thomas J. Hacker,et al.  Predictive Model for Dynamically Provisioning Resources in Multi-Tier Web Applications , 2016, 2016 IEEE International Conference on Cloud Computing Technology and Science (CloudCom).

[9]  Daniel A. Menascé,et al.  Resource Allocation for Autonomic Data Centers using Analytic Performance Models , 2005, Second International Conference on Autonomic Computing (ICAC'05).

[10]  Rahul Jain,et al.  Strategic arrivals into queueing networks , 2010, 2010 48th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[11]  Kang G. Shin,et al.  What does control theory bring to systems research? , 2009, OPSR.

[12]  Maryam Amiri,et al.  Survey on prediction models of applications for resources provisioning in cloud , 2017, J. Netw. Comput. Appl..

[13]  Marcos José Santana,et al.  Combining time series prediction models using genetic algorithm to autoscaling Web applications hosted in the cloud infrastructure , 2015, Neural Computing and Applications.

[14]  Achille Fokoue,et al.  An effective algorithm for hyperparameter optimization of neural networks , 2017, IBM J. Res. Dev..

[15]  Sunilkumar S. Manvi,et al.  Resource management for Infrastructure as a Service (IaaS) in cloud computing: A survey , 2014, J. Netw. Comput. Appl..

[16]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[17]  Frank Leymann,et al.  Runtime Prediction of Service Level Agreement Violations for Composite Services , 2009, ICSOC/ServiceWave Workshops.

[18]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[19]  Yu Zhou,et al.  A new method based on PSR and EA-GMDH for host load prediction in cloud computing system , 2014, The Journal of Supercomputing.

[20]  Bo Cheng,et al.  A cost-aware auto-scaling approach using the workload prediction in service clouds , 2014, Inf. Syst. Frontiers.

[21]  Inderveer Chana,et al.  A Survey on Resource Scheduling in Cloud Computing: Issues and Challenges , 2016, Journal of Grid Computing.

[22]  Rajkumar Buyya,et al.  Workload Prediction Using ARIMA Model and Its Impact on Cloud Applications’ QoS , 2015, IEEE Transactions on Cloud Computing.

[23]  Andreas Muller Introduction to Machine Learning with Python , 2016 .