Using Machine Learning Algorithms for Cloud Client Prediction Models in a Web VM Resource Provisioning Environment

To meet Service Level Agreement (SLA) requirements, Virtual Machine (VM) resources in cloud computing must be provisioned ahead of demand, because a VM takes time to instantiate. One way to do this is by predicting future resource demands. Existing research on VM resource provisioning is either reactive in its approach or uses only non-business-level metrics. In this research, a cloud client prediction model for the TPC-W benchmark web application is developed and evaluated using three machine learning techniques: Support Vector Regression (SVR), Neural Networks (NN) and Linear Regression (LR). Business-level metrics for response time and throughput are included in the prediction model, with the aim of giving cloud clients a more robust basis for scaling decisions. Results and analysis from experiments carried out on Amazon Elastic Compute Cloud (EC2) show that Support Vector Regression provides the best prediction model for random-like workload traffic patterns.
