Adaptive workload prediction for proactive auto scaling in PaaS systems

Elasticity is a key feature of cloud computing where resources are allocated and released according to user demands. Reactive auto scaling, in which the scaling actions take place just after meeting the triggering thresholds, suffers from several issues like risk of under provisioning at peak loads and over provisioning during other times. Proactive scaling solutions, where future resource demand can be forecast and necessary scaling actions enacted beforehand, can overcome these issues. Nevertheless, the effectiveness of such proactive scaling solutions depends on the accuracy of the prediction method(s) adopted. We propose a forecasting technique to enhance the accuracy of workload forecasting in cloud auto-scalers. An ensemble workload prediction mechanism based on time series and machine learning techniques is proposed to make more accurate predictions on drastically different workload patterns. In this work, we initially evaluated several forecasting models for their applicability in forecasting different workload patterns. The proposed ensemble technique is then implemented using three well-known forecasting models and tested for three real-world workloads. Simulation results show that our ensemble method produces significantly lower forecast errors compared to the use of individual models and the prediction technique employed in Apache Stratos, an open source PaaS platform.

[1]  Xifeng Yan,et al.  Workload characterization and prediction in the cloud: A multiple time series approach , 2012, 2012 IEEE Network Operations and Management Symposium.

[2]  Guoqiang Peter Zhang,et al.  Time series forecasting using a hybrid ARIMA and neural network model , 2003, Neurocomputing.

[3]  Aniruddha S. Gokhale,et al.  Efficient Autoscaling in the Cloud Using Predictive Models for Workload Forecasting , 2011, 2011 IEEE 4th International Conference on Cloud Computing.

[4]  Hui Zou,et al.  Combining time series models for forecasting , 2004, International Journal of Forecasting.

[5]  Junliang Chen,et al.  Workload Predicting-Based Automatic Scaling in Service Clouds , 2013, 2013 IEEE Sixth International Conference on Cloud Computing.

[6]  Randy H. Katz,et al.  A view of cloud computing , 2010, CACM.

[7]  P. Mell,et al.  The NIST Definition of Cloud Computing , 2011 .

[8]  George Athanasopoulos,et al.  Forecasting: principles and practice , 2013 .

[9]  LiuAnna,et al.  Empirical prediction models for adaptive resource provisioning in the cloud , 2012 .

[10]  Arvind Mohais,et al.  Intelligent techniques for forecasting multiple time series in real-world systems , 2011, Int. J. Intell. Comput. Cybern..

[11]  Gang Yin,et al.  Online Self-Reconfiguration with Performance Guarantee for Energy-Efficient Large-Scale Cloud Computing Data Centers , 2010, 2010 IEEE International Conference on Services Computing.

[12]  José Antonio Lozano,et al.  A Review of Auto-scaling Techniques for Elastic Applications in Cloud Environments , 2014, Journal of Grid Computing.

[13]  Kevin Lee,et al.  Empirical prediction models for adaptive resource provisioning in the cloud , 2012, Future Gener. Comput. Syst..

[14]  Chandra Krintz,et al.  A Pluggable Autoscaling Service for Open Cloud PaaS Systems , 2012, 2012 IEEE Fifth International Conference on Utility and Cloud Computing.

[15]  Abdelwahab Hamou-Lhadj,et al.  Analyzing auto-scaling issues in cloud environments , 2014, CASCON.

[16]  R. K. Agrawal,et al.  Combining multiple time series models through a robust weighted mechanism , 2012, 2012 1st International Conference on Recent Advances in Information Technology (RAIT).