Autonomic scaling of Cloud Computing resources using BN-based prediction models

The recent surge in the popularity and usage of Cloud Computing services by both the enterprise and individual consumers has necessitated efficient and proactive management of data center resources which host services having varied characteristics. One of the major issues concerning both the cloud service providers and consumers is the automatic scalability of resources (i.e., compute, storage and bandwidth) in response to the highly unpredictable demands. To this end, an opportunity exists to harness the predictive and diagnostic capabilities of machine learning approaches to incorporate dynamic scaling up and scaling down of resources without violating the Service Level Agreements (SLA) and simultaneously ensuring adequate revenue to the providers. This paper proposes, implements and evaluates a Bayesian Networks based predictive modeling framework to provide for an autonomic scaling of utility computing resources in the Cloud Computing scenario. In essence, the BN-based model captures the historical behavior of the system involving various performance metrics (indicators) and predicts the desired unknown metric (e.g. SLA parameter). Initial simulated experiments involving random demand scenarios provide insights into the feasibility and applicability of the proposed approach for improving the management of present data center facilities.

[1]  ZhiHui Lv,et al.  RPPS: A Novel Resource Prediction and Provisioning Scheme in Cloud Data Center , 2012, 2012 IEEE Ninth International Conference on Services Computing.

[2]  Michael C. Horsch,et al.  Dynamic Bayesian networks , 1990 .

[3]  Harald Steck,et al.  Constraint-based structural learning in Bayesian networks using finite data sets , 2001 .

[4]  Zhenhuan Gong,et al.  PRESS: PRedictive Elastic ReSource Scaling for cloud systems , 2010, 2010 International Conference on Network and Service Management.

[5]  Philip J. Morrow,et al.  Realtime utilisation of cloud storage , 2012, 2012 IEEE 1st International Conference on Cloud Networking (CLOUDNET).

[6]  Michael I. Jordan,et al.  Statistical Machine Learning Makes Automatic Control Practical for Internet Datacenters , 2009, HotCloud.

[7]  Tharam S. Dillon,et al.  Cloud Computing: Issues and Challenges , 2010, 2010 24th IEEE International Conference on Advanced Information Networking and Applications.

[8]  S. Lauritzen The EM algorithm for graphical association models with missing data , 1995 .

[9]  Randy H. Katz,et al.  A view of cloud computing , 2010, CACM.

[10]  Eddy Caron,et al.  Forecasting for Grid and Cloud Computing On-Demand Resources Based on Pattern Matching , 2010, 2010 IEEE Second International Conference on Cloud Computing Technology and Science.

[11]  Ziming Zhang,et al.  Ensemble of Bayesian Predictors for Autonomic Failure Management in Cloud Computing , 2011, 2011 Proceedings of 20th International Conference on Computer Communications and Networks (ICCCN).

[12]  Tom Burr,et al.  Causation, Prediction, and Search , 2003, Technometrics.

[13]  M. Ashraful Amin,et al.  Neural network and regression based processor load prediction for efficient scaling of Grid and Cloud resources , 2011, 14th International Conference on Computer and Information Technology (ICCIT 2011).

[14]  Philip J. Morrow,et al.  A SLA-compliant Cloud resource allocation framework for N-tier applications , 2012, 2012 IEEE 1st International Conference on Cloud Networking (CLOUDNET).

[15]  Alexander Clemm,et al.  Integrated and autonomic cloud resource scaling , 2012, 2012 IEEE Network Operations and Management Symposium.

[16]  Kevin Murphy,et al.  Dynamic Bayesian Networks , 2002 .

[17]  Randy H. Katz,et al.  Above the Clouds: A Berkeley View of Cloud Computing , 2009 .