Proactive Cloud Management for Highly Heterogeneous Multi-cloud Infrastructures

Various literature studies demonstrated that the cloud computing paradigm can help to improve availability and performance of applications subject to the problem of software anomalies. Indeed, the cloud resource provisioning model enables users to rapidly access new processing resources, even distributed over different geographical regions, that can be promptly used in the case of, e.g., crashes or hangs of running machines, as well as to balance the load in the case of overloaded machines. Nevertheless, managing a complex geographically-distributed cloud deploy could be a complex and time-consuming task. Autonomic Cloud Manager (ACM) Framework is an autonomic framework for supporting proactive management of applications deployed over multiple cloud regions. It uses machine learning models to predict failures of virtual machines and to proactively redirect the load to healthy machines/cloud regions. In this paper, we study different policies to perform efficient proactive load balancing across cloud regions in order to mitigate the effect of software anomalies. These policies use predictions about the mean time to failure of virtual machines. We consider the case of heterogeneous cloud regions, i.e regions with different amount of resources, and we provide an experimental assessment of these policies in the context of ACM Framework.

[1]  Safraz Rampersaud,et al.  A Sharing-Aware Greedy Algorithm for Virtual Machine Maximization , 2014, 2014 IEEE 13th International Symposium on Network Computing and Applications.

[2]  Bruno Ciciani,et al.  Auto-tuning of Cloud-Based In-Memory Transactional Data Grids via Machine Learning , 2012, 2012 Second Symposium on Network Cloud Computing and Applications.

[3]  Sukalyan Goswami,et al.  A Comparative Study of Load Balancing Algorithms in Computational Grid Environment , 2013, 2013 Fifth International Conference on Computational Intelligence, Modelling and Simulation.

[4]  Dimiter R. Avresky,et al.  Dynamic reconfiguration in computer clusters with irregular topologies in the presence of multiple node and link failures , 2005, IEEE Transactions on Computers.

[5]  Bruno Ciciani,et al.  Providing Transaction Class-Based QoS in In-Memory Data Grids via Machine Learning , 2014, 2014 IEEE 3rd Symposium on Network Cloud Computing and Applications (ncca 2014).

[6]  M. Kubát An Introduction to Machine Learning , 2017, Springer International Publishing.

[7]  Marco Aurélio Gerosa,et al.  Deploying Large-Scale Service Compositions on the Cloud with the CHOReOS Enactment Engine , 2014, 2014 IEEE 13th International Symposium on Network Computing and Applications.

[8]  Basavaraj Jakkali,et al.  A Load Balancing Model Based On Cloud Partitioning For The Public Cloud , 2015 .

[9]  Soila Pertet,et al.  Causes of Failure in Web Applications (CMU-PDL-05-109) , 2005 .

[10]  Edward I. George,et al.  Extracting Representative Tree Models From a Forest , 1998 .

[11]  Domenico Cotroneo,et al.  Software Aging and Rejuvenation: Where We Are and Where We Are Going , 2011, 2011 IEEE Third International Workshop on Software Aging and Rejuvenation.

[12]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[13]  Inderveer Chana,et al.  Cloud Load Balancing Techniques : A Step Towards Green Computing , 2012 .

[14]  Benjamin W. Wah,et al.  Automated Learning of Workload Measures for Load Balancing on a Distributed System , 1993, 1993 International Conference on Parallel Processing - ICPP'93.

[15]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[16]  Rajesh George Rajan A Survey on Load Balancing in Cloud Computing Environments , 2013 .

[17]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[18]  Bruno Ciciani,et al.  Proactive Scalability and Management of Resources in Hybrid Clouds via Machine Learning , 2015, 2015 IEEE 14th International Symposium on Network Computing and Applications.

[19]  Kishor S. Trivedi,et al.  A comprehensive model for software rejuvenation , 2005, IEEE Transactions on Dependable and Secure Computing.

[20]  Noëmie Simoni,et al.  Self-Control Cloud Services , 2014, 2014 IEEE 13th International Symposium on Network Computing and Applications.

[21]  Jordi Torres,et al.  Using Virtualization to Improve Software Rejuvenation , 2009, IEEE Trans. Computers.

[22]  Dimiter R. Avresky,et al.  Machine Learning for Achieving Self-* Properties and Seamless Execution of Applications in the Cloud , 2015, 2015 IEEE Fourth Symposium on Network Cloud Computing and Applications (NCCA).

[23]  Dimiter R. Avresky,et al.  A Machine Learning-Based Framework for Building Application Failure Prediction Models , 2015, 2015 IEEE International Parallel and Distributed Processing Symposium Workshop.

[24]  Prashant Pandey,et al.  Cloud computing , 2010, ICWET.

[25]  Roberto Palmieri,et al.  A framework for high performance simulation of transactional data grid platforms , 2013, SimuTools.

[26]  A. Khiyaita,et al.  Load balancing cloud computing: State of art , 2012, 2012 National Days of Network Security and Systems.

[27]  Luís Veiga,et al.  On-Demand Resource Allocation Middleware for Massively Multiplayer Online Games , 2014, 2014 IEEE 13th International Symposium on Network Computing and Applications.

[28]  Roberto Palmieri,et al.  A flexible framework for accurate simulation of cloud in-memory data stores , 2014, Simul. Model. Pract. Theory.

[29]  Zenon Chaczko,et al.  Availability and Load Balancing in Cloud Computing , 2011 .

[30]  Erol Gelenbe,et al.  Adaptive Dispatching of Tasks in the Cloud , 2015, IEEE Transactions on Cloud Computing.

[31]  Mikko H. Lipasti,et al.  Characterizing a Java Implementation of TPC-W , 1996 .

[32]  A. Taleb-Bendiab,et al.  A Comparative Study into Distributed Load Balancing Algorithms for Cloud Computing , 2010, 2010 IEEE 24th International Conference on Advanced Information Networking and Applications Workshops.

[33]  Wayne D. Smith,et al.  TPC-W: Benchmarking An Ecommerce Solution , 2001 .

[34]  Bingchiang Jeng,et al.  Load-Balancing Tactics in Cloud , 2011, 2011 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery.

[35]  Aaron Vegh MySQL Database Server , 2011 .