SLA-Aware Application Deployment and Resource Allocation in Clouds

Provisioning resources as a service in a scalable on-demand manner is a basic feature in Cloud computing technology. Service provisioning in Clouds is based on Service Level Agreements (SLAs) representing a contract signed between the customer and the service provider stating the terms of the agreement including non-functional requirements of the service specified as Quality of Service (QoS), obligations, and penalties in case of agreement violations. On the one hand SLA violation should be prevented to avoid costly penalties and on the other hand providers have to efficiently utilize resources to minimize cost for the service provisioning. Thus, scheduling strategies considering multiple SLA parameters and efficient allocation of resources are necessary. Recent work considers various strategies with single SLA parameters. However, those approaches are limited to simple workflows and single task applications. Scheduling and deploying service requests considering multiple SLA parameters such as amount of CPU required, network bandwidth, memory and storage are still open research challenges. In this paper, we present a novel scheduling heuristic considering multiple SLA parameters for deploying applications in Clouds. We discuss in details the heuristic design and implementation and finally present detailed evaluations as a proof of concept emphasizing the performance of our approach.

[1]  Albert Y. Zomaya,et al.  Profit-Driven Service Request Scheduling in Clouds , 2010, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing.

[2]  Rajkumar Buyya,et al.  Adapting Market-Oriented Scheduling Policies for Cloud Computing , 2010, ICA3PP.

[3]  Schahram Dustdar,et al.  FoSII - Foundations of Self-Governing ICT Infrastructures , 2010, ERCIM News.

[4]  Rajkumar Buyya,et al.  A Particle Swarm Optimization-Based Heuristic for Scheduling Workflow Applications in Cloud Computing Environments , 2010, 2010 24th IEEE International Conference on Advanced Information Networking and Applications.

[5]  Schahram Dustdar,et al.  Low level Metrics to High level SLAs - LoM2HiS framework: Bridging the gap between monitored metrics and SLA parameters in cloud environments , 2010, 2010 International Conference on High Performance Computing & Simulation.

[6]  Rajkumar Buyya,et al.  CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms , 2011, Softw. Pract. Exp..

[7]  John M. Wilson,et al.  An Algorithm for the Generalized Assignment Problem with Special Ordered Sets , 2005, J. Heuristics.

[8]  Rajkumar Buyya,et al.  Time and cost trade-off management for scheduling parallel applications on Utility Grids , 2010, Future Gener. Comput. Syst..

[9]  Radu Prodan,et al.  A survey and taxonomy of infrastructure as a service and web hosting cloud providers , 2009, 2009 10th IEEE/ACM International Conference on Grid Computing.

[10]  Schahram Dustdar,et al.  Towards Knowledge Management in Self-Adaptable Clouds , 2010, 2010 6th World Congress on Services.

[11]  Jeffrey O. Kephart,et al.  The Vision of Autonomic Computing , 2003, Computer.

[12]  Qi Cao,et al.  An Optimized Algorithm for Task Scheduling Based on Activity Based Costing in Cloud Computing , 2009, 2009 3rd International Conference on Bioinformatics and Biomedical Engineering.

[13]  Ivona Brandic Towards Self-Manageable Cloud Services , 2009, 2009 33rd Annual IEEE International Computer Software and Applications Conference.