SLA-Driven Resource Provisioning in the Cloud

The common way to purchase computing capacity is to acquire a fix amount of servers which are hosted in a data center. The reserved amount of resources depends on the maximum demand that could occur. This over provisioning of resources is associated with high costs. Cloud providers promise to reduce the costs while guaranteeing always a good performance for the outsourced services. A reducing of the costs can be achieved by a better utilization of the resources for example. To optimize resource utilization while complying to the negotiated Service Level Agreements, the minimum amount of resources has to be determined as exactly as possible. Based on the simulated reprocessing of requests from recent history with different numbers of instances, the optimal number can be found in a short time. Further this work shows that an additional reduction of resources can be achieved by rescheduling the requests before processing.

[1]  Miron Livny,et al.  The cost of doing science on the cloud: The Montage example , 2008, 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis.

[2]  David L. Black,et al.  An Architecture for Differentiated Service , 1998 .

[3]  Serge Fdida,et al.  XML QoS specification language for enhancing communication services , 2002 .

[4]  Raghunath Othayoth Nambiar,et al.  Energy cost, the key challenge of today's data centers: a power consumption analysis of TPC-C results , 2008, Proc. VLDB Endow..

[5]  Svend Frølund,et al.  Quality-of-service specification in distributed object systems , 1998, Distributed Syst. Eng..

[6]  Amin Vahdat,et al.  Managing energy and server resources in hosting centers , 2001, SOSP.

[7]  Paul Marshall,et al.  Elastic Site: Using Clouds to Elastically Extend Site Resources , 2010, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing.

[8]  Steven Hand,et al.  Self-adaptive and self-configured CPU resource provisioning for virtualized servers using Kalman filters , 2009, ICAC '09.

[9]  Jeffrey S. Chase,et al.  Automated control in cloud computing: challenges and opportunities , 2009, ACDC '09.

[10]  Gregor Hohpe,et al.  Enterprise Integration Patterns: Designing, Building, and Deploying Messaging Solutions , 2003 .

[11]  Heiko Ludwig,et al.  Web Service Level Agreement (WSLA) Language Specification , 2003 .

[12]  Bernd Freisleben,et al.  On-Demand Resource Provisioning for BPEL Workflows Using Amazon's Elastic Compute Cloud , 2009, 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid.

[13]  Asit Dan,et al.  Web services agreement specification (ws-agreement) , 2004 .

[14]  Domenico Ferrari,et al.  An Empirical Investigation of Load Indices for Load Balancing Applications , 1987, Performance.

[15]  Randy H. Katz,et al.  Above the Clouds: A Berkeley View of Cloud Computing , 2009 .

[16]  Manish Marwah,et al.  Minimizing data center SLA violations and power consumption via hybrid resource provisioning , 2011, 2011 International Green Computing Conference and Workshops.

[17]  Rajkumar Buyya,et al.  Evaluating the cost-benefit of using cloud computing to extend the capacity of clusters , 2009, HPDC '09.

[18]  Fernand Feltz,et al.  Feedback Controlled Quality of Service Enforcement for Service Oriented Architectures , 2010, 2010 IEEE International Conference on Services Computing.

[19]  Lakshmi Sobhana Kalli,et al.  Market-Oriented Cloud Computing : Vision , Hype , and Reality for Delivering IT Services as Computing , 2013 .