Cost-Efficient and Application SLA-Aware Client Side Request Scheduling in an Infrastructure-as-a-Service Cloud

Providers of applications deployed in an Infrastructure-as-a-Service cloud permanently face the decision of whether it is more cost-efficient to scale up(i.e., rent more resources from the cloud) or to delay incoming requests, even though doing so may lead to dissatisfied customers and broken service level agreements. This decision is further complicated by the fact that not all customers have the same agreements, and not all requests require the same amount of resources devoted to them. In this paper, we present an approach for optimally scheduling incoming requests to virtual computing resources in the cloud, so that the sum of payments for resources and loss incurred by service level agreement violations is minimized. We discuss our approach based on an illustrative use case. Furthermore, we present a numerical evaluation based on real-life request data, which shows that our agreement-aware algorithm improves upon earlier work, which does not take service level agreements into account.

[1]  Edward G. Coffman,et al.  Approximation algorithms for bin packing: a survey , 1996 .

[2]  Asit Dan,et al.  Web services on demand: WSLA-driven automated management , 2004, IBM Syst. J..

[3]  Asit Dan,et al.  Web services agreement specification (ws-agreement) , 2004 .

[4]  SLA management - challenges in the context of Web-service-based infrastructures , 2004, Proceedings. IEEE International Conference on Web Services, 2004..

[5]  Maged M. Michael,et al.  Scale-up x Scale-out: A Case Study using Nutch/Lucene , 2007, 2007 IEEE International Parallel and Distributed Processing Symposium.

[6]  Boudewijn F. van Dongen,et al.  Cycle Time Prediction: When Will This Case Finally Be Finished? , 2008, OTM Conferences.

[7]  Alexandru Iosup,et al.  The Grid Workloads Archive , 2008, Future Gener. Comput. Syst..

[8]  Frank Leymann,et al.  Runtime Prediction of Service Level Agreement Violations for Composite Services , 2009, ICSOC/ServiceWave Workshops.

[9]  Rajkumar Buyya,et al.  Evaluating the cost-benefit of using cloud computing to extend the capacity of clusters , 2009, HPDC '09.

[10]  David Hilley,et al.  Cloud Computing: A Taxonomy of Platform and Infrastructure-level Offerings , 2009 .

[11]  Rajkumar Buyya,et al.  Article in Press Future Generation Computer Systems ( ) – Future Generation Computer Systems Cloud Computing and Emerging It Platforms: Vision, Hype, and Reality for Delivering Computing as the 5th Utility , 2022 .

[12]  Luqun Li,et al.  An Optimistic Differentiated Service Job Scheduling System for Cloud Computing Service Users and Providers , 2009, 2009 Third International Conference on Multimedia and Ubiquitous Engineering.

[13]  Rajkumar Buyya,et al.  A cost-benefit analysis of using cloud computing to extend the capacity of clusters , 2010, Cluster Computing.

[14]  Dimosthenis Kyriazis,et al.  A Service-Oriented Framework for GNU Octave-Based Performance Prediction , 2010, 2010 IEEE International Conference on Services Computing.

[15]  Elizabeth Chang,et al.  Conceptual SLA framework for cloud computing , 2010, 4th IEEE International Conference on Digital Ecosystems and Technologies.

[16]  Yun Chi,et al.  SLA-Aware Profit Optimization in Cloud Services via Resource Scheduling , 2010, 2010 6th World Congress on Services.

[17]  Randy H. Katz,et al.  A view of cloud computing , 2010, CACM.

[18]  Helen D. Karatza,et al.  Performance and cost evaluation of Gang Scheduling in a Cloud Computing system with job migrations and starvation handling , 2011, 2011 IEEE Symposium on Computers and Communications (ISCC).

[19]  Anees Shaikh,et al.  A Cost-Aware Elasticity Provisioning System for the Cloud , 2011, 2011 31st International Conference on Distributed Computing Systems.

[20]  Julien Gossa,et al.  Cost-Wait Trade-Offs in Client-Side Resource Provisioning with Elastic Clouds , 2011, 2011 IEEE 4th International Conference on Cloud Computing.

[21]  Ying Wang,et al.  A dynamic priority scheduling algorithm on service request scheduling in cloud computing , 2011, Proceedings of 2011 International Conference on Electronic & Mechanical Engineering and Information Technology.

[22]  Jian Li,et al.  Cost-Conscious Scheduling for Large Graph Processing in the Cloud , 2011, 2011 IEEE International Conference on High Performance Computing and Communications.

[23]  Schahram Dustdar,et al.  Esc: Towards an Elastic Stream Computing Platform for the Cloud , 2011, 2011 IEEE 4th International Conference on Cloud Computing.

[24]  Schahram Dustdar,et al.  CloudScale: a novel middleware for building transparently scaling cloud applications , 2012, SAC '12.

[25]  Schahram Dustdar,et al.  Cost-Based Optimization of Service Compositions , 2013, IEEE Transactions on Services Computing.