SLA-based resource provisioning for management of Cloud-based Software-as-a-Service applications

The Cloud computing Software-as-a-Service (SaaS) model has changed the sales model for software providers. The SaaS model transforms the traditional license based model to a subscription model, which allows customers to access applications over the Internet without software and hardware upfront costs and provides reduced maintenance costs. However, the key for sales is still customer satisfaction which is at the heart of the selling process. To guarantee Quality of Service (QoS) for customer satisfaction therefore, the Service Level Agreement (SLA) is implemented between customers and SaaS providers, where the main objectives are profit maximization and increased market share. To achieve these objectives, there are several challenges due to the dynamic nature of the Cloud environment. Firstly, the SaaS provider utilizes shared infrastructure and various types of request loads which can lead to unpredictability in performance and availability of resources. Secondly, there is a possibility that existing customers may make changes in requirements, which can lead to resource reallocation. As such, resource allocation may cause SLA violations which could reduce the SaaS providers’ profit margin and reputation, meaning a possible loss of existing customers and potential new customers. Thirdly, SaaS providers need to attract customers with special needs and consider market competition from other providers in order to increase profit and market share. To overcome the above challenges, most proposed solutions are focused on the resource management with the aim of minimizing cost without sufficiently consideration of customer’ needs. Therefore, to address these challenges, this thesis proposes algorithms and techniques for optimal provisioning of Cloud resources with the aim of maximizing profit and customer base by handling the dynamism associated with SLAs and heterogeneous resources. The key contributions of the thesis are:  A comprehensive survey of how SLAs are created, managed and used with case examples drawn from both academy and industry with a major emphasis on the SLAbased resource management systems.  The admission control and scheduling algorithms assist in identifying which request is more acceptable based on profitability, reducing the probability of SLA violations given the heterogeneous nature of Cloud resources.  The customer requirements driven resource provisioning algorithms can help in adapting to changes in the requirements. The proposed algorithms provide personalized attention to the customer and are also able to understand specific customer needs.  A new negotiation framework to enlarge a SaaS provider’s customer base that considers dynamism in the Cloud environment with time and market factors to make the best possible decisions for negotiation.  A prototype of the customer requirements driven SLA-based resource management system to prove the usefulness of our proposed strategies using the latest technologies.

[1]  Asser N. Tantawi,et al.  Performance management for cluster-based web services , 2005, IEEE Journal on Selected Areas in Communications.

[2]  Michael Schroeder,et al.  Market-based Resource Allocation for Grid Computing: A Model and Simulation , 2003, Middleware Workshops.

[3]  Qiang Yang,et al.  WhatNext: a prediction system for Web requests using n-gram sequence models , 2000, Proceedings of the First International Conference on Web Information Systems Engineering.

[4]  Rajkumar Buyya,et al.  CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms , 2011, Softw. Pract. Exp..

[5]  Rajkumar Buyya,et al.  Integrated Risk Analysis for a Commercial Computing Service , 2007, IPDPS.

[6]  Rajkumar Buyya,et al.  Service Level Agreement based Allocation of Cluster Resources: Handling Penalty to Enhance Utility , 2005, 2005 IEEE International Conference on Cluster Computing.

[7]  Brian D. Davison Learning Web Request Patterns , 2004, Web Dynamics.

[8]  Nicholas R. Jennings,et al.  Negotiation decision functions for autonomous agents , 1998, Robotics Auton. Syst..

[9]  David Laibson,et al.  The Age of Reason: Financial Decisions over the Life-Cycle with Implications for Regulation , 2009 .

[10]  Ronald W. Wolff,et al.  Poisson Arrivals See Time Averages , 1982, Oper. Res..

[11]  Rajkumar Buyya,et al.  Aneka—Integration of Private and Public Clouds , 2011 .

[12]  Rajkumar Buyya,et al.  InterCloud: Utility-Oriented Federation of Cloud Computing Environments for Scaling of Application Services , 2010, ICA3PP.

[13]  Sanjay Chaudhary,et al.  Negotiation for resource allocation in IaaS cloud , 2011, Bangalore Compute Conf..

[14]  Alexandru Iosup,et al.  An Early Performance Analysis of Cloud Computing Services for Scientific Computing , 2008 .

[15]  Martin Bichler,et al.  Admission control for media on demand services , 2007, Service Oriented Computing and Applications.

[16]  Rizos Sakellariou,et al.  An evaluation of heuristics for SLA based parallel job scheduling , 2006, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium.

[17]  David Abramson,et al.  A case for economy grid architecture for service oriented grid computing , 2001, Proceedings 15th International Parallel and Distributed Processing Symposium. IPDPS 2001.

[18]  Rajkumar Buyya,et al.  Market-oriented Grids and Utility Computing: The State-of-the-art and Future Directions , 2008, Journal of Grid Computing.

[19]  Mladen A. Vouk,et al.  Cloud Computing – Issues, Research and Implementations , 2008, CIT 2008.

[20]  Antony Bryant,et al.  Trust in Electronic Commerce Business Relationships , 2002, J. Electron. Commer. Res..

[21]  Anand Sivasubramaniam,et al.  Managing server energy and operational costs in hosting centers , 2005, SIGMETRICS '05.

[22]  Warren Smith,et al.  Predicting Application Run Times Using Historical Information , 1998, JSSPP.

[23]  Rajkumar Buyya,et al.  Utility Computing and Global Grids , 2006, ArXiv.

[24]  Jaideep Srivastava,et al.  Web usage mining: discovery and applications of usage patterns from Web data , 2000, SKDD.

[25]  John M. Wilson,et al.  An Algorithm for the Generalized Assignment Problem with Special Ordered Sets , 2005, J. Heuristics.

[26]  Dejan S. Milojicic,et al.  SLA Decomposition: Translating Service Level Objectives to System Level Thresholds , 2007, Fourth International Conference on Autonomic Computing (ICAC'07).

[27]  Rajkumar Buyya,et al.  Cloud Computing Principles and Paradigms , 2011 .

[28]  Jian Lin,et al.  A coordinated architecture for the agent-based service level agreement negotiation of Web service composition , 2006, Australian Software Engineering Conference (ASWEC'06).

[29]  Dan Dumitriu,et al.  An overview of the Galaxy management framework for scalable enterprise cluster computing , 2000, Proceedings IEEE International Conference on Cluster Computing. CLUSTER 2000.

[30]  Rajkumar Buyya,et al.  Aneka: Next-Generation Enterprise Grid Platform for e-Science and e-Business Applications , 2007, Third IEEE International Conference on e-Science and Grid Computing (e-Science 2007).

[31]  Wolfgang Emmerich,et al.  Precise service level agreements , 2004, Proceedings. 26th International Conference on Software Engineering.

[32]  B. Schneider,et al.  Service Quality: Research Perspectives , 2003 .

[33]  Habel Kurian A Markov model for web request prediction , 2008 .

[34]  Kwang Mong Sim,et al.  A survey of bargaining models for grid resource allocation , 2006, SECO.

[35]  Heiko Ludwig,et al.  The WSLA Framework: Specifying and Monitoring Service Level Agreements for Web Services , 2003, Journal of Network and Systems Management.

[36]  Stefan Kirn,et al.  Towards model checking & simulation of a multi-tier negotiation protocol for service chains , 2010, AAMAS.

[37]  Arif Merchant,et al.  Minerva: An automated resource provisioning tool for large-scale storage systems , 2001, TOCS.

[38]  Rajkumar Buyya,et al.  Cloudbus Toolkit for Market-Oriented Cloud Computing , 2009, CloudCom.

[39]  Frances M. T. Brazier,et al.  A WS-Agreement Based Resource Negotiation Framework for Mobile Agents , 2006, Scalable Comput. Pract. Exp..

[40]  Rajkumar Buyya,et al.  SLA-based admission control for a Software-as-a-Service provider in Cloud computing environments , 2012, J. Comput. Syst. Sci..

[41]  Asit Dan,et al.  Cremona: an architecture and library for creation and monitoring of WS-agreents , 2004, ICSOC '04.

[42]  Javier Alonso,et al.  Prediction of Job Resource Requirements for Deadline Schedulers to Manage High-Level SLAs on the Cloud , 2010, 2010 Ninth IEEE International Symposium on Network Computing and Applications.

[43]  Pablo Chacin,et al.  Application deployment using catallactic Grid middleware , 2005, MGC '05.

[44]  Virgílio A. F. Almeida,et al.  A methodology for workload characterization of E-commerce sites , 1999, EC '99.

[45]  Ian T. Foster,et al.  Resource co-allocation in computational grids , 1999, Proceedings. The Eighth International Symposium on High Performance Distributed Computing (Cat. No.99TH8469).

[46]  Schahram Dustdar,et al.  Service mediation and negotiation bootstrapping as first achievements towards self-adaptable grid and cloud services , 2009, GMAC '09.

[47]  David E. Irwin,et al.  Balancing risk and reward in a market-based task service , 2004, Proceedings. 13th IEEE International Symposium on High performance Distributed Computing, 2004..

[48]  Barbara Pernici,et al.  An architecture for flexible Web service QoS negotiation , 2005, Ninth IEEE International EDOC Enterprise Computing Conference (EDOC'05).

[49]  Andrew Hiles The complete guide to IT service level agreements : matching service quality to business needs , 1991 .

[50]  George Candea,et al.  OnCall: defeating spikes with a free-market application cluster , 2004 .

[51]  Vladimir Tosic,et al.  Management applications of the Web Service Offerings Language (WSOL) , 2005, Inf. Syst..

[52]  R. V. van Nieuwpoort,et al.  The Grid 2: Blueprint for a New Computing Infrastructure , 2003 .

[53]  Vijay S. Mookerjee,et al.  Maximizing business value by optimal assignment of jobs to resources in grid computing , 2009, Eur. J. Oper. Res..

[54]  Philip S. Yu,et al.  Utility computing SLA management based upon business objectives , 2004, IBM Syst. J..

[55]  Kuo-Ming Chao,et al.  Negotiating Agents in a Market-Oriented Grid , 2002, 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGRID'02).

[56]  F. Brazier,et al.  Managing violations in service level agreements , 2008 .

[57]  Harry G. Perros,et al.  SLA-based resource allocation in cluster computing systems , 2008, 2008 IEEE International Symposium on Parallel and Distributed Processing.

[58]  Rajkumar Buyya,et al.  A Negotiation Mechanism for Advance Resource Reservations Using the Alternate Offers Protocol , 2008, 2008 16th Interntional Workshop on Quality of Service.

[59]  Dhabaleswar K. Panda,et al.  QoPS: A QoS Based Scheme for Parallel Job Scheduling , 2003, JSSPP.

[60]  D. Goderis,et al.  Service level agreements: a main challenge for next generation networks , 2002, 2nd European Conference on Universal Multiservice Networks. ECUMN'2001 (Cat. No.02EX563).

[61]  G. Hardin,et al.  The Tragedy of the Commons , 1968, Green Planet Blues.

[62]  Patrick Martin,et al.  A Policy-Based Middleware for Web Services SLA Negotiation , 2009, 2009 IEEE International Conference on Web Services.

[63]  Alexandru Iosup,et al.  A Performance Analysis of EC2 Cloud Computing Services for Scientific Computing , 2009, CloudComp.

[64]  Sean Rooney The ICorpMaker A Dynamic Framework for Application Service Providers , 2000 .

[65]  Rizos Sakellariou,et al.  On the flexibility of WS-agreement for job submission , 2005, MGC '05.

[66]  Ramin Yahyapour,et al.  Using SLA for Resource Management and Scheduling-a Survey, TR-0096 , 2007 .

[67]  Rajkumar Buyya,et al.  Compute Power Market: towards a market-oriented grid , 2001, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid.

[68]  Douglas F. Parkhill,et al.  The Challenge of the Computer Utility , 1966 .

[69]  Rajkumar Buyya,et al.  Article in Press Future Generation Computer Systems ( ) – Future Generation Computer Systems Cloud Computing and Emerging It Platforms: Vision, Hype, and Reality for Delivering Computing as the 5th Utility , 2022 .

[70]  James Norris,et al.  OnCall: defeating spikes with a free-market application cluster , 2004, International Conference on Autonomic Computing, 2004. Proceedings..

[71]  Thomas Ledoux,et al.  A Language for Quality of Service Requirements Specification in Web Services Orchestrations , 2006, ICSOC Workshops.

[72]  Bogdan Franczyk,et al.  SLA Lifecycle Management in Services Grid- Requirements and Current Efforts Analysis , 2006, NODe/GSEM.

[73]  Mark S. Squillante,et al.  On maximizing service-level-agreement profits , 2001, EC.

[74]  Michael Joachim Bonell The UNIDROIT Principles of International Commercial Contracts and the Principles of European Contract Law: Similar Rules for the Same Purposes? , 1996 .

[75]  Ian Foster,et al.  On Fully Decentralized Resource Discovery in Grid Environments , 2001, GRID.

[76]  Ramin Yahyapour,et al.  Design and evaluation of job scheduling strategies for grid computing , 2000, GRID.

[77]  Jordi Guitart Fernández,et al.  Deadline constrained prediction of job resource requirements to manage high-level SLAs for SaaS cloud providers , 2010 .

[78]  Yolanda Gil,et al.  Automatically composed workflows for grid environments , 2004, IEEE Intelligent Systems.

[79]  Bu-Sung Lee,et al.  Optimization of Resource Provisioning Cost in Cloud Computing , 2012, IEEE Transactions on Services Computing.

[80]  Barbara Pernici,et al.  A framework for QoS-based Web service contracting , 2009, TWEB.

[81]  Warren Smith,et al.  A directory service for configuring high-performance distributed computations , 1997, Proceedings. The Sixth IEEE International Symposium on High Performance Distributed Computing (Cat. No.97TB100183).

[82]  Robert Cooley,et al.  The use of web structure and content to identify subjectively interesting web usage patterns , 2003, TOIT.

[83]  Ken Binmore,et al.  Applying game theory to automated negotiation , 1999 .

[84]  Nicholas R. Jennings,et al.  Using similarity criteria to make negotiation trade-offs , 2000, Proceedings Fourth International Conference on MultiAgent Systems.

[85]  Graham R. Nudd,et al.  Pace—A Toolset for the Performance Prediction of Parallel and Distributed Systems , 2000, Int. J. High Perform. Comput. Appl..

[86]  Mikyung Kang,et al.  Heterogeneous Cloud Computing , 2011, 2011 IEEE International Conference on Cluster Computing.

[87]  David A. Patterson,et al.  Technical perspective: the data center is the computer , 2008, CACM.

[88]  Albert Y. Zomaya,et al.  Profit-Driven Service Request Scheduling in Clouds , 2010, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing.

[89]  Muli Ben-Yehuda,et al.  The Reservoir model and architecture for open federated cloud computing , 2009, IBM J. Res. Dev..

[90]  R. Buyya,et al.  Towards a Meta-Negotiation Architecture for SLA-Aware Grid Services , 2008 .

[91]  Rajkumar Buyya,et al.  Modeling and simulation of scalable Cloud computing environments and the CloudSim toolkit: Challenges and opportunities , 2009, 2009 International Conference on High Performance Computing & Simulation.

[92]  Rajkumar Buyya,et al.  Mandi: a market exchange for trading utility and cloud computing services , 2011, The Journal of Supercomputing.

[93]  Ryszard Kowalczyk,et al.  Adaptive Negotiation with On-Line Prediction of Opponent Behaviour in Agent-Based Negotiations , 2006, 2006 IEEE/WIC/ACM International Conference on Intelligent Agent Technology.

[94]  Rajkumar Buyya,et al.  Proceedings of the 4th international workshop on Middleware for grid computing , 2006 .

[95]  Dhabaleswar K. Panda,et al.  Towards provision of quality of service guarantees in job scheduling , 2004, 2004 IEEE International Conference on Cluster Computing (IEEE Cat. No.04EX935).

[96]  Patrick Martin,et al.  An Adaptive and Intelligent SLA Negotiation System for Web Services , 2011, IEEE Transactions on Services Computing.

[97]  Omer F. Rana,et al.  An approach for quality of service adaptation in service‐oriented Grids , 2004, Concurr. Pract. Exp..

[98]  Joseph L. Hellerstein,et al.  Managing dynamic services: a contract based approach to a conceptual architecture , 2002, NOMS 2002. IEEE/IFIP Network Operations and Management Symposium. ' Management Solutions for the New Communications World'(Cat. No.02CH37327).

[99]  Daniel P. Siewiorek,et al.  Practical solutions for QoS-based resource allocation problems , 1998, Proceedings 19th IEEE Real-Time Systems Symposium (Cat. No.98CB36279).

[100]  Rajkumar Buyya,et al.  SLA-Based Resource Allocation for Software as a Service Provider (SaaS) in Cloud Computing Environments , 2011, 2011 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing.

[101]  Michael P. Wellman,et al.  The Michigan Internet AuctionBot: a configurable auction server for human and software agents , 1998, AGENTS '98.

[102]  Rajkumar Buyya,et al.  SLA-oriented resource provisioning for cloud computing: Challenges, architecture, and solutions , 2011, 2011 International Conference on Cloud and Service Computing.

[103]  John A. Zinky,et al.  Specifying and measuring quality of service in distributed object systems , 1998, Proceedings First International Symposium on Object-Oriented Real-Time Distributed Computing (ISORC '98).

[104]  Yechiam Yemini,et al.  Selfish optimization in computer networks , 1981, 1981 20th IEEE Conference on Decision and Control including the Symposium on Adaptive Processes.

[105]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[106]  Xavier Lorca,et al.  Entropy: a consolidation manager for clusters , 2009, VEE '09.

[107]  John Wilkes,et al.  Profitable services in an uncertain world , 2005, ACM/IEEE SC 2005 Conference (SC'05).

[108]  Alfonso Sánchez-Macián,et al.  Dynamic Service Provisioning Using GRIA SLAs , 2007, ICSOC Workshops.

[109]  Chris Rose,et al.  A Break in the Clouds: Towards a Cloud Definition , 2011 .

[110]  Rajkumar Buyya,et al.  Time and cost trade-off management for scheduling parallel applications on Utility Grids , 2010, Future Gener. Comput. Syst..

[111]  L. Youseff,et al.  Toward a Unified Ontology of Cloud Computing , 2008, 2008 Grid Computing Environments Workshop.

[112]  Rajkumar Buyya,et al.  Offer-based scheduling of deadline-constrained Bag-of-Tasks applications for utility computing systems , 2009, 2009 IEEE International Symposium on Parallel & Distributed Processing.

[113]  B. Looy,et al.  Service management : an integrated approach , 2013 .

[114]  Andrew Hiles Service Level Agreements , 1994 .

[115]  Andrzej Kochut,et al.  Dynamic Placement of Virtual Machines for Managing SLA Violations , 2007, 2007 10th IFIP/IEEE International Symposium on Integrated Network Management.

[116]  Banu Özden,et al.  The Eclipse Operating System: Providing Quality of Service via Reservation Domains , 1998, USENIX ATC.

[117]  David E. Culler,et al.  User-Centric Performance Analysis of Market-Based Cluster Batch Schedulers , 2002, 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGRID'02).

[118]  Kang G. Shin,et al.  Virtual Services: A New Abstraction for Server Consolidation , 2000, USENIX Annual Technical Conference, General Track.

[119]  Benny Rochwerger,et al.  Oceano-SLA based management of a computing utility , 2001, 2001 IEEE/IFIP International Symposium on Integrated Network Management Proceedings. Integrated Network Management VII. Integrated Management Strategies for the New Millennium (Cat. No.01EX470).

[120]  Rajkumar Buyya,et al.  An Autonomous Reliability-Aware Negotiation Strategy for Cloud Computing Environments , 2012, 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012).

[121]  Baruch Schieber,et al.  Minimizing migrations in fair multiprocessor scheduling of persistent tasks , 2004, SODA '04.

[122]  José A. B. Fortes,et al.  Sky Computing , 2009, IEEE Internet Computing.

[123]  E. Litvak,et al.  Queuing Theory Accurately Models the Need for Critical Care Resources , 2004, Anesthesiology.

[124]  Rajkumar Buyya,et al.  Pricing for Utility-Driven Resource Management and Allocation in Clusters , 2007, Int. J. High Perform. Comput. Appl..

[125]  Dean Kuo,et al.  A Framework & Negotiation Protocol for Service Contracts , 2006, 2006 IEEE International Conference on Services Computing (SCC'06).

[126]  Virgílio A. F. Almeida,et al.  Capacity Planning for Web Performance: Metrics, Models, and Methods , 1998 .

[127]  Kang G. Shin,et al.  Automated control of multiple virtualized resources , 2009, EuroSys '09.

[128]  Jean-Marc Menaud,et al.  SLA-Aware Virtual Resource Management for Cloud Infrastructures , 2009, 2009 Ninth IEEE International Conference on Computer and Information Technology.

[129]  Shijun Liu,et al.  A Hybrid Approach to Placement of Tenants for Service-Based Multi-tenant SaaS Application , 2011, 2011 IEEE Asia-Pacific Services Computing Conference.

[130]  Carl A. Waldspurger,et al.  Memory resource management in VMware ESX server , 2002, OSDI '02.

[131]  Akhil Sahai,et al.  Specifying and monitoring guarantees in commercial grids through SLA , 2003, CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings..

[132]  Shin-Dug Kim,et al.  Agent Based Resource Management for Grid Computing , 2002, PDPTA.

[133]  Gregor von Laszewski,et al.  QoS guided Min-Min heuristic for grid task scheduling , 2003, Journal of Computer Science and Technology.

[134]  Dominic Battré,et al.  Planning-based Scheduling for SLA-awareness and Grid Integration , 2007 .

[135]  Sachiyo Arai,et al.  Experience-Based Reinforcement Learning to Acquire Effective Behavior in a Multi-agent Domain , 2000, PRICAI.

[136]  Rajkumar Buyya,et al.  A taxonomy of market-based resource management systems for utility-driven cluster computing , 2006 .

[137]  Vasudeva Varma,et al.  Learning based opportunistic admission control algorithm for MapReduce as a service , 2010, ISEC.

[138]  David E. Irwin,et al.  Virtual Machine Hosting for Networked Clusters: Building the Foundations for "Autonomic" Orchestration , 2006, First International Workshop on Virtualization Technology in Distributed Computing (VTDC 2006).

[139]  D. Verma,et al.  Supporting Service Level Agreements on IP Networks , 1999 .

[140]  Hyong S. Kim,et al.  SageShift: Managing SLAs for highly consolidated cloud , 2012, 2012 Proceedings IEEE INFOCOM.

[141]  李伟,et al.  VEGA Infrastructure for Resource Discovery in Grids , 2003 .

[142]  Rajkumar Buyya,et al.  Utility Computing on Global Grids , 2012 .

[143]  Herman Lam,et al.  On automated e‐business negotiations: Goal, policy, strategy, and plans of decision and action , 2006, J. Organ. Comput. Electron. Commer..

[144]  Frank Teuteberg,et al.  Anticipating Agents ' Negotiation Strategies in an E-marketplace Using Belief Models , 2002 .