SLA-driven Load Scheduling in Multi-tier Cloud Computing: Financial Impact Considerations

A cloud service provider strives to effectively provide a high Quality of Service (QoS) to client jobs. Such jobs vary in computational and Service-Level-Agreement (SLA) obligations, as well as differ with respect to tolerating delays and SLA violations. The job scheduling plays a critical role in servicing cloud demands by allocating appropriate resources to execute client jobs. The response to such jobs is optimized by the cloud service provider on a multi-tier cloud computing environment. Typically, the complex and dynamic nature of multi-tier environments incurs difficulties in meeting such demands, because tiers are dependent on each others which in turn makes bottlenecks of a tier shift to escalate in subsequent tiers. However, the optimization process of existing approaches produces single-tier-driven schedules that do not employ the differential impact of SLA violations in executing client jobs. Furthermore, the impact of schedules optimized at the tier level on the performance of schedules formulated in subsequent tiers tends to be ignored, resulting in a less than optimal performance when measured at the multi-tier level. Thus, failing in committing job obligations incurs SLA penalties that often take the form of either financial compensations, or losing future interests and motivations of unsatisfied clients in the service provided. Therefore, tolerating the risk of such delays on the operational performance of a cloud service provider is vital to meet SLA expectations and mitigate their associated commercial penalties. Such situations demand the cloud service provider to employ scalable service mechanisms that efficiently manage the execution of resource loads in accordance to their financial influence on the system performance, so as to ensure system reliability and cost reduction. In this paper, a scheduling and allocation approach is proposed to formulate schedules that account for differential impacts of SLA violation penalties and, thus, produce schedules that are optimal in financial performance. A queue virtualization scheme is designed to facilitate the formulation of optimal schedules at the tier and multi-tier levels of the cloud environment. Because the scheduling problem is NPhard, a biologically inspired approach is proposed to mitigate the complexity of finding optimal schedules. The reported results in this paper demonstrate the efficacy of the proposed approach in formulating costoptimal schedules that reduce SLA penalties of jobs at various architectural granularities of the multi-tier cloud environment.

[1]  Chien Chen,et al.  Market-Based Load Balancing for Distributed Heterogeneous Multi-Resource Servers , 2009, 2009 15th International Conference on Parallel and Distributed Systems.

[2]  Kannan Ramchandran,et al.  On scheduling redundant requests with cancellation overheads , 2015, 2015 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[3]  Jean-Henry Morin,et al.  Towards Cloud Computing SLA Risk Management: Issues and Challenges , 2012, 2012 45th Hawaii International Conference on System Sciences.

[4]  Otman Basir,et al.  QoS-Driven Job Scheduling: Multi-Tier Dependency Considerations , 2019, 9th International Conference on Computer Science, Engineering and Applications (CCSEA 2019).

[5]  Chu-Sing Yang,et al.  A hybrid meta-heuristic algorithm for VM scheduling with load balancing in cloud computing , 2015, Neural Computing and Applications.

[6]  Rada Chirkova,et al.  Heuristic-Based Request Scheduling Subject to a Percentile Response Time SLA in a Distributed Cloud , 2010, 2010 IEEE Global Telecommunications Conference GLOBECOM 2010.

[7]  Prasant Kumar Pattnaik,et al.  An enhanced deadline constraint based task scheduling mechanism for cloud environment , 2018, J. King Saud Univ. Comput. Inf. Sci..

[8]  Saurabh Bilgaiyan,et al.  A study on load balancing in cloud computing environment using evolutionary and swarm based algorithms , 2014, 2014 International Conference on Control, Instrumentation, Communication and Computational Technologies (ICCICCT).

[9]  Inderveer Chana,et al.  Quality of Service and Service Level Agreements for Cloud Environments: Issues and Challenges , 2014 .

[10]  Prasanta K. Jana,et al.  Efficient task scheduling algorithms for heterogeneous multi-cloud environment , 2015, 2014 International Conference on Advances in Computing, Communications and Informatics (ICACCI).

[11]  Divya Chaudhary,et al.  Analytical study of load scheduling algorithms in cloud computing , 2014, 2014 International Conference on Parallel, Distributed and Grid Computing.

[12]  Douglas G. Down,et al.  Scheduling Jobs with Estimation Errors for Multi-server Systems , 2017, 2017 29th International Teletraffic Congress (ITC 29).

[13]  M. Okopa,et al.  Fixed Priority SWAP scheduling policy with differentiated services under varying job size distributions , 2012, 2012 Second International Conference on Digital Information Processing and Communications (ICDIPC).

[14]  Poonam Singh,et al.  A review of task scheduling based on meta-heuristics approach in cloud computing , 2017, Knowledge and Information Systems.

[15]  Ahmed Chiheb Ammari,et al.  An effective and distributed particle swarm optimization algorithm for flexible job-shop scheduling problem , 2015, Journal of Intelligent Manufacturing.

[16]  Philip Samuel,et al.  Enhanced Bee Colony Algorithm for Efficient Load Balancing and Scheduling in Cloud , 2015, IBICA.

[17]  Ricardo Jiménez-Peris,et al.  A multi-resource load balancing algorithm for cloud cache systems , 2013, SAC '13.

[18]  Sanjaya Kumar Panda,et al.  A pair-based task scheduling algorithm for cloud computing environment , 2018, J. King Saud Univ. Comput. Inf. Sci..

[19]  Philip Samuel,et al.  Load Balancing of Tasks in Cloud Computing Environment Based on Bee Colony Algorithm , 2015, 2015 Fifth International Conference on Advances in Computing and Communications (ICACC).

[20]  Helen D. Karatza,et al.  The Effect of Workload Computational Demand Variability on the Performance of a SaaS Cloud with a Multi-tier SLA , 2017, 2017 IEEE 5th International Conference on Future Internet of Things and Cloud (FiCloud).

[21]  Jinkuan Wang,et al.  Resource Scheduling of Cloud with QoS Constraints , 2013, ISNN.

[22]  Bibhudatta Sahoo,et al.  Load balancing in cloud computing: A big picture , 2018, J. King Saud Univ. Comput. Inf. Sci..

[23]  Javier Alonso,et al.  Prediction of Job Resource Requirements for Deadline Schedulers to Manage High-Level SLAs on the Cloud , 2010, 2010 Ninth IEEE International Symposium on Network Computing and Applications.

[24]  Alan Scheller-Wolf,et al.  Queueing with redundant requests: exact analysis , 2016, Queueing Syst. Theory Appl..

[25]  Huankai Chen,et al.  User-priority guided Min-Min scheduling algorithm for load balancing in cloud computing , 2013, 2013 National Conference on Parallel Computing Technologies (PARCOMPTECH).

[26]  Yang Xiaomei,et al.  A Genetic Algorithm for Job Shop Scheduling Problem Using Co-evolution and Competition Mechanism , 2010, 2010 International Conference on Artificial Intelligence and Computational Intelligence.

[27]  Sanjaya Kumar Panda,et al.  Task Partitioning Scheduling Algorithms for Heterogeneous Multi-Cloud Environment , 2018 .

[28]  Xinyu Li,et al.  An effective hybrid genetic algorithm and tabu search for flexible job shop scheduling problem , 2016 .

[29]  F. Brazier,et al.  Managing violations in service level agreements , 2008 .

[30]  Helen D. Karatza,et al.  Multi-criteria scheduling of Bag-of-Tasks applications on heterogeneous interlinked clouds with simulated annealing , 2015, J. Syst. Softw..

[31]  Imran Ghani,et al.  Quality of service approaches in cloud computing: A systematic mapping study , 2015, J. Syst. Softw..

[32]  Robert Birke,et al.  Power of redundancy: Designing partial replication for multi-tier applications , 2017, IEEE INFOCOM 2017 - IEEE Conference on Computer Communications.

[33]  Jimmy Vuong Disaster recovery planning , 2015, InfoSecCD.

[34]  Jun Zhang,et al.  Load Balance Aware Genetic Algorithm for Task Scheduling in Cloud Computing , 2014, SEAL.

[35]  Avnish Thakur,et al.  A taxonomic survey on load balancing in cloud , 2017, J. Netw. Comput. Appl..

[36]  Mohit Singh,et al.  Dynamic Resource Allocation in the Cloud with Near-Optimal Efficiency , 2018, Oper. Res..

[37]  Geeta,et al.  A Review on Quality of Service in Cloud Computing , 2018 .

[38]  Paul D. Witman,et al.  GOVERNANCE AND SERVICE LEVEL AGREEMENT ISSUES IN A CLOUD COMPUTING ENVIRONMENT , 2011 .

[39]  A. K. Singh,et al.  A survey on scheduling and load balancing techniques in cloud computing environment , 2014, 2014 International Conference on Computer and Communication Technology (ICCCT).

[40]  Otman Basir,et al.  Service Level Driven Job Scheduling in Multi-Tier Cloud Computing: A Biologically Inspired Approach , 2020, ArXiv.

[41]  Yun Chi,et al.  Performance evaluation of scheduling algorithms for database services with soft and hard SLAs , 2011, DataCloud-SC '11.