Autonomic scheduling of deadline-constrained bag of tasks in hybrid clouds

The use of Hybrid Cloud technologies in large scale applications allows organizations to complement on-premises infrastructure with hired infrastructure from Public Cloud providers. The efficient use of the hired resources to provide the expected quality of service while dealing with the heterogeneity and uncertainty of Public Clouds is the main difficulty. A scheduler able to deal with deadline-constrained bag of tasks in Hybrid Clouds is presented in this work. The main contribution of this scheduler is that task runtime estimations are not necessary as inputs. The scheduler includes a runtime estimator based on sampled data to generate the estimation autonomously. A discrete event simulator was developed in order to validate the proposed scheduler in different scenarios. Results show that an estimator based on the Chebyshev's inequality obtains very good results in terms of deadlines met and cost.

[1]  Fang Dong,et al.  Elastic resource provisioning for scientific workflow scheduling in cloud under budget and deadline constraints , 2016, Cluster Computing.

[2]  Rajkumar Buyya,et al.  Meeting Deadlines of Scientific Workflows in Public Clouds with Tasks Replication , 2014, IEEE Transactions on Parallel and Distributed Systems.

[3]  Sudhir Shenai,et al.  Survey on Scheduling Issues in Cloud Computing , 2012 .

[4]  Unai Arronategui,et al.  Fair scheduling of bag-of-tasks applications on large-scale platforms , 2015, Future Gener. Comput. Syst..

[5]  P. Mell,et al.  The NIST Definition of Cloud Computing , 2011 .

[6]  Rizos Sakellariou,et al.  A Performance Model to Estimate Execution Time of Scientific Workflows on the Cloud , 2014, 2014 9th Workshop on Workflows in Support of Large-Scale Science.

[7]  Raouf Boutaba,et al.  Cloud computing: state-of-the-art and research challenges , 2010, Journal of Internet Services and Applications.

[8]  Dick H. J. Epema,et al.  Parallel Workload Modeling with Realistic Characteristics , 2014, IEEE Transactions on Parallel and Distributed Systems.

[9]  Rajkumar Buyya,et al.  CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms , 2011, Softw. Pract. Exp..

[10]  Jan Broeckhove,et al.  Online cost-efficient scheduling of deadline-constrained workloads on hybrid clouds , 2013, Future Gener. Comput. Syst..

[11]  E.L. Lawler,et al.  Optimization and Approximation in Deterministic Sequencing and Scheduling: a Survey , 1977 .

[12]  Jarek Nabrzyski,et al.  Cost- and deadline-constrained provisioning for scientific workflow ensembles in IaaS clouds , 2012, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis.

[13]  Ming Mao,et al.  A Performance Study on the VM Startup Time in the Cloud , 2012, 2012 IEEE Fifth International Conference on Cloud Computing.

[14]  John Shalf,et al.  Performance Analysis of High Performance Computing Applications on the Amazon Web Services Cloud , 2010, 2010 IEEE Second International Conference on Cloud Computing Technology and Science.

[15]  Inderveer Chana,et al.  A Survey on Resource Scheduling in Cloud Computing: Issues and Challenges , 2016, Journal of Grid Computing.

[16]  Jakob Engblom,et al.  The worst-case execution-time problem—overview of methods and survey of tools , 2008, TECS.

[17]  Helen D. Karatza,et al.  Multi-criteria scheduling of Bag-of-Tasks applications on heterogeneous interlinked clouds with simulated annealing , 2015, J. Syst. Softw..