Online scheduling of deadline‐constrained bag‐of‐task workloads on hybrid clouds

The advent of hybrid cloud technologies and public Infrastructures as a Service (IaaS) allows service developers to offer services to their customers with little upfront investment and to adapt services to different workload sizes. The problem of minimizing the costs of the hired public infrastructure while providing the quality of service needed by the final customer arises when using hybrid clouds. Several scheduling strategies have been proposed to solve this problem for services dealing with deadline‐constrained bag‐of‐tasks workloads. Most of these solutions do not consider the variable performance of the clouds, the provisioning delay of virtual machine instances that affects the elasticity, and the impracticality of having good processing time estimations in real systems. We propose a scheduler algorithm that overcomes previous limitations and can minimize the cost of the infrastructure while maximizing the number of deadlines met by the service. Our solution can work autonomously by using sampled observations of the processing times and considers the heterogeneity and the provisioning time of the virtual machine instances. An evaluation was conducted by simulating different scenarios and workload types. Simulation results show that our solution obtains better or similar results than previous techniques in most scenarios in terms of deadlines met.

[1]  Aloysius K. Mok,et al.  Multiprocessor On-Line Scheduling of Hard-Real-Time Tasks , 1989, IEEE Trans. Software Eng..

[2]  Joseph L. Hellerstein,et al.  Obfuscatory obscanturism: Making workload traces of commercially-sensitive systems safe to release , 2012, 2012 IEEE Network Operations and Management Symposium.

[3]  Sudhir Shenai,et al.  Survey on Scheduling Issues in Cloud Computing , 2012 .

[4]  Christos Koulamas,et al.  The Total Tardiness Problem: Review and Extensions , 1994, Oper. Res..

[5]  Rubén Ruiz,et al.  A delay-based dynamic scheduling algorithm for bag-of-task workflows with stochastic task execution times in clouds , 2017, Future Gener. Comput. Syst..

[6]  Radu Prodan,et al.  Low-time complexity budget-deadline constrained workflow scheduling on heterogeneous resources , 2016, Future Gener. Comput. Syst..

[7]  Rajkumar Buyya,et al.  Meeting Deadlines of Scientific Workflows in Public Clouds with Tasks Replication , 2014, IEEE Transactions on Parallel and Distributed Systems.

[8]  Alan D. George,et al.  RapidIO for radar processing in advanced space systems , 2007, TECS.

[9]  Alexandru Iosup,et al.  Grid Computing Workloads , 2011, IEEE Internet Computing.

[10]  Yue-Shan Chang,et al.  Adaptive scheduling for parallel tasks with QoS satisfaction for hybrid cloud environments , 2013, The Journal of Supercomputing.

[11]  J. M. J. Schutten,et al.  List scheduling revisited , 1996, Oper. Res. Lett..

[12]  Jan Broeckhove,et al.  Runtime Prediction Based Grid Scheduling of Parameter Sweep Jobs , 2008, 2008 IEEE Asia-Pacific Services Computing Conference.

[13]  Joaquín Entrialgo,et al.  Autonomic scheduling of deadline-constrained bag of tasks in hybrid clouds , 2016, 2016 International Symposium on Performance Evaluation of Computer and Telecommunication Systems (SPECTS).

[14]  Dan Tsafrir,et al.  Backfilling Using System-Generated Predictions Rather than User Runtime Estimates , 2007, IEEE Transactions on Parallel and Distributed Systems.

[15]  Unai Arronategui,et al.  Fair scheduling of bag-of-tasks applications on large-scale platforms , 2015, Future Gener. Comput. Syst..

[16]  Ronald L. Graham,et al.  Bounds on Multiprocessing Timing Anomalies , 1969, SIAM Journal of Applied Mathematics.

[17]  Jan Broeckhove,et al.  Online cost-efficient scheduling of deadline-constrained workloads on hybrid clouds , 2013, Future Gener. Comput. Syst..

[18]  Jan Karel Lenstra,et al.  Complexity of machine scheduling problems , 1975 .

[19]  Cynthia Bailey Lee,et al.  On the User–Scheduler Dialogue: Studies of User-Provided Runtime Estimates and Utility Functions , 2006, Int. J. High Perform. Comput. Appl..

[20]  Dick H. J. Epema,et al.  Parallel Workload Modeling with Realistic Characteristics , 2014, IEEE Transactions on Parallel and Distributed Systems.

[21]  Bahram Alidaee,et al.  Scheduling parallel machines to minimize total weighted and unweighted tardiness , 1997, Comput. Oper. Res..

[22]  Blesson Varghese,et al.  A survey and taxonomy of resource optimisation for executing bag-of-task applications on public clouds , 2017, Future Gener. Comput. Syst..

[23]  Charles Reiss,et al.  Towards understanding heterogeneous clouds at scale : Google trace analysis , 2012 .

[24]  P. Mell,et al.  The NIST Definition of Cloud Computing , 2011 .

[25]  Franck Cappello,et al.  Characterizing Cloud Applications on a Google Data Center , 2013, 2013 42nd International Conference on Parallel Processing.

[26]  E.L. Lawler,et al.  Optimization and Approximation in Deterministic Sequencing and Scheduling: a Survey , 1977 .

[27]  Alexandru Iosup,et al.  On the Performance Variability of Production Cloud Services , 2011, 2011 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing.

[28]  Hamid Arabnejad,et al.  List Scheduling Algorithm for Heterogeneous Systems by an Optimistic Cost Table , 2014, IEEE Transactions on Parallel and Distributed Systems.

[29]  Douglas Thain,et al.  Toward fine-grained online task characteristics estimation in scientific workflows , 2013, WORKS@SC.

[30]  Thilo Kielmann,et al.  Budget Estimation and Control for Bag-of-Tasks Scheduling in Clouds , 2011, Parallel Process. Lett..

[31]  Helen D. Karatza,et al.  A meta-heuristic optimization approach to the scheduling of bag-of-tasks applications on heterogeneous clouds with multi-level arrivals and critical jobs , 2015, Simul. Model. Pract. Theory.

[32]  Jakob Engblom,et al.  The worst-case execution-time problem—overview of methods and survey of tools , 2008, TECS.

[33]  Rizos Sakellariou,et al.  A Performance Model to Estimate Execution Time of Scientific Workflows on the Cloud , 2014, 2014 9th Workshop on Workflows in Support of Large-Scale Science.

[34]  Raouf Boutaba,et al.  Cloud computing: state-of-the-art and research challenges , 2010, Journal of Internet Services and Applications.

[35]  Wenzhong Guo,et al.  Online optimization scheduling for scientific workflows with deadline constraint on hybrid clouds , 2016, Concurr. Comput. Pract. Exp..

[36]  Ming Mao,et al.  A Performance Study on the VM Startup Time in the Cloud , 2012, 2012 IEEE Fifth International Conference on Cloud Computing.

[37]  Rajkumar Buyya,et al.  CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms , 2011, Softw. Pract. Exp..

[38]  Mohamed Jmaiel,et al.  A Comparative Study of the Current Cloud Computing Technologies and Offers , 2011, 2011 First International Symposium on Network Cloud Computing and Applications.

[39]  Hermes Senger,et al.  Scalability limits of Bag-of-Tasks applications running on hierarchical platforms , 2011, J. Parallel Distributed Comput..

[40]  Jorge-Arnulfo Quiané-Ruiz,et al.  Runtime measurements in the cloud , 2010, Proc. VLDB Endow..

[41]  Amin Vahdat,et al.  Evaluating the impact of inaccurate information in utility-based scheduling , 2009, Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis.