Workflow Scheduling Issues and Techniques in Cloud Computing: A Systematic Literature Review

One of the most challenging issues in cloud computing is workflow scheduling. Workflow applications have a complex structure and many discrete tasks. Each task may include entering data, processing, accessing software, or storage functions. For these reasons, the workflow scheduling is considered to be an NP-hard problem. Then, efficient scheduling algorithms are required for selection of best suitable resources for workflow execution. In this paper, we conduct a SLR (Systematic literature review) of workflow scheduling strategies that have been proposed for cloud computing platforms to help researchers systematically and objectively gather and aggregate research evidences about this topic. Then, we present a comparative analysis of the studied strategies. Finally, we highlight workflow scheduling issues for further research. The findings of this review provide a roadmap for developing workflow scheduling models, which will motivate researchers to propose better workflow scheduling algorithms for service consumers and/or utility providers in cloud computing.

[1]  Lu Jie,et al.  Grid Task Scheduling Based on Improved Genetic Algorithm , 2010 .

[2]  Cong Wang,et al.  Secure and practical outsourcing of linear programming in cloud computing , 2011, 2011 Proceedings IEEE INFOCOM.

[3]  Xiaomin Zhu,et al.  Scheduling for Workflows with Security-Sensitive Intermediate Data by Selective Tasks Duplication in Clouds , 2017, IEEE Transactions on Parallel and Distributed Systems.

[4]  Prasanta K. Jana,et al.  Forward Load Aware Scheduling for Data-Intensive Workflow Applications in Cloud System , 2016, 2016 International Conference on Information Technology (ICIT).

[5]  Yue-Shan Chang,et al.  Adaptive scheduling for parallel tasks with QoS satisfaction for hybrid cloud environments , 2013, The Journal of Supercomputing.

[6]  C. Kesselman,et al.  CyberShake: A Physics-Based Seismic Hazard Model for Southern California , 2011 .

[7]  Xuewei Li,et al.  Application of Workflow Technology to Current Dispatching Order System , 2008 .

[8]  Ken Kennedy,et al.  Scheduling strategies for mapping application workflows onto the grid , 2005, HPDC-14. Proceedings. 14th IEEE International Symposium on High Performance Distributed Computing, 2005..

[9]  Luiz Fernando Bittencourt,et al.  HCOC: a cost optimization algorithm for workflow scheduling in hybrid clouds , 2011, Journal of Internet Services and Applications.

[10]  Marian Bubak,et al.  Scheduling Multilevel Deadline-Constrained Scientific Workflows on Clouds Based on Cost Optimization , 2015, Sci. Program..

[11]  Kuo-Chan Huang,et al.  Dynamic Resource Provisioning for Interactive Workflow Applications on Cloud Computing Platform , 2010, MTPP.

[12]  Bertrand Granado,et al.  Multi-Objective Approach for Energy-Aware Workflow Scheduling in Cloud Computing Environments , 2013, TheScientificWorldJournal.

[13]  Radu Prodan,et al.  Towards a general model of the multi-criteria workflow scheduling on the grid , 2009, Future Gener. Comput. Syst..

[14]  Chao Chen,et al.  Energy-aware scheduling of virtual machines in heterogeneous cloud computing systems , 2017, Future Gener. Comput. Syst..

[15]  Pearl Brereton,et al.  Systematic literature reviews in software engineering - A systematic literature review , 2009, Inf. Softw. Technol..

[16]  Fatma A. Omara,et al.  Dynamic task scheduling algorithm with load balancing for heterogeneous computing system , 2012 .

[17]  Rizos Sakellariou,et al.  Budget-Deadline Constrained Workflow Planning for Admission Control , 2013, Journal of Grid Computing.

[18]  Claude Tadonki,et al.  Graph-based model and algorithm for minimising big data movement in a cloud environment , 2018 .

[19]  Rizos Sakellariou,et al.  Using imbalance metrics to optimize task clustering in scientific workflow executions , 2015, Future Gener. Comput. Syst..

[20]  Li-Yeh Chuang,et al.  Natural Products: Bioactivity, Biochemistry, and Biological Effects in Cancer and Disease Therapy , 2013, The Scientific World Journal.

[21]  Rajkumar Buyya,et al.  SLA-based virtual machine management for heterogeneous workloads in a cloud datacenter , 2014, J. Netw. Comput. Appl..

[22]  WenAn Tan,et al.  A Trust Service-Oriented Scheduling Model for Workflow Applications in Cloud Computing , 2014, IEEE Systems Journal.

[23]  Eunmi Choi,et al.  A service-oriented taxonomical spectrum, cloudy challenges and opportunities of cloud computing , 2012, Int. J. Commun. Syst..

[24]  Rajkumar Buyya,et al.  Deadline‐constrained coevolutionary genetic algorithm for scientific workflow scheduling in cloud computing , 2017, Concurr. Comput. Pract. Exp..

[25]  Luis Rodero-Merino,et al.  A break in the clouds: towards a cloud definition , 2008, CCRV.

[26]  Xiaolei Dong,et al.  Security and privacy for storage and computation in cloud computing , 2014, Inf. Sci..

[27]  Albert Y. Zomaya,et al.  A balanced scheduler with data reuse and replication for scientific workflows in cloud computing systems , 2017, Future Gener. Comput. Syst..

[28]  LiGuo Huang,et al.  A security and cost aware scheduling algorithm for heterogeneous tasks of scientific workflow in clouds , 2016, Future Gener. Comput. Syst..

[29]  Vicente Hernández García,et al.  SLA-driven dynamic cloud resource management , 2014 .

[30]  Rizos Sakellariou,et al.  DAG Scheduling Using a Lookahead Variant of the Heterogeneous Earliest Finish Time Algorithm , 2010, 2010 18th Euromicro Conference on Parallel, Distributed and Network-based Processing.

[31]  W Chiu,et al.  EMAN: semiautomated software for high-resolution single-particle reconstructions. , 1999, Journal of structural biology.

[32]  Liang Chen,et al.  Efficient task scheduling for Many Task Computing with resource attribute selection , 2014, China Communications.

[33]  Rajkumar Buyya,et al.  Enhancing Reliability of Workflow Execution Using Task Replication and Spot Instances , 2016, ACM Trans. Auton. Adapt. Syst..

[34]  Ian J. Taylor,et al.  Workflows and e-Science: An overview of workflow system features and capabilities , 2009, Future Gener. Comput. Syst..

[35]  Claude Tadonki,et al.  DT-MG: many-to-one matching game for tasks scheduling towards resources optimization in cloud computing , 2018 .

[36]  Daniel S. Katz,et al.  Montage: a grid-enabled engine for delivering custom science-grade mosaics on demand , 2004, SPIE Astronomical Telescopes + Instrumentation.

[37]  Ritu Garg,et al.  Reliability-Aware Workflow Scheduling Using Monte Carlo Failure Estimation in Cloud , 2017 .

[38]  Michael Pinedo,et al.  Scheduling: Theory, Algorithms, and Systems , 1994 .

[39]  Rajkumar Buyya,et al.  Optimizing the makespan and reliability for workflow applications with reputation and a look-ahead genetic algorithm , 2011, Future Gener. Comput. Syst..

[40]  Andrew Y. C. Nee,et al.  An improved Intelligent Water Drops algorithm for achieving optimal job-shop scheduling solutions , 2012 .

[41]  Xiaorong Li,et al.  SABA: A security-aware and budget-aware workflow scheduling strategy in clouds , 2015, J. Parallel Distributed Comput..

[42]  Kouichi Sakurai,et al.  Reliable workflow scheduling with less resource redundancy , 2013, Parallel Comput..

[43]  Vijayan Sugumaran,et al.  FFBAT: A security and cost‐aware workflow scheduling approach combining firefly and bat algorithms , 2017, Concurr. Comput. Pract. Exp..

[44]  Roland H. C. Yap,et al.  Tagged-MapReduce: A General Framework for Secure Computing with Mixed-Sensitivity Data on Hybrid Clouds , 2014, 2014 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing.

[45]  Claude Tadonki,et al.  E-HEFT: Enhancement Heterogeneous Earliest Finish Time algorithm for Task Scheduling based on Load Balancing in Cloud Computing , 2018, 2018 International Conference on High Performance Computing & Simulation (HPCS).

[46]  Y.-K. Kwok,et al.  Static scheduling algorithms for allocating directed task graphs to multiprocessors , 1999, CSUR.