A GPU-Based Metaheuristic for Workflow Scheduling on Clouds

Scientific workflows are being used today in a number of areas. As they grow in complexity and importance, cloud computing emerges as an important execution environment. In this scenario, scheduling the workflow tasks and data on the cloud ensuring proper use of the computational resources is one of the key issues in the management of workflow execution. Although many workflow schedulers have been proposed in the literature, few of them deal with heterogeneous computing resources and data file assignment. The Hybrid Evolutionary Algorithm to Task Scheduling and Data File Assignment Problem (HEA-TaSDAP) addresses these two problems simultaneously, but the scheduling is time consuming, especially if we consider large scale workflows. In this work, we propose optimizations on HEA-TaSDAP by taking advantage of the massive parallelism provided by GPUs, leveraging the scheduling of larger instances in a reasonable amount of time. Our parallel solution provided about 98.83% of reductions in the scheduling time, keeping the quality of the solutions.

[1]  Mei-Hui Su,et al.  Characterization of scientific workflows , 2008, 2008 Third Workshop on Workflows in Support of Large-Scale Science.

[2]  Quan Z. Sheng,et al.  Science in the Cloud: Allocation and Execution of Data-Intensive Scientific Workflows , 2013, Journal of Grid Computing.

[3]  Marta Mattoso,et al.  Performance evaluation of parallel strategies in public clouds: A study with phylogenomic workflows , 2013, Future Gener. Comput. Syst..

[4]  Gurkan Ozturk,et al.  A Hybrid Genetic Algorithm for the Quadratic Assignment Problem on Graphics Processing Units , 2016 .

[5]  Rajkumar Buyya,et al.  A Particle Swarm Optimization-Based Heuristic for Scheduling Workflow Applications in Cloud Computing Environments , 2010, 2010 24th IEEE International Conference on Advanced Information Networking and Applications.

[6]  Salim Hariri,et al.  Performance-Effective and Low-Complexity Task Scheduling for Heterogeneous Computing , 2002, IEEE Trans. Parallel Distributed Syst..

[7]  Dick H. J. Epema,et al.  Deadline-constrained workflow scheduling algorithms for Infrastructure as a Service Clouds , 2013, Future Gener. Comput. Syst..

[8]  Yolanda Gil,et al.  Pegasus: Mapping Scientific Workflows onto the Grid , 2004, European Across Grids Conference.

[9]  Luan Teylo,et al.  A hybrid evolutionary algorithm for task scheduling and data assignment of data-intensive scientific workflows on clouds , 2017, Future Gener. Comput. Syst..

[10]  Dennis Gannon,et al.  Workflows for e-Science, Scientific Workflows for Grids , 2014 .

[11]  Marta Mattoso,et al.  Towards a Taxonomy for Cloud Computing from an e-Science Perspective , 2010, Cloud Computing.

[12]  Yi Zhou,et al.  Optimization of parallel iterated local search algorithms on graphics processing unit , 2016, The Journal of Supercomputing.

[13]  Igor Machado Coelho,et al.  A Benchmark on Multi Improvement Neighborhood Search Strategies in CPU/GPU Systems , 2016, 2016 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW).

[14]  El-Ghazali Talbi,et al.  Neighborhood Structures for GPU-Based Local Search Algorithms , 2010, Parallel Process. Lett..

[15]  Igor Machado Coelho,et al.  An integrated CPU–GPU heuristic inspired on variable neighbourhood search for the single vehicle routing problem with deliveries and selective pickups , 2016 .