Low-Cost Approximation Algorithms for Scheduling Independent Tasks on Hybrid Platforms

Hybrid platforms embedding accelerators such as GPUs or Xeon Phis are increasingly used in computing. When scheduling tasks on such platforms, one has to take into account that a task execution time depends on the type of core used to execute it. We focus on the problem of minimizing the total completion time (or makespan) when scheduling independent tasks on two processor types, also known as the \((Pm,Pk)||C_{\max }\) problem. We propose BalancedEstimate and BalancedMakespan, two novel 2-approximation algorithms with low complexity. Their approximation ratio is both on par with the best approximation algorithms using dual approximation techniques (which are, thus, of high complexity) and significantly smaller than the approximation ratio of existing low-cost approximation algorithms. We compared both algorithms by simulations to existing strategies in different scenarios. These simulations showed that their performance is among the best ones in all cases.

[1]  Cédric Augonnet,et al.  StarPU: a unified platform for task scheduling on heterogeneous multicore architectures , 2011, Concurr. Comput. Pract. Exp..

[2]  Terry Cojean,et al.  Scheduling of Linear Algebra Kernels on Multiple Heterogeneous Resources , 2016, 2016 IEEE 23rd International Conference on High Performance Computing (HiPC).

[3]  Dror G. Feitelson,et al.  Workload Modeling for Computer Systems Performance Evaluation , 2015 .

[4]  E.L. Lawler,et al.  Optimization and Approximation in Deterministic Sequencing and Scheduling: a Survey , 1977 .

[5]  Safia Kedad-Sidhoum,et al.  Scheduling independent tasks on multi‐cores with GPU accelerators , 2015, Concurr. Comput. Pract. Exp..

[6]  Erik Saule,et al.  Considerations on Distributed Load Balancing for Fully Heterogeneous Machines: Two Particular Cases , 2015, 2015 IEEE International Parallel and Distributed Processing Symposium Workshop.

[7]  Denis Trystram,et al.  Scheduling Data Flow Program in XKaapi: A New Affinity Based Algorithm for Heterogeneous Architectures , 2014, Euro-Par.

[8]  Klaus Jansen,et al.  A PTAS for Scheduling Unrelated Machines of Few Different Types , 2018, Int. J. Found. Comput. Sci..

[9]  Vincenzo Bonifaci,et al.  Scheduling Unrelated Machines of Few Different Types , 2012, ArXiv.

[10]  Olivier Beaumont,et al.  Approximation Proofs of a Fast and Efficient List Scheduling Algorithm for Task-Based Runtime Systems on Multicores and GPUs , 2017, 2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS).

[11]  Dror G. Feitelson,et al.  The workload on parallel supercomputers: modeling the characteristics of rigid jobs , 2003, J. Parallel Distributed Comput..

[12]  Emmanuel Agullo,et al.  Are Static Schedules so Bad? A Case Study on Cholesky Factorization , 2016, 2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS).

[13]  Eduard Ayguadé,et al.  Leveraging OmpSs to Exploit Hardware Accelerators , 2014, 2014 IEEE 26th International Symposium on Computer Architecture and High Performance Computing.

[14]  Jan Karel Lenstra,et al.  Approximation algorithms for scheduling unrelated parallel machines , 1987, 28th Annual Symposium on Foundations of Computer Science (sfcs 1987).