Evolutionary Scheduling of Parallel Tasks Graphs onto Homogeneous Clusters

Parallel task graphs (PTGs) arise when parallel programs are combined to larger applications, e.g., scientific workflows. Scheduling these PTGs onto clusters is a challenging problem due to the additional degree of parallelism stemming from moldable tasks. Most algorithms are based on the assumption that the execution time of a parallel task is monotonically decreasing as the number of processors increases. But this assumption does not hold in practice since parallel programs often perform better if the number of processors is a multiple of internally used block sizes. In this article, we introduce the Evolutionary Moldable Task Scheduling (EMTS) algorithm for scheduling static PTGs onto homogeneous clusters. We apply an evolutionary approach to determine the processor allocation of each task. The evolutionary strategy ensures that EMTS can be used with any underlying model for predicting the execution time of moldable tasks. With the purpose of finding solutions quickly, EMTS considers results of other heuristics (e.g., HCPA, MCPA) as starting solutions. The experimental results show that EMTS significantly reduces the make span of PTGs compared to other heuristics for both non-monotonically and monotonically decreasing models.

[1]  Joel H. Saltz,et al.  An Integrated Approach to Locality-Conscious Processor Allocation and Scheduling of Mixed-Parallel Applications , 2009, IEEE Transactions on Parallel and Distributed Systems.

[2]  Arnold L. Rosenberg,et al.  A Comparison of Dag-Scheduling Strategies for Internet-Based Computing , 2007, 2007 IEEE International Parallel and Distributed Processing Symposium.

[3]  Tchimou N'Takpé,et al.  Critical path and area based scheduling of parallel task graphs on heterogeneous platforms , 2006, 12th International Conference on Parallel and Distributed Systems - (ICPADS'06).

[4]  Günter Rudolph,et al.  Contemporary Evolution Strategies , 1995, ECAL.

[5]  Kuldip Singh,et al.  An improved two-step algorithm for task and data parallel scheduling in distributed memory machines , 2006, Parallel Comput..

[6]  Franz Rothlauf,et al.  Representations for genetic and evolutionary algorithms , 2002, Studies in Fuzziness and Soft Computing.

[7]  Carsten Franke,et al.  Greedy scheduling with custom-made objectives , 2010, Ann. Oper. Res..

[8]  F. Suter Scheduling Δ-Critical Tasks in mixed-parallel applications on a national grid , 2007, 2007 8th IEEE/ACM International Conference on Grid Computing.

[9]  Nicole Megow,et al.  Scheduling and Packing Malleable Tasks with Precedence Constraints of Bounded Width , 2009, WAOA.

[10]  Frédéric Suter,et al.  A Bi-criteria Algorithm for Scheduling Parallel Task Graphs on Clusters , 2010, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing.

[11]  Sachin S. Sapatnekar,et al.  A Framework for Exploiting Task and Data Parallelism on Distributed Memory Multicomputers , 1997, IEEE Trans. Parallel Distributed Syst..

[12]  Franz Rothlauf,et al.  Representations for genetic and evolutionary algorithms (2. ed.) , 2006 .

[13]  Arjan J. C. van Gemund,et al.  A low-cost approach towards mixed task and data parallel scheduling , 2001, International Conference on Parallel Processing, 2001..

[14]  Kuo-Chi Lin,et al.  An incremental genetic algorithm approach to multiprocessor scheduling , 2004, IEEE Transactions on Parallel and Distributed Systems.

[15]  Nicholas J. Wright,et al.  Modeling and predicting application performance on parallel computers using HPC challenge benchmarks , 2008, 2008 IEEE International Symposium on Parallel and Distributed Processing.

[16]  Thomas Rauber,et al.  Redistribution aware two-step scheduling for mixed-parallel applications , 2008, 2008 IEEE International Conference on Cluster Computing.

[17]  Allen B. Downey,et al.  A Model For Speedup of Parallel Programs , 1997 .

[18]  Frédéric Suter,et al.  One-step algorithm for mixed data and task parallel scheduling without data replication , 2003, Proceedings International Parallel and Distributed Processing Symposium.

[19]  G. Amdhal,et al.  Validity of the single processor approach to achieving large scale computing capabilities , 1967, AFIPS '67 (Spring).

[20]  Jacques Carlier,et al.  Handbook of Scheduling - Algorithms, Models, and Performance Analysis , 2004 .

[21]  Jack Dongarra,et al.  ScaLAPACK Users' Guide , 1987 .

[22]  Francine Berman,et al.  A model for moldable supercomputer jobs , 2001, Proceedings 15th International Parallel and Distributed Processing Symposium. IPDPS 2001.

[23]  Arjan J. C. van Gemund,et al.  CPR: mixed task and data parallel scheduling for distributed systems , 2001, Proceedings 15th International Parallel and Distributed Processing Symposium. IPDPS 2001.

[24]  Sascha Hunold Low-Cost Tuning of Two-Step Algorithms for Scheduling Mixed-Parallel Applications onto Homogeneous Clusters , 2010, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing.