Scheduling Moldable Tasks with Precedence Constraints and Arbitrary Speedup Functions on Multiprocessors

Due to the increasing number of cores of current parallel machines, the question arises to which cores parallel tasks should be mapped. Thus, parallel task scheduling is now more relevant than ever, especially under the moldable task model, in which tasks are allocated a fixed number of processors before execution. Scheduling algorithms commonly assume that the speedup function of moldable tasks is either non-decreasing, sub-linear or concave. In practice, however, the resulting speedup of parallel programs on current hardware with deep memory hierarchies is most often neither non-decreasing nor concave.

[1]  E.L. Lawler,et al.  Optimization and Approximation in Deterministic Sequencing and Scheduling: a Survey , 1977 .

[2]  Kuldip Singh,et al.  An improved two-step algorithm for task and data parallel scheduling in distributed memory machines , 2006, Parallel Comput..

[3]  Martin Skutella,et al.  Approximation Algorithms for the Discrete Time-Cost Tradeoff Problem , 1997, Math. Oper. Res..

[4]  Robert A. van de Geijn,et al.  SUMMA: scalable universal matrix multiplication algorithm , 1995, Concurr. Pract. Exp..

[5]  Bianca Schroeder,et al.  An Experimental Study of Online Scheduling Algorithms , 2000, Algorithm Engineering.

[6]  Sascha Hunold Low-Cost Tuning of Two-Step Algorithms for Scheduling Mixed-Parallel Applications onto Homogeneous Clusters , 2010, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing.

[7]  Maciej Drozdowski,et al.  Scheduling for Parallel Processing , 2009, Computer Communications and Networks.

[8]  Uwe Schwiegelshohn,et al.  Theory and Practice in Parallel Job Scheduling , 1997, JSSPP.

[9]  Joseph Y.-T. Leung,et al.  Handbook of Scheduling: Algorithms, Models, and Performance Analysis , 2004 .

[10]  Arjan J. C. van Gemund,et al.  A low-cost approach towards mixed task and data parallel scheduling , 2001, International Conference on Parallel Processing, 2001..

[11]  RENAUD LEPÈRE,et al.  Approximation Algorithms for Scheduling Malleable Tasks Under Precedence Constraints , 2001, Int. J. Found. Comput. Sci..

[12]  Klaus Jansen,et al.  Scheduling malleable tasks with precedence constraints , 2012, J. Comput. Syst. Sci..

[13]  Sascha Hunold,et al.  Evolutionary Scheduling of Parallel Tasks Graphs onto Homogeneous Clusters , 2011, 2011 IEEE International Conference on Cluster Computing.

[14]  Nicole Megow,et al.  Scheduling and packing malleable and parallel tasks with precedence constraints of bounded width , 2014, J. Comb. Optim..

[15]  Jack J. Dongarra,et al.  Dense linear algebra solvers for multicore with GPU accelerators , 2010, 2010 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW).

[16]  Frédéric Suter,et al.  A Bi-criteria Algorithm for Scheduling Parallel Task Graphs on Clusters , 2010, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing.

[17]  Klaus Jansen,et al.  An Approximation Algorithm for Scheduling Malleable Tasks Under General Precedence Constraints , 2005, ISAAC.