Scheduling Moldable Jobs on Failure-Prone Platforms

This paper focuses on the resilient scheduling of moldable parallel jobson high-performance computing (HPC) platforms. Moldable jobs allow for choosing aprocessor allocation before execution, and their execution time obeys various speedup models. The scheduling objective is to minimize the overall completion time, or makespan, assuming that jobs are subject to arbitrary failure scenarios, and hence may need to bere-executed each time they fail until they complete successfully. This work generalizes the classical framework where jobs are known offline and do not fail. We introduce alist-based algorithm, and prove new approximation ratios for three prominent speedupmodels (roofline, communication, Amdahl). We also introduce a batch-based algorithm,where each job is allowed only a restricted number of failures per batch, and prove a new approximation ratio for the arbitrary speedup model. We conduct an extensive set of simulations to evaluate and compare different variants of the two algorithms, and the results show that they consistently outperform the baseline heuristics. In particular, the list algorithm performs better for the roofline and communication models, while the batch algorithm has better performance for the Amdahl’s model. Overall, our best algorithm is within a factor of 1.47 of a lower bound on average over the whole set of experiments, and within a factor of 1.8 in the worst case.

[1]  Larry Rudolph,et al.  Towards Convergence in Job Schedulers for Parallel Supercomputers , 1996, JSSPP.

[2]  Frédéric Vivien,et al.  Scheduling Trees of Malleable Tasks for Sparse Linear Algebra , 2014, Euro-Par.

[3]  Chih-Ping Chu,et al.  A 3.42-Approximation Algorithm for Scheduling Malleable Tasks under Precedence Constraints , 2013, IEEE Transactions on Parallel and Distributed Systems.

[4]  Zizhong Chen,et al.  Online-ABFT: an online algorithm based fault tolerance scheme for soft error detection in iterative methods , 2013, PPoPP '13.

[5]  Klaus Jansen,et al.  Scheduling Monotone Moldable Jobs in Linear Time , 2017, 2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS).

[6]  Klaus Jansen,et al.  An approximation algorithm for scheduling malleable tasks under general precedence constraints , 2005, TALG.

[7]  Berit Johannes,et al.  Scheduling parallel jobs to minimize the makespan , 2006, J. Sched..

[8]  Klaus Jansen,et al.  Linear-Time Approximation Schemes for Scheduling Malleable Parallel Tasks , 1999, SODA '99.

[9]  RENAUD LEPÈRE,et al.  Approximation Algorithms for Scheduling Malleable Tasks Under Precedence Constraints , 2001, Int. J. Found. Comput. Sci..

[10]  Guochuan Zhang,et al.  A note on online strip packing , 2009, J. Comb. Optim..

[11]  Joseph Y.-T. Leung,et al.  Complexity of Scheduling Parallel Task Systems , 1989, SIAM J. Discret. Math..

[12]  Jessen T. Havill,et al.  Improved upper bounds for online malleable job scheduling , 2015, J. Sched..

[13]  Chi-Yeh Chen An Improved Approximation for Scheduling Malleable Tasks with Precedence Constraints via Iterative Method , 2018, IEEE Transactions on Parallel and Distributed Systems.

[14]  Prasoon Tiwari,et al.  Scheduling malleable and nonmalleable parallel tasks , 1994, SODA '94.

[15]  Y. Robert,et al.  Fault-Tolerance Techniques for High-Performance Computing , 2015, Computer Communications and Networks.

[16]  Philip S. Yu,et al.  Approximate algorithms scheduling parallelizable tasks , 1992, SPAA '92.

[17]  Weizhen Mao,et al.  Online scheduling of malleable parallel jobs , 2007 .

[18]  Prithviraj Banerjee,et al.  An Approximate Algorithm for the Partitionable Independent Task Scheduling Problem , 1990, ICPP.

[19]  Anne Benoit,et al.  Design and Comparison of Resilient Scheduling Heuristics for Parallel Jobs , 2020, 2020 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW).

[20]  Klaus Jansen,et al.  Approximation Algorithms for Scheduling Parallel Jobs , 2010, SIAM J. Comput..

[21]  Franck Cappello,et al.  Lightweight and Accurate Silent Data Corruption Detection in Ordinary Differential Equation Solvers , 2016, Euro-Par.

[22]  Denis Trystram,et al.  A 3/2-Approximation Algorithm for Scheduling Independent Monotonic Malleable Tasks , 2007, SIAM J. Comput..

[23]  Feng Gao,et al.  Fault tolerant matrix-matrix multiplication: correcting soft errors on-line , 2011, ScalA '11.

[24]  Kam-Hoi Cheng,et al.  A Heuristic of Scheduling Parallel Tasks and its Analysis , 1992, SIAM J. Comput..

[25]  Jacek Blazewicz,et al.  Approximation Algorithms for Scheduling Independent Malleable Tasks , 2001, Euro-Par.

[26]  G. Amdhal,et al.  Validity of the single processor approach to achieving large scale computing capabilities , 1967, AFIPS '67 (Spring).

[27]  Santosh Pande,et al.  LADR: low-cost application-level detector for reducing silent output corruptions , 2018, HPDC.

[28]  Klaus Jansen,et al.  Scheduling malleable tasks with precedence constraints , 2005, SPAA '05.

[29]  Klaus Jansen,et al.  A(3/2+ε) approximation algorithm for scheduling moldable and non-moldable parallel tasks , 2012, SPAA '12.

[30]  Guochuan Zhang,et al.  Online scheduling of moldable parallel tasks , 2018, J. Sched..

[31]  Hans P. Muhlfeld,et al.  Cosmic ray soft error rates of 16-Mb DRAM memory chips , 1998, IEEE J. Solid State Circuits.

[32]  Denis Trystram,et al.  Efficient approximation algorithms for scheduling malleable tasks , 1999, SPAA '99.

[33]  Johann Hurink,et al.  Online Algorithm for Parallel Job Scheduling and Strip Packing , 2007, WAOA.

[34]  Anja Feldmann,et al.  Optimal On-Line Scheduling of Parallel Jobs with Dependencies , 1998, J. Comb. Optim..

[35]  Martin Skutella,et al.  Approximation Algorithms for the Discrete Time-Cost Tradeoff Problem , 1997, Math. Oper. Res..

[36]  Weizhen Mao,et al.  Competitive online scheduling of perfectly malleable jobs with setup times , 2008, Eur. J. Oper. Res..

[37]  T. J. O'Gorman The effect of cosmic rays on the soft error rate of a DRAM at ground level , 1994 .

[38]  G. N. Srinivasa Prasanna,et al.  The optimal control approach to generalized multiprocessor scheduling , 2005, Algorithmica.

[39]  Prithviraj Banerjee,et al.  A scheduling algorithm for parallelizable dependent tasks , 1991, [1991] Proceedings. The Fifth International Parallel Processing Symposium.

[40]  Franck Cappello,et al.  Addressing failures in exascale computing , 2014, Int. J. High Perform. Comput. Appl..