Resource Management in Heterogeneous Parallel Computing Environments with Soft and Hard Deadlines

Heuristic-based approaches are often used to perform the assignment and scheduling of parallel high-performance computing applications on machines (as it is an NP-hard problem). The timevarying importance of tasks has been represented using monotonically-decreasing value functions. The value function of a task gives the value that will be earned based on when the task completes execution and is a flexible method to incorporate soft and hard deadlines. To measure the amount of useful work accomplished in an oversubscribed heterogeneous parallel computing environment, we conduct a simulation study to evaluate the performance of schedulers based on the total value they earn from executing tasks. To optimize for such an objective (as opposed to traditional response timebased or fairness-based objectives), requires the design of novel parallel scheduling metaheuristics. We design new metaheuristics and create a new concept of “place-holder” tasks that gives our metaheuristics the advantages of reservations, but also allows future tasks to nullify those reservations. We examine the performance of our metaheuristics and other popular parallel scheduling techniques such as EASY Backfilling and Conservative Backfilling in a variety of environmental conditions. Our real workload trace-driven simulations show that our metaheuristics that use the concept of place-holders consistently outperform the other techniques, in terms of the total value earned, improving on average EASY Backfilling by over 50% and Conservative Backfilling by over 100%.

[1]  R. F. Freund,et al.  Dynamic Mapping of a Class of Independent Tasks onto Heterogeneous Computing Systems , 1999, J. Parallel Distributed Comput..

[2]  Nicholas Bambos,et al.  Adaptive data-aware utility-based scheduling in resource-constrained systems , 2010, J. Parallel Distributed Comput..

[3]  Dror G. Feitelson,et al.  Utilization, Predictability, Workloads, and User Runtime Estimates in Scheduling the IBM SP2 with Backfilling , 2001, IEEE Trans. Parallel Distributed Syst..

[4]  Cynthia Bailey Lee,et al.  Precise and realistic utility functions for user-centric performance analysis of schedulers , 2007, HPDC '07.

[5]  Ishfaq Ahmad,et al.  Optimal task assignment in heterogeneous distributed computing systems , 1998, IEEE Concurr..

[6]  Howard Jay Siegel,et al.  Representing Task and Machine Heterogeneities for Heterogeneous Computing Systems , 2000 .

[7]  Oscar H. Ibarra,et al.  Heuristic Algorithms for Scheduling Independent Tasks on Nonidentical Processors , 1977, JACM.

[8]  Ladislau Bölöni,et al.  A Comparison of Eleven Static Heuristics for Mapping a Class of Independent Tasks onto Heterogeneous Distributed Computing Systems , 2001, J. Parallel Distributed Comput..

[9]  Viktor K. Prasanna,et al.  Heterogeneous computing: challenges and opportunities , 1993, Computer.

[10]  Dalibor Klusácek,et al.  Performance and Fairness for Users in Parallel Job Scheduling , 2012, JSSPP.

[11]  Arif Ghafoor,et al.  A distributed heterogeneous supercomputing management system , 1993, Computer.

[12]  P. Sadayappan,et al.  Scheduling of Parallel Jobs in a Heterogeneous Multi-site Environement , 2003, JSSPP.

[13]  Keqin Li,et al.  Guarantee Strict Fairness and UtilizePrediction Better in Parallel Job Scheduling , 2014, IEEE Transactions on Parallel and Distributed Systems.

[14]  Gregory A. Koenig,et al.  Utility Functions and Resource Management in an Oversubscribed Heterogeneous Computing Environment , 2015, IEEE Transactions on Computers.

[15]  Anthony A. Maciejewski,et al.  Dynamically mapping tasks with priorities and multiple deadlines in a heterogeneous environment , 2007, J. Parallel Distributed Comput..

[16]  David A. Lifka,et al.  The ANL/IBM SP Scheduling System , 1995, JSSPP.