PV-EASY: a strict fairness guaranteed and prediction enabled scheduler in parallel job scheduling

As the most widely used parallel job scheduling strategy in production schedulers, EASY has achieved great success, not only because it can balance fairness and performance, but also because it is universally applicable to most HPC systems. However, unfairness still exists in EASY. For real workloads used in this work, our simulation shows that a blocked job can be delayed by later jobs for more than 90 hours. In addition, EASY cannot directly employ parallel job runtime prediction techniques, because this would lead to a serious situation called reservation violation. In this paper, we aim at guaranteeing strict fairness (no job is delayed by any jobs of lower priority) while achieving attractive performance, and employing prediction without causing reservation violation in parallel job scheduling. We propose two novel strategies, shadow load preemption (SLP) and venture backfilling (VB), which are together integrated into EASY to construct a preemptive venture EASY backfilling (PV-EASY) strategy. Experimental results on three workloads of real HPC systems demonstrate that: First, PV-EASY guarantees strict fairness, in addition to avoiding reservation violation when employing job runtime prediction techniques in scheduling; Second, PV-EASY achieves the same performance as EASY, and outperforms prediction employed EASY; Third, the preemption in PV-EASY is not resource costly and simple enough to be implemented in all HPC systems where EASY works. These advantages make PV-EASY more attractive than EASY in parallel job scheduling, both from academic and industry perspectives.

[1]  Dan Tsafrir,et al.  The Dynamics of Backfilling: Solving the Mystery of Why Increased Inaccuracy May Help , 2006, 2006 IEEE International Symposium on Workload Characterization.

[2]  Benjamin Avi-Itzhak,et al.  QUANTIFYING FAIRNESS IN QUEUEING SYSTEMS: PRINCIPLES AND APPLICATIONS , 2004 .

[3]  Thomas Fahringer,et al.  Predicting the execution time of grid workflow applications through local learning , 2009, Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis.

[4]  Kenneth C. Sevcik,et al.  Application Scheduling and Processor Allocation in Multiprogrammed Parallel Processing Systems , 1994, Perform. Evaluation.

[5]  Shonali Krishnaswamy,et al.  Estimating computation times of data-intensive applications , 2004, IEEE Distributed Systems Online.

[6]  Cynthia Bailey Lee,et al.  Are User Runtime Estimates Inherently Inaccurate? , 2004, JSSPP.

[7]  John Ngubiri,et al.  Characteristics of Fairness Metrics and their Effect on Perceived Scheduler Effectiveness , 2010 .

[8]  Dror G. Feitelson,et al.  Supporting priorities and improving utilization of the IBM SP scheduler using slack-based backfilling , 1999, Proceedings 13th International Parallel Processing Symposium and 10th Symposium on Parallel and Distributed Processing. IPPS/SPDP 1999.

[9]  John E. West,et al.  Scheduling Jobs on Parallel Systems Using a Relaxed Backfill Strategy , 2002, JSSPP.

[10]  Pavan Balaji,et al.  Opportune Job Shredding: An Effective Approach for Scheduling Parameter Sweep Appli , 2003 .

[11]  Ivan Rodero,et al.  The Resource Usage Aware Backfilling , 2009, JSSPP.

[12]  Dror G. Feitelson,et al.  Backfilling with lookahead to optimize the packing of parallel jobs , 2005, J. Parallel Distributed Comput..

[13]  David P. Bunde,et al.  Scheduling Restartable Jobs with Short Test Runs , 2009, JSSPP.

[14]  Zhiling Lan,et al.  Fault-aware, utility-based job scheduling on Blue, Gene/P systems , 2009, 2009 IEEE International Conference on Cluster Computing and Workshops.

[15]  Benjamin Avi-Itzhak,et al.  SQF: A slowdown queueing fairness measure , 2007, Perform. Evaluation.

[16]  Peter J. Keleher,et al.  Randomization, Speculation, and Adaptation in Batch Schedulers , 2000, ACM/IEEE SC 2000 Conference (SC'00).

[17]  Amin Vahdat,et al.  Evaluating the impact of inaccurate information in utility-based scheduling , 2009, Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis.

[18]  Andrea C. Arpaci-Dusseau,et al.  The Impact of More Accurate Requested Runtimes on Production Job Scheduling Performance , 2002, JSSPP.

[19]  James Patton Jones,et al.  Scheduling for Parallel Supercomputing: A Historical Perspective of Achievable Utilization , 1999, JSSPP.

[20]  Richard C. Larson,et al.  OR Forum - Perspectives on Queues: Social Justice and the Psychology of Queueing , 1987, Oper. Res..

[21]  D. Atkin OR scheduling algorithms. , 2000, Anesthesiology.

[22]  P. Sadayappan,et al.  Scheduling of Parallel Jobs in a Heterogeneous Multi-site Environement , 2003, JSSPP.

[23]  Carla E. Brodley,et al.  Predictive application-performance modeling in a computational grid environment , 1999, Proceedings. The Eighth International Symposium on High Performance Distributed Computing (Cat. No.99TH8469).

[24]  Dmitry N. Zotkin,et al.  Job-length estimation and performance in backfilling schedulers , 1999, Proceedings. The Eighth International Symposium on High Performance Distributed Computing (Cat. No.99TH8469).

[25]  Dror G. Feitelson,et al.  Utilization, Predictability, Workloads, and User Runtime Estimates in Scheduling the IBM SP2 with Backfilling , 2001, IEEE Trans. Parallel Distributed Syst..

[26]  Shikharesh Majumdar,et al.  Scheduling in multiprogrammed parallel systems , 1988, SIGMETRICS '88.

[27]  Benjamin Avi-Itzhak,et al.  QUANTIFYING FAIRNESS IN QUEUING SYSTEMS , 2008, Probability in the Engineering and Informational Sciences.

[28]  Dan Tsafrir,et al.  A Short Survey of Commercial Cluster Batch Schedulers , 2005 .

[29]  Benjamin Avi-Itzhak,et al.  A resource-allocation queueing fairness measure , 2004, SIGMETRICS '04/Performance '04.

[30]  Dror G. Feitelson,et al.  On Simulation and Design of Parallel-Systems Schedulers: Are We Doing the Right Thing? , 2009, IEEE Transactions on Parallel and Distributed Systems.

[31]  P. Sadayappan,et al.  Selective Reservation Strategies for Backfill Job Scheduling , 2002, JSSPP.

[32]  Dror G. Feitelson Experimental analysis of the root causes of performance evaluation results: a backfilling case study , 2005, IEEE Transactions on Parallel and Distributed Systems.

[33]  Cynthia Bailey Lee,et al.  On the User–Scheduler Dialogue: Studies of User-Provided Runtime Estimates and Utility Functions , 2006, Int. J. High Perform. Comput. Appl..

[34]  P. Sadayappan,et al.  Job fairness in non-preemptive job scheduling , 2004, International Conference on Parallel Processing, 2004. ICPP 2004..

[35]  Marco Aurélio Amaral Henriques,et al.  Contention-sensitive static performance prediction for parallel distributed applications , 2006, Perform. Evaluation.

[36]  Andrew V. Goldberg,et al.  Quincy: fair scheduling for distributed computing clusters , 2009, SOSP '09.

[37]  L. Mann Queue Culture: The Waiting Line as a Social System , 1969, American Journal of Sociology.

[38]  Yuichi Inadomi,et al.  Performance prediction of large-scale parallell system and application using macro-level simulation , 2008, 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis.

[39]  David A. Lifka,et al.  The ANL/IBM SP Scheduling System , 1995, JSSPP.

[40]  Jiri Sgall,et al.  On-line scheduling --- a survey , 1997 .

[41]  Cynthia Bailey Lee,et al.  Precise and realistic utility functions for user-centric performance analysis of schedulers , 2007, HPDC '07.

[42]  Dan Tsafrir,et al.  Backfilling Using System-Generated Predictions Rather than User Runtime Estimates , 2007, IEEE Transactions on Parallel and Distributed Systems.