Toward balanced and sustainable job scheduling for production supercomputers

Job scheduling on production supercomputers is complicated by diverse demands of system administrators and amorphous characteristics of workloads. Specifically, various scheduling goals such as queuing efficiency and system utilization are usually conflicting and thus need to be balanced. Also, changing workload characteristics often impact the effectiveness of the deployed scheduling policies. Thus it is challenging to design a versatile scheduling policy that is effective in all circumstances. In this paper, we propose a novel job scheduling strategy to balance diverse scheduling goals and mitigate the impact of workload characteristics. First, we introduce metric-aware scheduling, which enables the scheduler to balance competing scheduling goals represented by different metrics such as job waiting time, fairness, and system utilization. Second, we design a scheme to dynamically adjust scheduling policies based on feedback information of monitored metrics at runtime. We evaluate our design using real workloads from supercomputer centers. The results demonstrate that our scheduling mechanism can significantly improve system performance in a balanced, sustainable fashion.

[1]  Anat Rafaeli,et al.  The Effects of Queue Structure on Attitudes , 2002 .

[2]  Achim Streit A Self-Tuning Job Scheduler Family with Dynamic Policy Switching , 2002, JSSPP.

[3]  Zhiling Lan,et al.  Reducing Energy Costs for IBM Blue Gene/P via Power-Aware Job Scheduling , 2013, JSSPP.

[4]  Bharadwaj Veeravalli,et al.  Design and performance evaluation of combined first-fit task allocation and migration strategies in mesh multiprocessor systems , 2008, Parallel Comput..

[5]  John E. West,et al.  Scheduling Jobs on Parallel Systems Using a Relaxed Backfill Strategy , 2002, JSSPP.

[6]  Dan Tsafrir,et al.  A Short Survey of Commercial Cluster Batch Schedulers , 2005 .

[7]  Achim Streit Evaluation of an unfair decider mechanism for the self-tuning dynP job scheduler , 2004, 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings..

[8]  Achim Streit,et al.  On the comparison of CPLEX-computed job schedules with the self-tuning dynP job scheduler , 2004, 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings..

[9]  Zhiling Lan,et al.  Job scheduling with adjusted runtime estimates on production supercomputers , 2013, J. Parallel Distributed Comput..

[10]  Zhiling Lan,et al.  Fault-aware, utility-based job scheduling on Blue, Gene/P systems , 2009, 2009 IEEE International Conference on Cluster Computing and Workshops.

[11]  Zhiling Lan,et al.  Adaptive Metric-Aware Job Scheduling for Production Supercomputers , 2012, 2012 41st International Conference on Parallel Processing Workshops.

[12]  Dror G. Feitelson,et al.  Utilization, Predictability, Workloads, and User Runtime Estimates in Scheduling the IBM SP2 with Backfilling , 2001, IEEE Trans. Parallel Distributed Syst..

[13]  P. Sadayappan,et al.  Selective Reservation Strategies for Backfill Job Scheduling , 2002, JSSPP.

[14]  C. V. Ramamoorthy,et al.  Aspects of a Dynamically Adaptive Operating System , 1976, IEEE Transactions on Computers.

[15]  Zhiling Lan,et al.  Automatic and coordinated job recovery for high performance computing , 2010, 2010 3rd Workshop on Many-Task Computing on Grids and Supercomputers.

[16]  Guangwen Yang,et al.  PV-EASY: a strict fairness guaranteed and prediction enabled scheduler in parallel job scheduling , 2010, HPDC '10.

[17]  Mark J. Clement,et al.  Core Algorithms of the Maui Scheduler , 2001, JSSPP.

[18]  Zhiling Lan,et al.  Reducing Fragmentation on Torus-Connected Supercomputers , 2011, 2011 IEEE International Parallel & Distributed Processing Symposium.

[19]  Saad Bani-Mohammad,et al.  A new window-based job scheduling scheme for 2D mesh multicomputers , 2011, Simul. Model. Pract. Theory.

[20]  P. Sadayappan,et al.  Job fairness in non-preemptive job scheduling , 2004 .

[21]  Anand Sivasubramaniam,et al.  Improving parallel job scheduling by combining gang scheduling and backfilling techniques , 2000, Proceedings 14th International Parallel and Distributed Processing Symposium. IPDPS 2000.

[22]  Diwakar Krishnamurthy,et al.  Towards automated HPC scheduler configuration tuning , 2011, Concurr. Comput. Pract. Exp..

[23]  Dror G. Feitelson,et al.  Backfilling with lookahead to optimize the packing of parallel jobs , 2005, J. Parallel Distributed Comput..

[24]  Susan Coghlan,et al.  Petascale System Management Experiences , 2008, LISA.

[25]  Benjamin Avi-Itzhak,et al.  A resource-allocation queueing fairness measure , 2004, SIGMETRICS '04/Performance '04.

[26]  Sang Hyuk Son,et al.  Design and evaluation of a feedback control EDF scheduling algorithm , 1999, Proceedings 20th IEEE Real-Time Systems Symposium (Cat. No.99CB37054).