Dynamic Multilevel Hybrid Scheduling Algorithms for Grid Computing

Abstract A ‘Grid’ is an infrastructure for resource sharing. It is used for large-scale data processing, many of the applications being scientific ones. Grid scheduling is a vital component of a Grid infrastructure. Reliability, efficiency (in terms of time consumption) and effectiveness in resource utilization are the desired characteristics of Grid scheduling systems. Many algorithms have been developed for Grid scheduling. In our previous work, we proposed two scheduling algorithms (the Multilevel Hybrid Scheduling Algorithm and the Multilevel Dual Queue Scheduling Algorithm) for optimum utilization of processors in a Grid computing environment. In this paper, we propose two more flavours of Multilevel Hybrid scheduling algorithms; i.e. the Dynamic Multilevel Hybrid Scheduling Algorithm using Median and the Dynamic Multilevel Hybrid Scheduling Algorithm using Square root. We evaluate our proposed Grid scheduling using real workload traces, taken from leading computational centers. The main idea of the proposed algorithms is to execute jobs optimally, i.e. with minimum average waiting, turnaround and response times. An extensive performance comparison is presented using real workload traces to evaluate the efficiency of scheduling algorithms. To facilitate the research, a software tool has been developed which produces a comprehensive simulation of a number of Grid scheduling algorithms. The tool's output is in the form of scheduling performance metrics. The experimental results, based on performance metrics, demonstrate that the performances of our Grid scheduling algorithms give good results. Our proposed scheduling algorithms also support true scalability, that is, they maintain an efficient approach when increasing the number of processors or nodes. This paper also includes a statistical analysis of real workload traces to present the nature and behavior of jobs. Our proposed scheduling algorithms are unique. They have three key features. First, they favor the shortest job for execution. Second, they execute the job on the basis of a dynamic time quantum, to fairly distribute processor time among Grid jobs. A third feature is that they always execute the longest job, thus avoiding starvation.

[1]  David Fernández-Baca,et al.  Allocating Modules to Processors in a Distributed System , 1989, IEEE Trans. Software Eng..

[2]  Jemal H. Abawajy,et al.  Job Scheduling Policy for High Throughput Grid Computing , 2005, ICA3PP.

[3]  Hui Li,et al.  Workload dynamics on clusters and grids , 2008, The Journal of Supercomputing.

[4]  Dror G. Feitelson,et al.  Backfilling with lookahead to optimize the packing of parallel jobs , 2005, J. Parallel Distributed Comput..

[5]  R Sharma,et al.  Notice of RetractionAn improved resource scheduling approach using Job Grouping strategy in grid computing , 2010, 2010 International Conference on Educational and Network Technology.

[6]  Alan Oxley,et al.  Development and Performance Analysis of Grid Scheduling Algorithms , 2009, IAIT.

[7]  Larry Rudolph,et al.  Metrics and Benchmarking for Parallel Job Scheduling , 1998, JSSPP.

[8]  Alan Oxley,et al.  Hybrid Scheduling and Dual Queue Scheduling , 2009, 2009 2nd IEEE International Conference on Computer Science and Information Technology.

[9]  Subhash Saini,et al.  Local grid scheduling techniques using performance prediction , 2003 .

[10]  Dan Tsafrir,et al.  Backfilling Using System-Generated Predictions Rather than User Runtime Estimates , 2007, IEEE Transactions on Parallel and Distributed Systems.

[11]  Minyi Guo,et al.  High-Performance Computing : Paradigm and Infrastructure , 2005 .

[12]  Evgenia Smirni,et al.  Self-adapting backfilling scheduling for parallel systems , 2002, Proceedings International Conference on Parallel Processing.

[13]  Sandeep Singh Rawat,et al.  Experiments with CPU Scheduling Algorithm on a Computational Grid , 2009, 2009 IEEE International Advance Computing Conference.

[14]  Rami Matarneh,et al.  Self-Adjustment Time Quantum in Round Robin Algorithm Depending on Burst Time of the Now Running Processes , 2009 .

[15]  Ami Marowka,et al.  The GRID: Blueprint for a New Computing Infrastructure , 2000, Parallel Distributed Comput. Pract..

[16]  Andrei Tchernykh,et al.  Idle regulation in non-clairvoyant scheduling of parallel jobs , 2009, Discret. Appl. Math..

[17]  Rajkumar Buyya,et al.  Model-Driven Simulation of Grid Scheduling Strategies , 2007, Third IEEE International Conference on e-Science and Grid Computing (e-Science 2007).

[18]  Alan Oxley,et al.  Analysis and evaluation of grid scheduling algorithms using real workload traces , 2010, MEDES.

[19]  Ian Foster,et al.  The Grid 2 - Blueprint for a New Computing Infrastructure, Second Edition , 1998, The Grid 2, 2nd Edition.

[20]  Jarek Nabrzyski,et al.  Grid resource management: state of the art and future trends , 2004 .

[21]  Anand Sivasubramaniam,et al.  Improving parallel job scheduling by combining gang scheduling and backfilling techniques , 2000, Proceedings 14th International Parallel and Distributed Processing Symposium. IPDPS 2000.