Performance Analysis of Grid DAG Scheduling Algorithms using MONARC Simulation Tool

This paper presents a solution to analyze the performance of grid scheduling algorithms for tasks with dependencies. Finding the optimal procedures for DAG scheduling in Grid systems is important due to the latest computing necessities: large scale distributed computing and complex applications for different research areas. We propose a solution to evaluate DAG scheduling algorithms using simulation, an approach suitable to evaluate different scheduling algorithms using various task dependencies and considering a wide range of Grid system architectures. Our proposed solution is based on MONARC, a generic simulation framework designed for modeling large scale distributed systems. We present our research results in extending the simulation platform to accommodate various DAG scheduling procedures and, as a case study, we present a critical analysis of four well known DAG scheduling strategies: CCF (Cluster ready Children First), ETF (Earliest Time First), HLFET (Highest Level First with Estimated Times) and Hybrid Remapper. The obtained results show that the proposed solution is a very good instrument for evaluating performance in case of a wide range of DAG scheduling algorithms.

[1]  Howard Jay Siegel,et al.  A dynamic matching and scheduling algorithm for heterogeneous computing systems , 1998, Proceedings Seventh Heterogeneous Computing Workshop (HCW'98).

[2]  Ken Kennedy,et al.  Scheduling workflow applications in GrADS , 2004, IEEE International Symposium on Cluster Computing and the Grid, 2004. CCGrid 2004..

[3]  Frank D. Anger,et al.  Scheduling Precedence Graphs in Systems with Interprocessor Communication Times , 1989, SIAM J. Comput..

[4]  Valentin Cristea,et al.  A decentralized strategy for genetic scheduling in heterogeneous environments , 2006, Multiagent Grid Syst..

[5]  Ishfaq Ahmad,et al.  Optimal task assignment in heterogeneous distributed computing systems , 1998, IEEE Concurr..

[6]  C.M. Dobre,et al.  A Simulation Model for Large Scale Distributed Systems , 2007, 2007 Innovations in Information Technologies (IIT).

[7]  Daniel Gajski,et al.  A Programming Aid for Message-passing Systems , 1987, PPSC.

[8]  Chien-Chung Shen,et al.  A Graph Matching Approach to Optimal Task Assignment in Distributed Computing Systems Using a Minimax Criterion , 1985, IEEE Trans. Computers.

[9]  Dorian Gorgan,et al.  Satellite Image Processing Applications in MedioGRID , 2006, 2006 Fifth International Symposium on Parallel and Distributed Computing.

[10]  Valentin Cristea,et al.  A decentralized strategy for genetic scheduling in heterogeneous environments , 2007, Grid 2007.

[11]  Daniel Gajski,et al.  Hypertool: A Programming Aid for Message-Passing Systems , 1990, IEEE Trans. Parallel Distributed Syst..

[12]  David Fernández-Baca,et al.  Allocating Modules to Processors in a Distributed System , 1989, IEEE Trans. Software Eng..

[13]  Oscar H. Ibarra,et al.  Heuristic Algorithms for Scheduling Independent Tasks on Nonidentical Processors , 1977, JACM.

[14]  A. Forti,et al.  Dag scheduling for grid computing systems , 2006 .

[15]  K. Mani Chandy,et al.  A comparison of list schedules for parallel processing systems , 1974, Commun. ACM.