A Survey of Job Scheduling in Grids

The problem of optimally scheduling tasks onto heterogeneous grid resources so as to minimize their makespan has been proven NP-complete. No single scheduling algorithm is best for all grid computing systems. An alternative is to select a scheduling algorithm suited to a given grid environment, based on the characteristics of its tasks, machines, and network connectivity. This paper surveys the problem and discusses several aspects of job scheduling in grids: (a) fault tolerance, (b) security, and (c) simulation of grid job scheduling strategies. It also discusses future research topics and open challenges in grid job scheduling.
