A Survey on Grid Scheduling Systems

Thanks to vast improvements in wide-area network performance and powerful yet low-cost computers, Grid computing has emerged as a promising attractive computing paradigm. Computational Grids aim to aggregate the power of heterogeneous, geographically distributed, multiple-domain-spanning computational resources to provide high performance or high-throughput computing. To achieve the promising potentials of computational Grids, an effective and efficient scheduling system is fundamentally important. Scheduling systems for traditional distributed environments do not work in Grid environments because the two classes of environments are radically distinct. Scheduling in Grid environments is significantly complicated by the unique characteristics of Grids. This survey focuses on the design of scheduling systems for computational Grids. First, this survey investigates challenges for designing scheduling system for computational Grids. Second, a Grid scheduling framework and a common Grid scheduler architecture are proposed. Next, a comprehensive taxonomy for describing Grid scheduling systems is also presented. Finally, a number of representative Grid scheduling systems are surveyed in detail.

[1]  P. Gradwell Overview of Grid Scheduling Systems , 2022 .

[2]  Warren Smith,et al.  Software infrastructure for the I-WAY metacomputing experiment , 1998, Concurrency Practice and Experience.

[3]  Peter M. A. Sloot,et al.  Dynamite - Blasting Obstacles to Parallel Cluster Computing , 1999, HPCN Europe.

[4]  Chuang Liu,et al.  Design and evaluation of a resource selection framework for Grid applications , 2002, Proceedings 11th IEEE International Symposium on High Performance Distributed Computing.

[5]  Subhash Saini,et al.  Local grid scheduling techniques using performance prediction , 2003 .

[6]  Derek Wright,et al.  Cheap cycles from the desktop to the dedicated cluster: combining opportunistic and dedicated scheduling with Condor , 2007 .

[7]  David P. Anderson,et al.  SETI@home-massively distributed computing for SETI , 2001, Comput. Sci. Eng..

[8]  Richard Wolski,et al.  The network weather service: a distributed resource performance forecasting service for metacomputing , 1999, Future Gener. Comput. Syst..

[9]  Thomas L. Casavant,et al.  A Taxonomy of Scheduling in General-Purpose Distributed Computing Systems , 1988, IEEE Trans. Software Eng..

[10]  Ian T. Foster,et al.  Globus: a Metacomputing Infrastructure Toolkit , 1997, Int. J. High Perform. Comput. Appl..

[11]  Rajkumar Buyya,et al.  A taxonomy and survey of grid resource management systems for distributed computing , 2002, Softw. Pract. Exp..

[12]  Ramin Yahyapour,et al.  Design and evaluation of job scheduling strategies for grid computing , 2000, GRID.

[13]  Steven Tuecke,et al.  The Physiology of the Grid An Open Grid Services Architecture for Distributed Systems Integration , 2002 .

[14]  David Abramson,et al.  An Economy Driven Resource Management Architecture for Global Computational Power Grids , 2000, PDPTA.

[15]  Henri Casanova,et al.  A Modular Scheduling Approach for Grid Application Development Environments , 2002 .

[16]  Editors , 1986, Brain Research Bulletin.

[17]  Steve J. Chapin,et al.  Distributed and multiprocessor scheduling , 1996, CSUR.

[18]  Francine Berman,et al.  Toward a framework for preparing and executing adaptive grid programs , 2002, Proceedings 16th International Parallel and Distributed Processing Symposium.

[19]  David Abramson,et al.  Nimrod: a tool for performing parametrised simulations using distributed workstations , 1995, Proceedings of the Fourth IEEE International Symposium on High Performance Distributed Computing.

[20]  Veljko Milutinovic,et al.  A survey of heterogeneous computing: concepts and systems , 1996, Proc. IEEE.

[21]  Jonathan Armstrong,et al.  Introduction to grid computing with globus , 2003 .

[22]  Francine Berman,et al.  Application-Level Scheduling on Distributed Heterogeneous Networks , 1996, Proceedings of the 1996 ACM/IEEE Conference on Supercomputing.

[23]  Satoshi Matsuoka,et al.  Overview of a performance evaluation system for global computing scheduling algorithms , 1999, Proceedings. The Eighth International Symposium on High Performance Distributed Computing (Cat. No.99TH8469).

[24]  Warren Smith,et al.  A Resource Management Architecture for Metacomputing Systems , 1998, JSSPP.

[25]  Mario J. Gonzalez Deterministic Processor Scheduling , 1977, CSUR.

[26]  David Abramson,et al.  High performance parametric modeling with Nimrod/G: killer application for the global grid? , 2000, Proceedings 14th International Parallel and Distributed Processing Symposium. IPDPS 2000.

[27]  H. G. Rotithor Taxonomy of dynamic task scheduling schemes in distributed computing systems , 1994 .

[28]  Muthucumaru Maheswaran,et al.  Distributed dynamic scheduling of composite tasks on grid computing systems , 2002, Proceedings 16th International Parallel and Distributed Processing Symposium.

[29]  Francine Berman,et al.  Scheduling from the perspective of the application , 1996, Proceedings of 5th IEEE International Symposium on High Performance Distributed Computing.

[30]  Francine Berman,et al.  The AppLeS Project: A Status Report , 1997 .

[31]  Francine Berman,et al.  Heuristics for scheduling parameter sweep applications in grid environments , 2000, Proceedings 9th Heterogeneous Computing Workshop (HCW 2000) (Cat. No.PR00556).

[32]  Rajesh Raman,et al.  Matchmaking: distributed resource management for high throughput computing , 1998, Proceedings. The Seventh International Symposium on High Performance Distributed Computing (Cat. No.98TB100244).

[33]  John F. Karpovich,et al.  Resource management in Legion , 1999, Future Gener. Comput. Syst..

[34]  Francine Berman,et al.  Overview of the Book: Grid Computing – Making the Global Infrastructure a Reality , 2003 .

[35]  Andrew S. Grimshaw,et al.  The core Legion object model , 1996, Proceedings of 5th IEEE International Symposium on High Performance Distributed Computing.

[36]  Miron Livny,et al.  Condor-a hunter of idle workstations , 1988, [1988] Proceedings. The 8th International Conference on Distributed.

[37]  Jean-Yves Girard,et al.  Enabling applications for grid computing with globus , 2003 .

[38]  K. Mani Chandy,et al.  A General Resource Reservation Framework for Scientific Computing , 1997, ISCOPE.

[39]  Nicholas R. Jennings,et al.  The Evolution of the Grid , 2003 .

[40]  Steven Tuecke,et al.  The Anatomy of the Grid , 2003 .

[41]  David Abramson,et al.  Nimrod/G: an architecture for a resource management and scheduling system in a global computational grid , 2000, Proceedings Fourth International Conference/Exhibition on High Performance Computing in the Asia-Pacific Region.

[42]  Rajkumar Buyya,et al.  Nature's heuristics for scheduling jobs on Computational Grids , 2000 .

[43]  R. F. Freund,et al.  Dynamic Mapping of a Class of Independent Tasks onto Heterogeneous Computing Systems , 1999, J. Parallel Distributed Comput..

[44]  Ian T. Foster,et al.  Condor-G: A Computation Management Agent for Multi-Institutional Grids , 2004, Cluster Computing.

[45]  Greg Lindahl,et al.  Metacomputing – What’s in it for me? , 1995 .

[46]  Bin Yao,et al.  A taxonomy for describing matching and scheduling heuristics for mixed-machine heterogeneous computing systems , 1998, Proceedings Seventeenth IEEE Symposium on Reliable Distributed Systems (Cat. No.98CB36281).

[47]  Ian T. Foster,et al.  A security architecture for computational grids , 1998, CCS '98.

[48]  Geoffrey C. Fox,et al.  Cluster Computing Review , 1995 .

[49]  Alexander Reinefeld,et al.  MARS - A framework for minimizing the job execution time in a metacomputing environment , 1996, Future Gener. Comput. Syst..

[50]  Javier Jaén Martínez,et al.  Data Management in an International Data Grid Project , 2000, GRID.

[51]  Michael J. Flynn,et al.  Very high-speed computing systems , 1966 .

[52]  Jack Dongarra,et al.  PVM: Parallel virtual machine: a users' guide and tutorial for networked parallel computing , 1995 .

[53]  Ladislau Bölöni,et al.  A comparison study of static mapping heuristics for a class of meta-tasks on heterogeneous computing systems , 1999, Proceedings. Eighth Heterogeneous Computing Workshop (HCW'99).

[54]  Rajkumar Buyya,et al.  Architectural Models for Resource Management in the Grid , 2000, GRID.

[55]  Y.-K. Kwok,et al.  Static scheduling algorithms for allocating directed task graphs to multiprocessors , 1999, CSUR.

[56]  John Shalf,et al.  The Cactus Worm: Experiments with Dynamic Resource Discovery and Allocation in a Grid Environment , 2001, Int. J. High Perform. Comput. Appl..

[57]  Warren Smith,et al.  A directory service for configuring high-performance distributed computations , 1997, Proceedings. The Sixth IEEE International Symposium on High Performance Distributed Computing (Cat. No.97TB100183).