CrossBroker: A Grid Metascheduler for Interactive and Parallel Jobs

Execution of parallel and interactive applications on a Grid environment is a challenging problem that requires the cooperation of several middleware tools and services. In this paper, we present our experiences in the development of CrossBroker, a job management service that provides transparent and reliable support for such types of applications. We outline the main components of CrossBroker and how they interact with other middleware services. We also describe specific features of the scheduler used to guarantee resource co-allocation for running MPI jobs remotely over multiple machines spread across several Grid sites or to start interactive applications as fast as possible. These features include a simple time-sharing mechanism that allows fast execution of interactive applications even under heavy occupancy of Grid resources.

[1]  Ian T. Foster,et al.  MPICH-G2: A Grid-enabled implementation of the Message Passing Interface , 2002, J. Parallel Distributed Comput..

[2]  Elisa Heymann,et al.  Resource Management for Interactive Jobs in a Grid Environment , 2006, 2006 IEEE International Conference on Cluster Computing.

[3]  Katarzyna Rycerz,et al.  CrossGrid - tools and services for interactive grid applications , 2004 .

[4]  Ian T. Foster,et al.  Globus: a Metacomputing Infrastructure Toolkit , 1997, Int. J. High Perform. Comput. Appl..

[5]  Dieter Kranzlmüller,et al.  glogin - a multifunctional, interactive tunnel into the grid , 2004, Fifth IEEE/ACM International Workshop on Grid Computing.

[6]  Jennifer M. Schopf,et al.  Ten actions when Grid scheduling: the user as a Grid scheduler , 2004 .

[7]  Anna Sikora,et al.  Workflow Management in the CrossGrid Project , 2005, EGC.

[8]  Michael M. Resch,et al.  Towards Efficient Execution of MPI Applications on the Grid: Porting and Optimization Issues , 2003, Journal of Grid Computing.

[9]  Elisa Heymann,et al.  Supporting Efficient Execution of MPI Applications Across Multiple Sites , 2006, Euro-Par.

[10]  Rajesh Raman,et al.  Matchmaking: distributed resource management for high throughput computing , 1998, Proceedings. The Seventh International Symposium on High Performance Distributed Computing (Cat. No.98TB100244).