Integrating Computing Resources on Multiple Grid-Enabled Job Scheduling Systems Through a Grid RPC System

We present a framework for a parallel programming model by remote procedure calls, which bridge large-scale computing resource pools managed by multiple Grid-enabled job scheduling systems. With this system, the user can exploit not only remote servers and clusters, but also the computing resources provided by Grid-enabled job scheduling systems located on different sites. This framework requires a Grid remote procedure call (RPC) system to decouple the computation in a remote node from the Grid RPC mechanism and uses document-based communication rather than connection-based communication. We implemented the proposed framework as an extension of the OmniRPC system, which is a Grid RPC system for parallel programming. We designed a general interface to easily adapt the OmniRPC system to various Grid-enabled job scheduling systems, including XtremWeb, CyberGRIP, Condor and Grid Engine. We show the preliminary performance of these implementations using a phylogenetic application. We found that the proposed system can achieve approximately the same performance as OmniRPC and can handle interruptions in worker programs on remote nodes.

[1]  Mitsuhisa Sato,et al.  OmniRPC: A Grid RPC Facility for Cluster and Global Computing in OpenMP , 2001, WOMPAT.

[2]  Mitsuhisa Sato,et al.  Implementation and performance evaluation of CONFLEX-G: grid-enabled molecular conformational space search program with OmniRPC , 2004, ICS '04.

[3]  Yoshio Tanaka,et al.  Design, implementation and performance evaluation of GridRPC programming middleware for a large-scale computational grid , 2004, Fifth IEEE/ACM International Workshop on Grid Computing.

[4]  Mitsuhisa Sato,et al.  Ninf: A Network Based Information Library for Global World-Wide Computing Infrastructure , 1997, HPCN Europe.

[5]  David Abramson,et al.  A Computational Economy for Grid Computing and its Implementation in the Nimrod-G Resource Brok , 2001, Future Gener. Comput. Syst..

[6]  Ian T. Foster,et al.  Condor-G: A Computation Management Agent for Multi-Institutional Grids , 2004, Cluster Computing.

[7]  Satoshi Matsuoka,et al.  Ninf-G: A Reference Implementation of RPC-based Programming Middleware for Grid Computing , 2003, Journal of Grid Computing.

[8]  Mitsuhisa Sato,et al.  Performance evaluation of OmniRPC in a grid environment , 2004, 2004 International Symposium on Applications and the Internet Workshops. 2004 Workshops..

[9]  Henri Casanova,et al.  Overview of GridRPC: A Remote Procedure Call API for Grid Computing , 2002, GRID.

[10]  Ziheng Yang,et al.  PAML: a program package for phylogenetic analysis by maximum likelihood , 1997, Comput. Appl. Biosci..

[11]  Jeff T. Linderoth,et al.  An enabling framework for master-worker applications on the Computational Grid , 2000, Proceedings the Ninth International Symposium on High-Performance Distributed Computing.

[12]  Mitsuhisa Sato,et al.  OmniRPC: a grid RPC system for parallel programming in cluster and grid environment , 2003, CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings..

[13]  Y. Tanaka,et al.  The design and implementation of a fault-tolerant RPC system: Ninf-C , 2004, Proceedings. Seventh International Conference on High Performance Computing and Grid in Asia Pacific Region, 2004..

[14]  Miron Livny,et al.  Condor-a hunter of idle workstations , 1988, [1988] Proceedings. The 8th International Conference on Distributed.

[15]  Frédéric Desprez DIET: Building Problem Solving Environments for the Grid , 2004, HiPC.

[16]  Jack Dongarra,et al.  Users' Guide to NetSolve v1.4.1 , 2002 .

[17]  Tomonori Yamashita,et al.  Development of grid middleware CyberGRIP and its applications , 2005, First International Conference on e-Science and Grid Computing (e-Science'05).

[18]  Samir Djilali P2P-RPC: programming scientific applications on peer-to-peer systems with remote procedure call , 2003, CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings..

[19]  Ian T. Foster,et al.  Globus: a Metacomputing Infrastructure Toolkit , 1997, Int. J. High Perform. Comput. Appl..

[20]  Gilles Fedak,et al.  XtremWeb: Building an Experimental Platform for Global Computing , 2000, GRID.

[21]  Wolfgang Gentzsch,et al.  Sun Grid Engine: towards creating a compute power grid , 2001, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid.

[22]  Gilles Fedak,et al.  XtremWeb: a generic global computing system , 2001, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid.

[23]  Thomas Hérault,et al.  RPC-V: Toward Fault-Tolerant RPC for Internet Connected Desktop Grids with Volatile Nodes , 2004, Proceedings of the ACM/IEEE SC2004 Conference.