GridOS: Operating System Services for Grid Architectures

In this work, we demonstrate the power of providing a common set of operating system services to Grid Architectures, including high-performance I/O, communication, resource management, and process management. In the last few years, a number of exciting projects like Globus, Legion, and UNICORE developed the software infrastructure needed for grid computing. However, operating system support for grid computing is minimal or non-existent. Tool writers are forced to re-invent the wheel by implementing from scratch. This is error prone and often results in sub-optimal solutions. To address these problems, we are developing GridOS, a set of operating system services that facilitate grid computing. The services are designed to make writing middleware easier and make a normal commodity operating system like Linux highly suitable for grid computing. The modules are designed to be policy neutral, exploit commonality in various grid infrastructures and provide high-performance. Experiments with GridOS verify that there is dramatic improvement in performance when compared to the existing grid file transfer protocols like GridFTP. Our proof-of-concept middleware shows that writing middleware is easy using GridOS.

[1]  Sudharshan S. Vazhkudai,et al.  PODOS -- The design and implementation of a performance oriented Linux cluster , 2002, Future Gener. Comput. Syst..

[2]  Matthew Mathis,et al.  Automatic TCP buffer tuning , 1998, SIGCOMM '98.

[3]  Willy Zwaenepoel,et al.  Flash: An efficient and portable Web server , 1999, USENIX Annual Technical Conference, General Track.

[4]  Leonard Kleinrock,et al.  Queueing Systems: Volume I-Theory , 1975 .

[5]  Ian T. Foster,et al.  Data management and transfer in high-performance computational grid environments , 2002, Parallel Comput..

[6]  W. Richard Stevens,et al.  TCP/IP Illustrated, Volume 1: The Protocols , 1994 .

[7]  Ian T. Foster,et al.  GASS: a data movement and access service for wide area computing systems , 1999, IOPADS '99.

[8]  Paul Avery,et al.  The griphyn project: towards petascale virtual data grids , 2001 .

[9]  Warren Smith,et al.  A Resource Management Architecture for Metacomputing Systems , 1998, JSSPP.

[10]  Ian T. Foster,et al.  Globus: a Metacomputing Infrastructure Toolkit , 1997, Int. J. High Perform. Comput. Appl..

[11]  Message Passing Interface Forum MPI: A message - passing interface standard , 1994 .

[12]  Jason Lee,et al.  A network-aware distributed storage cache for data intensive environments , 1999, Proceedings. The Eighth International Symposium on High Performance Distributed Computing (Cat. No.99TH8469).

[13]  David E. Culler,et al.  WebOS: operating system services for wide area applications , 1998, Proceedings. The Seventh International Symposium on High Performance Distributed Computing (Cat. No.98TB100244).

[14]  Valentina Huber UNICORE: A Grid Computing Environment for Distributed and Parallel Computing , 2001, PaCT.

[15]  Andrew S. Grimshaw,et al.  The Legion vision of a worldwide virtual computer , 1997, Commun. ACM.

[16]  Miron Livny,et al.  Condor-a hunter of idle workstations , 1988, [1988] Proceedings. The 8th International Conference on Distributed.

[17]  P. Tobin Maginnis Design considerations for the transformation of MINIX into a distributed operating system , 1988, CSC '88.

[18]  Andrew S. Tanenbaum,et al.  An overview of the Amoeba distributed operating system , 1981, OPSR.

[19]  Andrew R. Cherenson,et al.  The Sprite network operating system , 1988, Computer.