FOBS: A Lightweight Communication Protocol for Grid Computing

In this paper, we discuss our work on developing an efficient, lightweight application-level communication protocol for the high-bandwidth, high-delay network environments typical of computational grids. The goal of this research is to provide congestion-control algorithms that allow the protocol to obtain a large percentage of the underlying bandwidth when it is available, and to be responsive (and eventually proactive) to developing contention for system resources. Towards this end, we develop and evaluate two application-level congestion-control algorithms, one of which incorporates historical knowledge and one that only uses current information. We compare the performance of these two algorithms with respect to each other and with respect to TCP.

[1]  Vern Paxson,et al.  TCP Congestion Control , 1999, RFC.

[2]  Andrew S. Grimshaw,et al.  Legion-a view from 50,000 feet , 1996, Proceedings of 5th IEEE International Symposium on High Performance Distributed Computing.

[3]  Ian T. Foster,et al.  Secure, Efficient Data Transport and Replica Management for High-Performance Data-Intensive Computing , 2001, 2001 Eighteenth IEEE Symposium on Mass Storage Systems and Technologies.

[4]  Francine Berman,et al.  Heuristics for scheduling parameter sweep applications in grid environments , 2000, Proceedings 9th Heterogeneous Computing Workshop (HCW 2000) (Cat. No.PR00556).

[5]  Brian D. Noble,et al.  The Effects of Systemic Packet Loss on Aggregate TCP Flows , 2002, ACM/IEEE SC 2002 Conference (SC'02).

[6]  Jason Leigh,et al.  Reliable Blast UDP : predictable high performance bulk data transfer , 2002, Proceedings. IEEE International Conference on Cluster Computing.

[7]  B. Harrison Las Vegas, Nevada , 2002 .

[8]  F. Berman,et al.  Adaptive Performance Prediction for Distributed Data-Intensive Applications , 1999, ACM/IEEE SC 1999 Conference (SC'99).

[9]  Van Jacobson,et al.  TCP Extensions for High Performance , 1992, RFC.

[10]  Francine Berman,et al.  The GrADS Project: Software Support for High-Level Grid Application Development , 2001, Int. J. High Perform. Comput. Appl..

[11]  Francine Berman,et al.  Using Apples to Schedule Simple SARA on the Computational Grid , 1999, Int. J. High Perform. Comput. Appl..

[12]  Robert L. Grossman,et al.  PSockets: The Case for Application-level Network Striping for Data Intensive Applications using High Speed Wide Area Networks , 2000, ACM/IEEE SC 2000 Conference (SC'00).

[13]  Robert L. Grossman,et al.  Simple Available Bandwidth Utilization Library for High-Speed Wide Area Networks , 2005, The Journal of Supercomputing.

[14]  Henri Casanova,et al.  A decoupled scheduling approach for Grid application development environments , 2003, J. Parallel Distributed Comput..

[15]  Henri Casanova,et al.  A Modular Scheduling Approach for Grid Application Development Environments , 2002 .

[16]  William Gropp,et al.  High performance wide area data transfers over high performance networks , 2002, Proceedings 16th International Parallel and Distributed Processing Symposium.

[17]  William Gropp,et al.  An evaluation of object-based data transfers on high performance networks , 2002, Proceedings 11th IEEE International Symposium on High Performance Distributed Computing.

[18]  Francine Berman,et al.  Application-Level Scheduling on Distributed Heterogeneous Networks , 1996, Proceedings of the 1996 ACM/IEEE Conference on Supercomputing.

[19]  Wu-chun Feng,et al.  The Failure of TCP in High-Performance Computational Grids , 2000, ACM/IEEE SC 2000 Conference (SC'00).

[20]  Francine Berman,et al.  Toward a framework for preparing and executing adaptive grid programs , 2002, Proceedings 16th International Parallel and Distributed Processing Symposium.

[21]  Rajeev Thakur,et al.  An evaluation of Java's I/O capabilities for high-performance computing , 2000, JAVA '00.

[22]  Rashid Ansari,et al.  Adaptive Networking for Tele-Immersion , 2001, EGVE/IPT.

[23]  Sally Floyd,et al.  2 What ’ s the Problem ? 2 . 1 Basics TCP uses the following algorithm to adjust its congestion window , 2002 .

[24]  Mark Allman,et al.  An Application-Level solution to TCP''s Satellite Inefficiencies , 1996 .

[25]  Francine Berman,et al.  Application-aware scheduling of a magnetohydrodynamics application in the Legion metasystem , 2000, Proceedings 9th Heterogeneous Computing Workshop (HCW 2000) (Cat. No.PR00556).

[26]  Francine Berman,et al.  Scheduling from the perspective of the application , 1996, Proceedings of 5th IEEE International Symposium on High Performance Distributed Computing.

[27]  Sally Floyd,et al.  TCP Selective Acknowledgement Options , 1996 .