On parameter tuning of data transfer protocol GridFTP for wide-area grid computing

In wide-area grid computing, geographically distributed computational resources are connected for enabling efficient and large-scale scientific/engineering computations. In the wide-area grid computing, a data transfer protocol called GridFTP has been commonly used for large file transfers. GridFTP has the following features for solving problems of the existing TCP. First, for accelerating the start-up in TCP's slow start phase and achieving high throughput in TCP's congestion avoidance phase, multiple TCP connections can be established in parallel. Second, according to the bandwidth-delay product of a network, the TCP socket buffer size can be negotiated between GridFTP server and client. However, in the literature, sufficient investigation has not been performed either on the optimal number of TCP connections or the optimal TCP socket buffer size. In this paper, we therefore quantitatively investigate the optimal parameter configuration of GridFTP in terms of the number of TCP connections and the TCP socket buffer size. We first derive performance metrics of GridFTP in steady state (i.e., goodput and packet loss probability). We then derive the optimal parameter configuration for GridFTP and quantitatively show performance limitations of GridFTP through several numerical examples. We also demonstrate validity of our approximate analysis by comparing simulation results with analytic ones

[1]  J. Postel,et al.  File transfer protocol (FTP) , 1985 .

[2]  Matthew Mathis,et al.  Automatic TCP buffer tuning , 1998, SIGCOMM '98.

[3]  Brian Tierney,et al.  A TCP Tuning Daemon , 2002, ACM/IEEE SC 2002 Conference (SC'02).

[4]  Brian D. Noble,et al.  The end-to-end performance effects of parallel TCP sockets on a lossy wide-area network , 2002, Proceedings 16th International Parallel and Distributed Processing Symposium.

[5]  Robert Elz,et al.  Feature negotiation mechanism for the File Transfer Protocol , 1998, RFC.

[6]  Yin Zhang,et al.  On individual and aggregate TCP performance , 1999, Proceedings. Seventh International Conference on Network Protocols.

[7]  Hiroyuki Ohsaki,et al.  On scalable modeling of TCP congestion control mechanism for large-scale IP networks , 2005, The 2005 Symposium on Applications and the Internet.

[8]  Peter A. Dinda,et al.  Modeling and taming parallel TCP on the wide area network , 2005, 19th IEEE International Parallel and Distributed Processing Symposium.

[9]  Steven Tuecke,et al.  GridFTP: Protocol Extensions to FTP for the Grid , 2001 .

[10]  Jon Postel,et al.  Transmission Control Protocol , 1981, RFC.

[11]  Hiroyuki Ohsaki,et al.  Automatic parameter configuration mechanism for data transfer protocol GridFTP , 2006, International Symposium on Applications and the Internet (SAINT'06).

[12]  R. Elz FTP security extensions , 1997 .

[13]  Sally Floyd,et al.  HighSpeed TCP for Large Congestion Windows , 2003, RFC.

[14]  Robert L. Grossman,et al.  PSockets: The Case for Application-level Network Striping for Data Intensive Applications using High Speed Wide Area Networks , 2000, ACM/IEEE SC 2000 Conference (SC'00).

[15]  Fernando Paganini,et al.  Internet congestion control , 2002 .

[16]  Donald F. Towsley,et al.  Modeling TCP throughput: a simple model and its empirical validation , 1998, SIGCOMM '98.

[17]  Wu-chun Feng,et al.  Optimizing GridFTP through dynamic right-sizing , 2003, High Performance Distributed Computing, 2003. Proceedings. 12th IEEE International Symposium on.