Network-aware end-to-end data throughput optimization

The rapidly advancing optical networking technology allows us high-bandwidth connectivity up to 100Gbps these days. However, the end-users and their applications can only observe a fraction of this available bandwidth capacity due to inefficient transport protocols and other end-system bottlenecks such as disk I/O limitations, processor speed, and NIC restrictions. In this paper, we present a novel network-aware end-to-end throughput prediction and optimization framework which provides us with the best parameter combination (i.e. parallel stream, disk, and CPU numbers) to achieve the highest end-to-end throughput between two end-systems (i.e. clusters, data centers, parallel disk systems) possible. Our experiments show that the model and algorithm we have developed enable us to achieve close-to-optimal end-to-end throughput performance with negligible sampling and prediction overhead.

[1]  Peter A. Dinda,et al.  Modeling and taming parallel TCP on the wide area network , 2005, 19th IEEE International Parallel and Distributed Processing Symposium.

[2]  Ian T. Foster,et al.  A data transfer framework for large-scale science experiments , 2010, HPDC '10.

[3]  Sally Floyd,et al.  HighSpeed TCP for Large Congestion Windows , 2003, RFC.

[4]  Eitan Altman,et al.  Parallel TCP Sockets: Simple Model, Throughput and Validation , 2006, Proceedings IEEE INFOCOM 2006. 25TH IEEE International Conference on Computer Communications.

[5]  Robert B. Ross,et al.  Using MPI-2: Advanced Features of the Message Passing Interface , 2003, CLUSTER.

[6]  Masayuki Murata,et al.  Scalable socket buffer tuning for high-performance Web servers , 2001, Proceedings Ninth International Conference on Network Protocols. ICNP 2001.

[7]  Matthew Mathis,et al.  Automatic TCP buffer tuning , 1998, SIGCOMM '98.

[8]  Wu-chun Feng,et al.  Dynamic right-sizing: a simulation study , 2001, Proceedings Tenth International Conference on Computer Communications and Networks (Cat. No.01EX495).

[9]  Paul Nowoczynski,et al.  DMOVER: Parallel Data Migration for Mainstream Users , 2005 .

[10]  Ravindra K. Ahuja,et al.  Network Flows , 2011 .

[11]  Jon Crowcroft,et al.  Differentiated end-to-end Internet services using a weighted proportional fair sharing TCP , 1998, CCRV.

[12]  Brian D. Noble,et al.  The end-to-end performance effects of parallel TCP sockets on a lossy wide-area network , 2002, Proceedings 16th International Parallel and Distributed Processing Symposium.

[13]  William E. Allcock,et al.  The Globus Striped GridFTP Framework and Server , 2005, ACM/IEEE SC 2005 Conference (SC'05).

[14]  Manish Jain,et al.  The tcp bandwidth-delay product revisited: network buffering , 2003 .

[15]  Manish Jain,et al.  Socket Buffer Auto-Sizing for High-Performance Data Transfers , 2003, Journal of Grid Computing.

[16]  Fernando Paganini,et al.  FAST TCP: from theory to experiments , 2005, IEEE Network.

[17]  Miron Livny,et al.  Run-time Adaptation of Grid Data Placement Jobs , 2005, Scalable Comput. Pract. Exp..

[18]  Tevfik Kosar,et al.  Prediction of Optimal Parallelism Level in Wide Area Data Transfers , 2011, IEEE Transactions on Parallel and Distributed Systems.