End-to-End Data-Flow Parallelism for Throughput Optimization in High-Speed Networks

The increase in the data produced by large-scale scientific applications necessitates innovative solutions for efficient transfer of data. Although the current optical networking technology reached theoretical speeds of 100 Gbps, applications still suffer from inefficient transport protocols and bottlenecks on the end-systems (e.g. disk, CPU, NIC). High-performance systems provide us with parallel disks, processors and network interfaces. However the lack of orchestration of these end-system resources with the available network capacity results in underutilization of the network bandwidth. In this study, a model and two algorithms that use ‘end-to-end data-flow parallelism’ to optimize the use of network and end-system resources are proposed. This is achieved by using multiple parallel streams over the network; and multiple parallel disks and CPUs at the end systems. Our model predicts the optimal number of streams and disk/CPU stripes that maximizes the data transfer speed for any setting. Our algorithms use GridFTP parallel samplings and calculate the optimal level of parallelism based on our prediction model. The experiments conducted by using actual GridFTP transfers show that the predictions performed by our model and algorithms provide close-to-optimal performances with negligible overhead and use minimal number of resources. The end-to-end data transfer throughput is improved dramatically in existence of end-system bottlenecks compared to the non-optimized transfers.

[1]  Peter A. Dinda,et al.  Modeling and taming parallel TCP on the wide area network , 2005, 19th IEEE International Parallel and Distributed Processing Symposium.

[2]  Paul Nowoczynski,et al.  DMOVER: Parallel Data Migration for Mainstream Users , 2005 .

[3]  Brian D. Noble,et al.  The end-to-end performance effects of parallel TCP sockets on a lossy wide-area network , 2002, Proceedings 16th International Parallel and Distributed Processing Symposium.

[4]  Wu-chun Feng,et al.  Dynamic right-sizing: a simulation study , 2001, Proceedings Tenth International Conference on Computer Communications and Networks (Cat. No.01EX495).

[5]  Frank B. Schmuck,et al.  GPFS: A Shared-Disk File System for Large Computing Clusters , 2002, FAST.

[6]  Ian T. Foster,et al.  A data transfer framework for large-scale science experiments , 2010, HPDC '10.

[7]  Fernando Paganini,et al.  FAST TCP: from theory to experiments , 2005, IEEE Network.

[8]  Sally Floyd,et al.  HighSpeed TCP for Large Congestion Windows , 2003, RFC.

[9]  Eitan Altman,et al.  Parallel TCP Sockets: Simple Model, Throughput and Validation , 2006, Proceedings IEEE INFOCOM 2006. 25TH IEEE International Conference on Computer Communications.

[10]  Jeffrey S. Chase,et al.  End system optimizations for high-speed TCP , 2001, IEEE Commun. Mag..

[11]  Masayuki Murata,et al.  Scalable socket buffer tuning for high-performance Web servers , 2001, Proceedings Ninth International Conference on Network Protocols. ICNP 2001.

[12]  Matthew Mathis,et al.  Automatic TCP buffer tuning , 1998, SIGCOMM '98.

[13]  Wu-chun Feng,et al.  Dynamic right-sizing in TCP : a simulation study / , 2001 .

[14]  William E. Allcock,et al.  The Globus Striped GridFTP Framework and Server , 2005, ACM/IEEE SC 2005 Conference (SC'05).

[15]  Manish Jain,et al.  The tcp bandwidth-delay product revisited: network buffering , 2003 .

[16]  Manish Jain,et al.  Socket Buffer Auto-Sizing for High-Performance Data Transfers , 2003, Journal of Grid Computing.

[17]  Robert B. Ross,et al.  Using MPI-2: Advanced Features of the Message Passing Interface , 2003, CLUSTER.

[18]  Himabindu Pucha,et al.  Adaptive File Transfers for Diverse Environments , 2008, USENIX Annual Technical Conference.

[19]  Miron Livny,et al.  Run-time Adaptation of Grid Data Placement Jobs , 2005, Scalable Comput. Pract. Exp..

[20]  Tevfik Kosar,et al.  Prediction of Optimal Parallelism Level in Wide Area Data Transfers , 2011, IEEE Transactions on Parallel and Distributed Systems.

[21]  Ravindra K. Ahuja,et al.  Network Flows , 2011 .

[22]  Jon Crowcroft,et al.  Differentiated end-to-end Internet services using a weighted proportional fair sharing TCP , 1998, CCRV.