Wide-area Lustre file system using LNet routers

Scientific and big data computations are increasingly distributed across wide-area networks, and they often require access to remote files. File systems mounted directly over wide-area networks support such computations transparently and remove the need for special-purpose file transfer tools. In typical distributed file systems, access is limited to local sites; in particular, the reach of a Lustre file system implemented over InfiniBand (IB) is limited to at most tens of miles by a 2.5 ms latency bound. We describe LNet router methods that connect an IB Lustre file system to remote Ethernet clients over wide-area networks. We collect extensive Lustre throughput measurements over 10 Gbps connections with round-trip times of 0 to 366 ms. These measurements demonstrate that Gbps throughput can be sustained over connections spanning the globe. We present Lustre throughput profiles over local and wide-area connections that show the effects of various buffers and credits; in particular, they highlight the throughput limits for large transfers over wide-area connections. Furthermore, the measurements show that pipelining has a positive effect: successive file transfers achieve higher throughput than the rates indicated by the IOzone benchmark.
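The dependence of throughput on buffers, credits, and round-trip time can be illustrated with the standard window-limited bound: if at most a fixed number of bytes may be outstanding at once, throughput cannot exceed that amount divided by the round-trip time. The sketch below is only an illustration of that general bound, not the model or configuration used in the measurements. The function name, the credit count, and the 1 MiB message size are hypothetical values chosen for the example; only the 10 Gbps link rate and the 0 and 366 ms endpoints come from the text above.

```python
MIB = 1024 * 1024


def window_limited_throughput_gbps(link_gbps, rtt_ms, in_flight_bytes):
    """Upper bound on throughput (Gbps) when at most in_flight_bytes
    can be outstanding (unacknowledged) at any time."""
    if rtt_ms <= 0:
        # With no propagation delay the window never stalls the sender,
        # so the link rate is the only limit in this simple model.
        return link_gbps
    window_bits = in_flight_bytes * 8
    bound_gbps = window_bits / (rtt_ms / 1000.0) / 1e9
    return min(link_gbps, bound_gbps)


if __name__ == "__main__":
    # Hypothetical setting: 8 credits, each covering one 1 MiB message.
    in_flight = 8 * MIB
    # 0 and 366 ms are the endpoints quoted in the abstract; the
    # intermediate round-trip times are illustrative only.
    for rtt_ms in (0, 1, 10, 100, 366):
        bound = window_limited_throughput_gbps(10.0, rtt_ms, in_flight)
        print(f"RTT {rtt_ms:>3} ms: at most {bound:.3f} Gbps")
```

Under these assumed values the bound stays at the link rate for short round-trip times and falls well below it at the longest ones, which is the qualitative reason larger buffers, more credits, or pipelining of successive transfers matter over wide-area connections.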
