MDTM: Optimizing Data Transfer Using Multicore-Aware I/O Scheduling

Bulk data transfer is facing significant challenges in the coming era of big data. There are multiple performance bottlenecks along the end-to-end path from the source to destination storage system. The limitations of current generation data transfer tools themselves can have a significant impact on end-to-end data transfer rates. In this paper, we identify the issues that lead to underperformance of these tools, and present a new data transfer tool with an innovative I/O scheduler called MDTM. The MDTM scheduler exploits underlying multicore layouts to optimize throughput by reducing delay and contention for I/O reading and writing operations. With our evaluations, we show how MDTM successfully avoids NUMA-based congestion and significantly improves end-to-end data transfer rates across high-speed wide area networks.

[1]  Janardhan R. Iyengar,et al.  Concurrent Multipath Transfer Using SCTP Multihoming Over Independent End-to-End Paths , 2006, IEEE/ACM Transactions on Networking.

[2]  Simon Fraser User-level scheduling on NUMA multicore systems under Linux , 2011 .

[3]  Eli Dart Fusion Energy Sciences Network Requirements , 2014 .

[4]  Eli Dart,et al.  Basic Energy Sciences Network Requirements Review - Final Report , 2015 .

[5]  Krishna R. Pattipati,et al.  Application-layer multipath data transfer via TCP: Schemes and performance tradeoffs , 2007, Perform. Evaluation.

[6]  D. Martin Swany,et al.  Exploiting Network Parallelism for Improving Data Transfer Performance , 2012, 2012 SC Companion: High Performance Computing, Networking Storage and Analysis.

[7]  R. Srikant,et al.  Multi-Path TCP: A Joint Congestion Control and Routing Scheme to Exploit Path Diversity in the Internet , 2006, IEEE/ACM Transactions on Networking.

[8]  Alexandra Fedorova,et al.  Addressing shared resource contention in multicore processors via scheduling , 2010, ASPLOS XV.

[9]  Brice Goglin,et al.  Impact of NUMA effects on high-speed networking with multi-opteron machines , 2007, PDCS 2007.

[10]  Josep Torrellas Architectures for Extreme-Scale Computing , 2009, Computer.

[11]  William E. Allcock,et al.  The Globus Striped GridFTP Framework and Server , 2005, ACM/IEEE SC 2005 Conference (SC'05).

[12]  Steven Tuecke,et al.  GridFTP: Protocol Extensions to FTP for the Grid , 2001 .

[13]  Angelos Bilas,et al.  NUMA Implications for Storage I / O Throughput in Modern Servers , 2012 .

[14]  Liang Zhang,et al.  mdtmFTP and its evaluation on ESNET SDN testbed , 2018, Future Gener. Comput. Syst..

[15]  David M. Asner Belle-II Experiment Network Requirements , 2014 .

[16]  Wenji Wu,et al.  Why Can Some Advanced Ethernet NICs Cause Packet Reordering? , 2011, IEEE Communications Letters.

[17]  Ian Foster,et al.  GridFTP Pipelining , 2007 .

[18]  Michael Stumm,et al.  Thread clustering: sharing-aware scheduling on SMP-CMP-SMT multiprocessors , 2007, EuroSys '07.