论文信息 - MDTM: Optimizing Data Transfer Using Multicore-Aware I/O Scheduling

MDTM: Optimizing Data Transfer Using Multicore-Aware I/O Scheduling

Bulk data transfer is facing significant challenges in the coming era of big data. There are multiple performance bottlenecks along the end-to-end path from the source to destination storage system. The limitations of current generation data transfer tools themselves can have a significant impact on end-to-end data transfer rates. In this paper, we identify the issues that lead to underperformance of these tools, and present a new data transfer tool with an innovative I/O scheduler called MDTM. The MDTM scheduler exploits underlying multicore layouts to optimize throughput by reducing delay and contention for I/O reading and writing operations. With our evaluations, we show how MDTM successfully avoids NUMA-based congestion and significantly improves end-to-end data transfer rates across high-speed wide area networks.

Bockjoo Kim | Liang Zhang | Wenji Wu | Phil DeMar

[1] Janardhan R. Iyengar,et al. Concurrent Multipath Transfer Using SCTP Multihoming Over Independent End-to-End Paths , 2006, IEEE/ACM Transactions on Networking.

[2] Simon Fraser. User-level scheduling on NUMA multicore systems under Linux , 2011 .

[3] Eli Dart. Fusion Energy Sciences Network Requirements , 2014 .

[4] Eli Dart,et al. Basic Energy Sciences Network Requirements Review - Final Report , 2015 .

[5] Krishna R. Pattipati,et al. Application-layer multipath data transfer via TCP: Schemes and performance tradeoffs , 2007, Perform. Evaluation.

[6] D. Martin Swany,et al. Exploiting Network Parallelism for Improving Data Transfer Performance , 2012, 2012 SC Companion: High Performance Computing, Networking Storage and Analysis.

[7] R. Srikant,et al. Multi-Path TCP: A Joint Congestion Control and Routing Scheme to Exploit Path Diversity in the Internet , 2006, IEEE/ACM Transactions on Networking.

[8] Alexandra Fedorova,et al. Addressing shared resource contention in multicore processors via scheduling , 2010, ASPLOS XV.

[9] Brice Goglin,et al. Impact of NUMA effects on high-speed networking with multi-opteron machines , 2007, PDCS 2007.

[10] Josep Torrellas. Architectures for Extreme-Scale Computing , 2009, Computer.

[11] William E. Allcock,et al. The Globus Striped GridFTP Framework and Server , 2005, ACM/IEEE SC 2005 Conference (SC'05).

[12] Steven Tuecke,et al. GridFTP: Protocol Extensions to FTP for the Grid , 2001 .

[13] Angelos Bilas,et al. NUMA Implications for Storage I / O Throughput in Modern Servers , 2012 .

[14] Liang Zhang,et al. mdtmFTP and its evaluation on ESNET SDN testbed , 2018, Future Gener. Comput. Syst..

[15] David M. Asner. Belle-II Experiment Network Requirements , 2014 .

[16] Wenji Wu,et al. Why Can Some Advanced Ethernet NICs Cause Packet Reordering? , 2011, IEEE Communications Letters.

[17] Ian Foster,et al. GridFTP Pipelining , 2007 .

[18] Michael Stumm,et al. Thread clustering: sharing-aware scheduling on SMP-CMP-SMT multiprocessors , 2007, EuroSys '07.