Fine-grained load balancing with traffic-aware rerouting in datacenter networks

Modern datacenters provide a wide variety of application services, which generate a mix of delay-sensitive short flows and throughput-oriented long flows, transmitting in the multi-path datacenter network. Though the existing load balancing designs successfully make full use of available parallel paths and attain high bisection network bandwidth, they reroute flows regardless of their dissimilar performance requirements. The short flows suffer from the problems of large queuing delay and packet reordering, while the long flows fail to obtain high throughput due to low link utilization and packet reordering. To address these inefficiency, we design a fine-grained load balancing scheme, namely TR (Traffic-aware Rerouting), which identifies flow types and executes flexible and traffic-aware rerouting to balance the performances of both short and long flows. Besides, to avoid packet reordering, TR leverages the reverse ACKs to estimate the switch-to-switch delay, thus excluding paths that potentially cause packet reordering. Moreover, TR is only deployed on the switch without any modification on end-hosts. The experimental results of large-scale NS2 simulations show that TR reduces the average and tail flow completion time for short flows by up to 60% and 80%, as well as provides up to 3.02x gain in throughput of long flows compared to the state-of-the-art load balancing schemes.

[1]  Nick McKeown,et al.  Reproducible network experiments using container-based emulation , 2012, CoNEXT '12.

[2]  Amin Vahdat,et al.  Annulus: A Dual Congestion Control Loop for Datacenter and WAN Traffic Aggregates , 2020, SIGCOMM.

[3]  George Varghese,et al.  CONGA: distributed congestion-aware load balancing for datacenters , 2015, SIGCOMM.

[4]  Keqin Li,et al.  Minimizing SLA violation and power consumption in Cloud data centers using adaptive energy-aware algorithms , 2017, Future Gener. Comput. Syst..

[5]  Vimalkumar Jeyakumar,et al.  Juggler: a practical reordering resilient network stack for datacenters , 2016, EuroSys.

[6]  Dongsu Han,et al.  Credit-Scheduled Delay-Bounded Congestion Control for Datacenters , 2017, SIGCOMM.

[7]  George Varghese,et al.  P4: programming protocol-independent packet processors , 2013, CCRV.

[8]  Keqin Li,et al.  D-SRTF: Distributed Shortest Remaining Time First Scheduling for Data Center Networks , 2018, IEEE Transactions on Cloud Computing.

[9]  Brighten Godfrey,et al.  Finishing flows quickly with preemptive scheduling , 2012, CCRV.

[10]  Nick McKeown,et al.  Virtualized Congestion Control , 2016, SIGCOMM.

[11]  Changhyun Lee,et al.  DX: Latency-Based Congestion Control for Datacenters , 2017, IEEE/ACM Transactions on Networking.

[12]  Hong Xu,et al.  Luopan: Sampling-Based Load Balancing in Data Center Networks , 2019, IEEE Transactions on Parallel and Distributed Systems.

[13]  David A. Maltz,et al.  Data center TCP (DCTCP) , 2010, SIGCOMM 2010.

[14]  Amin Vahdat,et al.  TIMELY: RTT-based Congestion Control for the Datacenter , 2015, Comput. Commun. Rev..

[15]  Brighten Godfrey,et al.  DRILL: Micro Load Balancing for Low-latency Data Center Networks , 2017, SIGCOMM.

[16]  Keqiang He,et al.  AC/DC TCP: Virtual Congestion Control Enforcement for Datacenter Networks , 2016, SIGCOMM.

[17]  Lu Wang,et al.  A cost-effective low-latency overlaid torus-based data center network architecture , 2018, Comput. Commun..

[18]  Haitao Wu,et al.  Per-packet load-balanced, low-latency routing for clos-based data center networks , 2013, CoNEXT.

[19]  David A. Maltz,et al.  Network traffic characteristics of data centers in the wild , 2010, IMC '10.

[20]  Roy Friedman,et al.  Nitrosketch: robust and general sketch-based monitoring in software switches , 2019, SIGCOMM.

[21]  Yi Wang,et al.  Aeolus: A Building Block for Proactive Transport in Datacenters , 2020, SIGCOMM.

[22]  Christian E. Hopps,et al.  Analysis of an Equal-Cost Multi-Path Algorithm , 2000, RFC.

[23]  Hong Zhang,et al.  Resilient Datacenter Load Balancing in the Wild , 2017, SIGCOMM.

[24]  D. Zats,et al.  DeTail: reducing the flow completion time tail in datacenter networks , 2012, CCRV.

[25]  Kai Chen,et al.  Scheduling Mix-flows in Commodity Datacenters with Karuna , 2016, SIGCOMM.

[26]  Keqiang He,et al.  Presto: Edge-based Load Balancing for Fast Datacenter Networks , 2015, Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication.

[27]  M. Handley,et al.  Improving datacenter performance and robustness with multipath TCP , 2011, SIGCOMM.

[28]  Brighten Godfrey,et al.  Veriflow: verifying network-wide invariants in real time , 2012, CCRV.

[29]  Tao Zhang,et al.  Rethinking Fast and Friendly Transport in Data Center Networks , 2020, IEEE/ACM Transactions on Networking.