Achieving high utilization of flowlet-based load balancing in data center networks

Abstract Modern data center networks provide multiple paths between any host pairs. Load balancing traffic across these paths is critical to achieve low latency and high throughput. Despite prior solutions show that flowlet-based solutions are powerful in achieving load balancing in asymmetric topology, they suffer from the congestion mismatch problem in rerouting flowlets across different paths. That is, due to lacking the explicit congestion feedback, existing flowlet-based solutions are difficult to utilize bandwidth resource efficiently, which may result in low link utilization and packet loss. In this paper, we propose a congestion-aware load balancing scheme named CAF to eliminate the problem of congestion mismatch. The basic idea behind CAF is that whenever the sender perceives flowlet switching, it proactively sends probe packets to measure available bandwidth and uses the measurement result to properly set the congestion window, avoiding the unnecessary under-utilization and packet loss. Through a series of large-scale NS2 simulations and testbed experiments, we demonstrate that CAF reduces average flow completion time by up to 86% compared with the state-of-the-art mechanisms.

[1]  Carl M. Harris,et al.  Fundamentals of queueing theory , 1975 .

[2]  Dezun Dong,et al.  Congestion control in high-speed lossless data center networks: A survey , 2018, Future Gener. Comput. Syst..

[3]  Tao Zhang,et al.  Tuning the Aggressive TCP Behavior for Highly Concurrent HTTP Connections in Intra-Datacenter , 2017, IEEE/ACM Transactions on Networking.

[4]  Peter Steenkiste,et al.  Evaluation and characterization of available bandwidth probing techniques , 2003, IEEE J. Sel. Areas Commun..

[5]  Wenjun Lv,et al.  CAPS: Coding-Based Adaptive Packet Spraying to Reduce Flow Completion Time in Data Center , 2019, IEEE/ACM Transactions on Networking.

[6]  Meikang Qiu,et al.  A locality-aware shuffle optimization on fat-tree data centers , 2018, Future Gener. Comput. Syst..

[7]  Jiao Zhang,et al.  Flow distribution-aware load balancing for the datacenter , 2017, Comput. Commun..

[8]  Tianrong Zhang,et al.  Packet-Scale Congestion Control Paradigm , 2017, IEEE/ACM Transactions on Networking.

[9]  Jianxin Wang,et al.  Adjusting Packet Size to Mitigate TCP Incast in Data Center Networks with COTS Switches , 2020, IEEE Transactions on Cloud Computing.

[10]  Jun Li,et al.  A survey on TCP Incast in data center networks , 2014, Int. J. Commun. Syst..

[11]  Tao Zhang,et al.  Adaptive marking threshold method for delay-sensitive TCP in data center network , 2016, J. Netw. Comput. Appl..

[12]  Jianxin Wang,et al.  Task-Aware TCP in Data Center Networks , 2019, IEEE/ACM Transactions on Networking.

[13]  Jianxin Wang,et al.  Receiver-driven fair congestion control for TCP outcast in data center networks , 2019, J. Netw. Comput. Appl..