In this paper, we propose CLEO, which is a machine learning approach to equal-cost multipath routing (ECMP) schemes to distribute and balance traffic. ECMP-based traffic load-balancing is widely practiced by datacenters, but hash collision resulting from skewed ECMP hashing makes it difficult to achieve the desired throughputs over paths. Various solutions have been proposed to overcome the performance degradation caused by hash collision, but most of these solutions require modifying packet headers or replacing switches. To solve this problem, CLEO builds a neural-network model that characterizes the ECMP scheme of a switch. The proof-of-concept evaluation shows that CLEO improves the root mean square error fourfold between the desired and real path throughputs.
[1]
Jennifer Rexford,et al.
Clove: Congestion-Aware Load Balancing at the Virtual Edge
,
2017,
CoNEXT.
[2]
Mark Handley,et al.
Improving datacenter performance and robustness with multipath TCP
,
2011,
SIGCOMM.
[3]
George Varghese,et al.
CONGA: distributed congestion-aware load balancing for datacenters
,
2015,
SIGCOMM.
[4]
Min Zhu,et al.
WCMP: weighted cost multipathing for improved fairness in data centers
,
2014,
EuroSys '14.
[5]
F. Richard Yu,et al.
Load Balancing in Data Center Networks: A Survey
,
2018,
IEEE Communications Surveys & Tutorials.