Congestion control in Converged Ethernet with heterogeneous and time-varying delays

Congestion control is an indispensable mechanism in the new trend of enhanced Ethernet as a unified fabric for traditional LAN, SAN, and high-performance computing networks. A congestion management framework for Converged Ethernet (CE) networks has been standardized by IEEE 802.1 Qau work group, and QCN is recommended as the congestion control scheme in the standard draft. QCN is heuristically designed for 1/10Gbps Ethernet without considering the impact of delays. Recent work find that QCN will encounter stability issues with feedback delays, and these issues will be more serious as Ethernet extends to 40/100Gbps and the delays become heterogeneous and time-varying. This work aims to mitigate the negative impact of delays on congestion control scheme in CE. Specially, considering the delays are heterogeneous and time-varying, we build a model for Converged Ethernet with the standard congestion management framework. The model provides a new congestion detector to estimate the real congestion status under the impact of delays and regards the heterogeneous and time-varying feature as disturbances. Leveraging the new congestion detector and tolerating the disturbance through the sliding mode control method, we design the Delay-tolerant Sliding Mode (DSM) congestion control scheme. Extensive simulations show that DSM outperforms other congestion control schemes when the Ethernet ranges from 1Gbps to 100Gbps and the delays are heterogeneous and time-varying.

[1]  David G. Andersen,et al.  Using RDMA efficiently for key-value services , 2015, SIGCOMM 2015.

[2]  Raj Jain,et al.  An Explicit Rate Control Framework for Lossless Ethernet Operation , 2008, 2008 IEEE International Conference on Communications.

[3]  Albert G. Greenberg,et al.  Data center TCP (DCTCP) , 2010, SIGCOMM '10.

[4]  Alan Weiss,et al.  Analysis of a rate-based control strategy with delayed feedback , 1992, SIGCOMM '92.

[5]  Thomas Voice,et al.  Stability and fairness of explicit congestion control with small buffers , 2008, CCRV.

[6]  Sally Floyd,et al.  Metrics for the Evaluation of Congestion Control Mechanisms , 2008, RFC.

[7]  Seong-Ryong Kang,et al.  Delayed stability and performance of distributed congestion control , 2004, SIGCOMM 2004.

[8]  Chuang Lin,et al.  Phase Plane Analysis of Quantized Congestion Notification for Data Center Ethernet , 2015, IEEE/ACM Transactions on Networking.

[9]  Ming Zhang,et al.  Congestion Control for Large-Scale RDMA Deployments , 2015, Comput. Commun. Rev..

[10]  Balaji Prabhakar,et al.  Stability analysis of QCN: the averaging principle , 2011, SIGMETRICS.

[11]  Chuang Lin,et al.  Sliding Mode Congestion Control for Data Center Ethernet Networks , 2015, IEEE Transactions on Computers.

[12]  Dennis Abts,et al.  A Guided Tour through Data-center Networking , 2012, ACM Queue.

[13]  Randy H. Katz,et al.  DeTail: reducing the flow completion time tail in datacenter networks , 2012, SIGCOMM '12.

[14]  Robert Birke,et al.  Got loss? Get zOVN! , 2013, SIGCOMM.