Hysteresis-based Active Queue Management for TCP Traffic in Data Centers

Much of the incremental improvement to TCP over the past three decades had the ultimate goal of making it more effective in using the long-fat pipes of the global Internet. This resulted in a rigid set of mechanisms in the protocol that put TCP at a disadvantage in small-delay environments such as data centers. In particular, in the presence of the shallow buffers of commodity switches and the short round trip times in data centers, the continued use of a large TCP initial congestion window and a huge minimum retransmission timeout (both inherited from the Internet-centric design) results in a very short TCP loss cycle that affects particularly the flow completion times of short-lived incast flows. In this paper, we first investigate empirically the TCP loss cycle and discuss its impact on packet losses, recovery and delay; then we propose a switch-based congestion controller with hysteresis (HSCC) that aims to stretch the TCP loss cycle without modifying TCP itself. To protect incast flows from severe congestion, HSCC is designed to transparently induce the TCP source to alternate between its native TCP congestion control algorithm and a slower more conservative constant bit rate flow control mode that is activated when congestion is imminent. We show the stability of HSCC via analytical modelling, and demonstrate its effectiveness via simulation and implementation in a small testbed.

[1]  Junda Liu,et al.  Multi-enterprise networking , 2000 .

[2]  Avinoam Kolodny,et al.  Links as a service (LaaS): Guaranteed tenant isolation in the shared cloud , 2015, 2016 ACM/IEEE Symposium on Architectures for Networking and Communications Systems (ANCS).

[3]  Keqiang He,et al.  AC/DC TCP: Virtual Congestion Control Enforcement for Datacenter Networks , 2016, SIGCOMM.

[4]  David A. Maltz,et al.  Data center TCP (DCTCP) , 2010, SIGCOMM 2010.

[5]  Haitao Wu,et al.  Enabling ECN in Multi-Service Multi-Queue Data Centers , 2016, NSDI.

[6]  Nick McKeown,et al.  Virtualized Congestion Control , 2016, SIGCOMM.

[7]  Cheng Jin,et al.  FAST TCP: Motivation, Architecture, Algorithms, Performance , 2006, IEEE/ACM Transactions on Networking.

[8]  Srinivasan Seshan,et al.  On application-level approaches to avoiding TCP throughput collapse in cluster-based storage systems , 2007, PDSW '07.

[9]  Chuang Lin,et al.  Comprehensive understanding of TCP Incast problem , 2015, 2015 IEEE Conference on Computer Communications (INFOCOM).

[10]  Ming Zhang,et al.  Congestion Control for Large-Scale RDMA Deployments , 2015, Comput. Commun. Rev..

[11]  Amar Phanishayee,et al.  Safe and effective fine-grained TCP retransmissions for datacenter communication , 2009, SIGCOMM '09.

[12]  Van Jacobson,et al.  TCP Extensions for High Performance , 1992, RFC.

[13]  Vishal Misra,et al.  Fluid-based analysis of a network of AQM routers supporting TCP flows with an application to RED , 2000, SIGCOMM.

[14]  P. Olver Nonlinear Systems , 2013 .

[15]  Haitao Wu,et al.  ICTCP: Incast Congestion Control for TCP in Data-Center Networks , 2013, IEEE/ACM Transactions on Networking.

[16]  Ning Ding,et al.  The only constant is change: incorporating time-varying network reservations in data centers , 2012, SIGCOMM.

[17]  Matthew Mathis,et al.  The macroscopic behavior of the TCP congestion avoidance algorithm , 1997, CCRV.

[18]  Chuang Lin,et al.  Modeling and understanding TCP incast in data center networks , 2011, 2011 Proceedings IEEE INFOCOM.

[19]  Ming Zhang,et al.  Understanding data center traffic characteristics , 2010, CCRV.

[20]  Nick McKeown,et al.  Deconstructing datacenter packet transport , 2012, HotNets-XI.

[21]  Albert G. Greenberg,et al.  Sharing the Data Center Network , 2011, NSDI.

[22]  Injong Rhee,et al.  CUBIC: a new TCP-friendly high-speed TCP variant , 2008, OPSR.

[23]  Randy H. Katz,et al.  DeTail: reducing the flow completion time tail in datacenter networks , 2012, SIGCOMM '12.

[24]  Albert G. Greenberg,et al.  The nature of data center traffic: measurements & analysis , 2009, IMC '09.

[25]  Balaji Prabhakar,et al.  Stability analysis of QCN: the averaging principle , 2011, SIGMETRICS.

[26]  Amin Vahdat,et al.  TIMELY: RTT-based Congestion Control for the Datacenter , 2015, Comput. Commun. Rev..

[27]  Ming Zhang,et al.  Congestion Control for Large-Scale RDMA Deployments , 2015, Comput. Commun. Rev..

[28]  Marco Mellia,et al.  TCP model for short lived flows , 2002, IEEE Communications Letters.