SRPT-based Congestion Control for Flows with Unknown Sizes

Modern datacenter transports are required to support latency constraints, usually represented by various forms of flow completion time (FCT). Most implemented congestion control mechanisms that minimize FCT are based on SRPT priorities (e.g., pFabric and Homa). However, SRPT-based scheduling requires prior knowledge of flow sizes, making this discipline problematic in general. Non-SRPT-based alternatives such as LAS and PIAS are able to cope with this level of uncertainty but suffer from their own limitations: LAS can lead to significant starvation of concurrent elephant flows, while PIAS requires a centralized entity for correct settings. In this work, we generalize SRPT-based scheduling to allow flows with known and unknown sizes to sojourn at the same time. We not only show analytic properties of this generalization but rigorously prove important properties of non-SRPT alternatives with competitive analysis. Based on the proposed SRPT generalization, we introduce a new ASCC congestion control. Our main goal is not to propose yet another congestion control but to identify preferable and pathological traffic patterns with unknown flow sizes for various scheduling disciplines. Our observations are validated by an extensive evaluation study.

[1]  Alex X. Liu,et al.  Friends, not Foes – Synthesizing Existing Transport Strategies for Data Center Networks , 2014 .

[2]  Albert G. Greenberg,et al.  Data center TCP (DCTCP) , 2010, SIGCOMM '10.

[4]  Nick McKeown,et al.  pFabric: minimal near-optimal datacenter transport , 2013, SIGCOMM.

[5]  Hari Balakrishnan,et al.  An experimental study of the learnability of congestion control , 2014, SIGCOMM.

[6]  Hari Balakrishnan,et al.  TCP ex machina: computer-generated congestion control , 2013, SIGCOMM.

[7]  Gautam Kumar,et al.  pHost: distributed near-optimal datacenter transport over commodity network fabric , 2015, CoNEXT.

[8]  Mor Harchol-Balter,et al.  SRPT for Multiserver Systems , 2018, PERV.

[9]  Wei Bai,et al.  Information-Agnostic Flow Scheduling for Commodity Data Centers , 2015, NSDI.

[10]  Antony I. T. Rowstron,et al.  Better never than late: meeting deadlines in datacenter networks , 2011, SIGCOMM.

[11]  Albert G. Greenberg,et al.  VL2: a scalable and flexible data center network , 2009, SIGCOMM '09.

[12]  Allan Borodin,et al.  Online computation and competitive analysis , 1998 .

[13]  黄贻彬,et al.  Microsoft SQL Server中的星形连接查询优化 , 2011 .

[14]  Amin Vahdat,et al.  Sincronia: near-optimal network design for coflows , 2018, SIGCOMM.

[15]  Guillaume Urvoy-Keller,et al.  Performance analysis of LAS-based scheduling disciplines in a packet switched network , 2004, SIGMETRICS '04/Performance '04.

[16]  Devavrat Shah,et al.  Fastpass , 2014, SIGCOMM.

[17]  Haitao Wu,et al.  Towards minimal-delay deadline-driven data center TCP , 2013, HotNets.

[18]  Yang Xu,et al.  HyLine: a Simple and Practical Flow Scheduling for Commodity Datacenters , 2018, 2018 IFIP Networking Conference (IFIP Networking) and Workshops.

[19]  Rong Pan,et al.  Let It Flow: Resilient Asymmetric Load Balancing with Flowlet Switching , 2017, NSDI.

[20]  Roy T. Fielding,et al.  Hypertext Transfer Protocol - HTTP/1.1 , 1997, RFC.

[21]  Albert G. Greenberg,et al.  How fair is fair queuing , 1992, JACM.

[22]  Sangeetha Abdu Jyothi,et al.  Is advance knowledge of flow sizes a plausible assumption? , 2019, NSDI.

[23]  T. N. Vijaykumar,et al.  Deadline-aware datacenter tcp (D2TCP) , 2012, SIGCOMM '12.

[24]  Brighten Godfrey,et al.  Finishing flows quickly with preemptive scheduling , 2012, CCRV.

[25]  Alex X. Liu,et al.  PASE: Synthesizing Existing Transport Strategies for Near-Optimal Data Center Transport , 2017, IEEE/ACM Transactions on Networking.

[26]  John K. Ousterhout,et al.  Homa: a receiver-driven low-latency transport protocol using network priorities , 2018, SIGCOMM.

[27]  Nick McKeown,et al.  Why flow-completion time is the right metric for congestion control , 2006, CCRV.