SSP: Speeding up Small Flows for Proactive Transport in Datacenters

Proactive transports nowadays have drawn much attention because of fast convergence, near-zero queueing and low latency. Proactive protocols, however, need an extra RTT to allocate ideal sending rate for new flows. To solve this, some studies, such as pHost, Homa, send unscheduled packets with line rate in the first RTT, which will causes severe network congestion. To avoid the queue buildup, Aeolus directly drops unscheduled packets when congestion occurs. Nevertheless, based on our experiment, a considerable part of small flows (0–100 KB) will be completed in the first RTT under 100 Gbps network, so that dropping unscheduled packets will severely affect performance of the small flows. In this paper we propose SSP, a new scheme aimed to eliminate the extra RTT delay and improve the flow completion time (FCT) of small flows under the proactive mechanism. Like pHost and Homa, SSP sends unscheduled packets at line rate when new flow arrives. Different from Aeolus, SSP selectively drops scheduled packets once queue buildup happens in the switch, thus protecting unscheduled packets which are more likely belong to small flows. Besides, based on the short-job-first (SJF) principle, we give relative higher priorities for small flows at the sender. Our simulation results with realistic workloads show that SSP can improve the FCT of small flows significantly. Specifically, under Web Search workload, SSP facilitates nearly 63% of 0–100 KB flows to complete one RTT faster. Also, SSP reduces the tail FCT by 56.8% at the 99th percentile compared with Expresspass and 29.2% compared with Aeolus while not leads to large queue buildup.

[1]  Gautam Kumar,et al.  pHost: distributed near-optimal datacenter transport over commodity network fabric , 2015, CoNEXT.

[2]  Albert G. Greenberg,et al.  Data center TCP (DCTCP) , 2010, SIGCOMM '10.

[3]  Hong Liu,et al.  Jupiter Rising: A Decade of Clos Topologies and Centralized Control in Google's Datacenter Network , 2015, Comput. Commun. Rev..

[4]  Rong Pan,et al.  AF-QCN: Approximate Fairness with Quantized Congestion Notification for Multi-tenanted Data Centers , 2010, 2010 18th IEEE Symposium on High Performance Interconnects.

[5]  Amin Vahdat,et al.  Less Is More: Trading a Little Bandwidth for Ultra-Low Latency in the Data Center , 2012, NSDI.

[6]  Dezun Dong,et al.  Congestion control in high-speed lossless data center networks: A survey , 2018, Future Gener. Comput. Syst..

[7]  Larry L. Peterson,et al.  TCP Vegas: new techniques for congestion detection and avoidance , 1994 .

[8]  Kai Chen,et al.  Augmenting Proactive Congestion Control with Aeolus , 2018, APNet '18.

[9]  Ming Zhang,et al.  Congestion Control for Large-Scale RDMA Deployments , 2015, Comput. Commun. Rev..

[10]  Xiaowei Yang,et al.  A DoS-limiting network architecture , 2005, SIGCOMM '05.

[11]  George Varghese,et al.  High Speed Networks Need Proactive Congestion Control , 2015, HotNets.

[12]  Nick McKeown,et al.  Rate control protocol (rcp): congestion control to make flows complete quickly , 2008 .

[13]  Alex X. Liu,et al.  Friends, not Foes – Synthesizing Existing Transport Strategies for Data Center Networks , 2014 .

[14]  Nick McKeown,et al.  Deconstructing datacenter packet transport , 2012, HotNets-XI.

[15]  Yi Wang,et al.  Aeolus: A Building Block for Proactive Transport in Datacenters , 2020, SIGCOMM.

[16]  Dongsu Han,et al.  Credit-Scheduled Delay-Bounded Congestion Control for Datacenters , 2017, SIGCOMM.

[17]  Amin Vahdat,et al.  TIMELY: RTT-based Congestion Control for the Datacenter , 2015, Comput. Commun. Rev..

[18]  Ulrich Brüning,et al.  A versatile, low latency HyperTransport core , 2007, FPGA '07.

[19]  Albert G. Greenberg,et al.  VL2: a scalable and flexible data center network , 2009, SIGCOMM '09.

[20]  Wei Bai,et al.  Information-Agnostic Flow Scheduling for Commodity Data Centers , 2015, NSDI.

[21]  H. T. Kung,et al.  Credit-based flow control for ATM networks: credit update protocol, adaptive credit allocation and statistical multiplexing , 1994, SIGCOMM.

[22]  Zihao Wei,et al.  EC4: ECN and Credit-Reservation Converged Congestion Control , 2019, 2019 IEEE 25th International Conference on Parallel and Distributed Systems (ICPADS).

[23]  Devavrat Shah,et al.  Fastpass , 2014, SIGCOMM.

[24]  Dezun Dong,et al.  Network Congestion Avoidance through Packet-chaining Reservation , 2019, ICPP.

[25]  Haitao Wu,et al.  ICTCP: Incast Congestion Control for TCP in Data-Center Networks , 2010, IEEE/ACM Transactions on Networking.

[26]  Nan Jiang,et al.  Channel reservation protocol for over-subscribed channels and destinations , 2013, 2013 SC - International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[27]  Changhyun Lee,et al.  Accurate Latency-based Congestion Feedback for Datacenters , 2015, USENIX Annual Technical Conference.

[28]  Nan Jiang,et al.  Network endpoint congestion control for fine-grained communication , 2015, SC15: International Conference for High Performance Computing, Networking, Storage and Analysis.

[29]  Fengyuan Ren,et al.  TFC: token flow control in data center networks , 2016, EuroSys.

[30]  Alex C. Snoeren,et al.  Inside the Social Network's (Datacenter) Network , 2015, Comput. Commun. Rev..

[31]  Alex X. Liu,et al.  Friends, not foes: synthesizing existing transport strategies for data center networks , 2015, SIGCOMM 2015.

[32]  Antony I. T. Rowstron,et al.  Better never than late: meeting deadlines in datacenter networks , 2011, SIGCOMM.

[33]  John K. Ousterhout,et al.  Homa: a receiver-driven low-latency transport protocol using network priorities , 2018, SIGCOMM.

[34]  Brighten Godfrey,et al.  Finishing flows quickly with preemptive scheduling , 2012, CCRV.

[35]  Nan Jiang,et al.  Network congestion avoidance through Speculative Reservation , 2012, IEEE International Symposium on High-Performance Comp Architecture.

[36]  Zihao Wei,et al.  ExpressPass++: Credit-Effecient Congestion Control for Data Centers , 2019, 2019 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/SocialCom/SustainCom).