Proactive coordination for low-congestion multi-path datacenter networks

Abstract Modern datacenters involve a rich mix of workloads, each of which puts forward different service-level objective, including high throughput, low latency, etc. Currently, most datacenters introduce statistical multiplexing technology and oversubscription to the network design to lower the total cost, which can easily lead to the occurrence of network congestion, especially when the network is highly occupied by throughput-intensive workloads. This paper describes ProCAM, a proactive congestion avoidance mechanism for datacenter networks. As throughput-intensive flows are the chief culprit of network congestion, ProCAM adapts the multi-path routing to control transmission bandwidth and utilizes the predictability of throughput-intensive flows to prearrange optimal coordinate scheme (desynchronize the sending time of concurrent long-flows) beforehand as a proactive manner, by solving the low-congestion transmission model which minimizes network-wide host-to-host transmission latency from a global perspective. In this way, queue length in buffers can be kept at a low level and the performance of latency-sensitive flows can be guaranteed. In the evaluation experiments based on simulation with Mininet and SDN controller Ryu, the extensive simulations show that the proposed ProCAM can achieve high throughput with nearly zero packet loss and low latency when the network is highly occupied.

[1]  M. Abadi,et al.  Naiad: a timely dataflow system , 2013, SOSP.

[2]  Ming Zhang,et al.  MicroTE: fine grained traffic engineering for data centers , 2011, CoNEXT '11.

[3]  Praveen Yalagandula,et al.  Mahout: Low-overhead datacenter traffic management using end-host-based elephant detection , 2011, 2011 Proceedings IEEE INFOCOM.

[4]  Nick McKeown,et al.  A network in a laptop: rapid prototyping for software-defined networks , 2010, Hotnets-IX.

[5]  Fengyuan Ren,et al.  Slowing Little Quickens More: Improving DCTCP for Massive Concurrent Flows , 2015, 2015 44th International Conference on Parallel Processing.

[6]  Rachid Anane,et al.  Implementation of a proactive load sharing scheme , 2003, SAC '03.

[7]  Halima Elbiaze,et al.  Fair Congestion Control Protocol for Data Center Bridging , 2019, IEEE Systems Journal.

[8]  Albert G. Greenberg,et al.  Data center TCP (DCTCP) , 2010, SIGCOMM '10.

[9]  Charles E. Leiserson,et al.  Fat-trees: Universal networks for hardware-efficient supercomputing , 1985, IEEE Transactions on Computers.

[10]  Werner Ebeling,et al.  Boltzmann-, Darwin-, and Haeckel-Strategies in Optimization Problems , 1990, PPSN.

[11]  Haitao Wu,et al.  ICTCP: Incast Congestion Control for TCP in Data-Center Networks , 2010, IEEE/ACM Transactions on Networking.

[12]  Amin Vahdat,et al.  Hedera: Dynamic Flow Scheduling for Data Center Networks , 2010, NSDI.

[13]  Guangjie Han,et al.  IRPL: An energy efficient routing protocol for wireless sensor networks , 2017, J. Syst. Archit..

[14]  Christopher Frost,et al.  Spanner: Google's Globally-Distributed Database , 2012, OSDI.

[15]  Amin Vahdat,et al.  Practical TDMA for datacenter ethernet , 2012, EuroSys '12.

[16]  Alex C. Snoeren,et al.  Inside the Social Network's (Datacenter) Network , 2015, Comput. Commun. Rev..

[17]  Junda Liu,et al.  Multi-enterprise networking , 2000 .

[18]  Ben Y. Zhao,et al.  Cutting the cord: a robust wireless facilities network for data centers , 2014, MobiCom.

[19]  Jin Ye,et al.  Fine-Grained Congestion Control for Multipath TCP in Data Center Networks , 2019, IEEE Access.

[20]  Robert N. M. Watson,et al.  Queues Don't Matter When You Can JUMP Them! , 2015, NSDI.

[21]  David J. Sirag,et al.  Toward a unified thermodynamic genetic operator , 1987 .

[22]  Nick McKeown,et al.  OpenFlow: enabling innovation in campus networks , 2008, CCRV.

[23]  Mark Handley,et al.  Design, Implementation and Evaluation of Congestion Control for Multipath TCP , 2011, NSDI.

[24]  Bo Peng,et al.  HybridPass: Hybrid Scheduling for Mixed Flows in Datacenter Networks , 2018, 2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS).

[25]  Paramvir Bahl,et al.  Augmenting data center networks with multi-gigabit wireless links , 2011, SIGCOMM 2011.

[26]  Yue Cao,et al.  Highly-Efficient Bulk Data Transfer for Structured Dissemination in Wireless Embedded Network Systems , 2017, J. Syst. Archit..

[27]  George Varghese,et al.  CONGA: distributed congestion-aware load balancing for datacenters , 2015, SIGCOMM.

[28]  Nadjib Aitsaadi,et al.  A Novel Joint Routing and Channel Allocation Approach in Hybrid Data Center Network , 2017, 2017 14th Annual IEEE International Conference on Sensing, Communication, and Networking (SECON).

[29]  Amin Vahdat,et al.  TIMELY: RTT-based Congestion Control for the Datacenter , 2015, Comput. Commun. Rev..

[30]  Lakshminarayanan Subramanian,et al.  One more bit is enough , 2005, SIGCOMM '05.

[31]  Devavrat Shah,et al.  Fastpass: a centralized "zero-queue" datacenter network , 2015, SIGCOMM 2015.

[32]  Jennifer Rexford,et al.  HULA: Scalable Load Balancing Using Programmable Data Planes , 2016, SOSR.

[33]  Guy Kortsarz,et al.  Sum Multi-coloring of Graphs , 1999, ESA.

[34]  Xin Xu,et al.  ProCAM: A Proactive Coordinating Mechanism for Low-Congestion Datacenter Networks , 2016, 2016 IEEE 18th International Conference on High Performance Computing and Communications; IEEE 14th International Conference on Smart City; IEEE 2nd International Conference on Data Science and Systems (HPCC/SmartCity/DSS).

[35]  Jianxin Wang,et al.  Receiver-driven fair congestion control for TCP outcast in data center networks , 2019, J. Netw. Comput. Appl..

[36]  P. Deepalakshmi,et al.  A Simple Congestion Avoidance mechanism for OpenDaylight (ODL) - Multipath TCP (MPTCP) Network structure in Software Defined Data Center (SDDC) , 2019, 2019 5th International Conference on Advanced Computing & Communication Systems (ICACCS).

[37]  Mendel Rosenblum,et al.  Fast crash recovery in RAMCloud , 2011, SOSP.

[38]  Amin Vahdat,et al.  A scalable, commodity data center network architecture , 2008, SIGCOMM '08.