Scalable and Reliable Data Center Networks by Combining Source Routing and Automatic Labelling

Today, most user services are based on cloud computing, which leverages data center networks (DCNs) to efficiently route its communications. These networks process high volumes of traffic and require exhaustive failure management. Furthermore, expanding these networks is usually costly due to their constraint designs. In this article, we present enhanced Torii (eTorii), an automatic, scalable, reliable and flexible multipath routing protocol that aims to accomplish the demanding requirements of DCNs. We prove that eTorii is, by definition, applicable to a wide range of DCNs or any other type of hierarchical network and able to route with minimum forwarding table size and capable of rerouting around failed links on-the-fly with almost zero cost. A proof of concept of the eTorii protocol has been implemented using the Ryu SDN controller and the Mininet framework. Its evaluation shows that eTorii balances the load and preserves high-bandwidth utilization. Thus, it optimizes the use of DCN resources in comparison to other approaches, such as Equal-Cost Multi-Path (ECMP).

[1]  Rong Pan,et al.  Let It Flow: Resilient Asymmetric Load Balancing with Flowlet Switching , 2017, NSDI.

[2]  Wenjun Lv,et al.  CAPS: Coding-Based Adaptive Packet Spraying to Reduce Flow Completion Time in Data Center , 2019, IEEE/ACM Transactions on Networking.

[3]  Xin Jin,et al.  Your Data Center Switch is Trying Too Hard , 2016, SOSR.

[4]  Hong Zhang,et al.  Resilient Datacenter Load Balancing in the Wild , 2017, SIGCOMM.

[5]  Lixin Gao,et al.  Towards reliable and lightweight source switching for datacenter networks , 2017, IEEE INFOCOM 2017 - IEEE Conference on Computer Communications.

[6]  Elisa Rojas,et al.  Amaru: Plug&Play Resilient In-Band Control for SDN , 2019, IEEE Access.

[7]  Hong Liu,et al.  Jupiter Rising: A Decade of Clos Topologies and Centralized Control in Google's Datacenter Network , 2015, Comput. Commun. Rev..

[8]  Elisa Rojas,et al.  Stateless Flow-Zone Switching Using Software-Defined Addressing , 2021, IEEE Access.

[9]  Arturo Azcorra,et al.  Torii: multipath distributed Ethernet fabric protocol for data centres with zero‐loss path repair , 2015, Trans. Emerg. Telecommun. Technol..

[10]  Wenjun Lv,et al.  APS: Adaptive Packet Spraying to Isolate Mix-Flows in Data Center Network , 2022, IEEE Transactions on Cloud Computing.

[11]  Piotr Cholda,et al.  Network Recovery, Protection and Restoration of Optical, SONET-SDH, IP, and MPLS [Book Review] , 2005, IEEE Communications Magazine.

[12]  M. Handley,et al.  Improving datacenter performance and robustness with multipath TCP , 2011, SIGCOMM.

[13]  Fernando M. V. Ramos,et al.  Software-Defined Networking: A Comprehensive Survey , 2014, Proceedings of the IEEE.

[14]  Mark Handley,et al.  Improving datacenter performance and robustness with multipath TCP , 2011, SIGCOMM.

[15]  Mauro de Mesquita Spinola,et al.  Risks and Uncertainties in Cloud Computing: Literature Review, Trends and Gaps , 2017 .

[16]  Nirmala Shenoy,et al.  From Spanning Trees to Meshed Trees , 2020, 2020 International Conference on COMmunication Systems & NETworkS (COMSNETS).

[17]  Haitao Wu,et al.  BCube: a high performance, server-centric network architecture for modular data centers , 2009, SIGCOMM '09.

[18]  Weihe Li,et al.  Mitigating Packet Reordering for Random Packet Spraying in Data Center Networks , 2021, IEEE/ACM Transactions on Networking.

[19]  Diego Lopez-Pajares,et al.  TCP-path: Improving load balance by network exploration , 2017, 2017 IEEE 6th International Conference on Cloud Networking (CloudNet).

[20]  Pontus Sköldström,et al.  Scalable fault management for OpenFlow , 2012, 2012 IEEE International Conference on Communications (ICC).

[21]  Elisa Rojas,et al.  GA3: scalable, distributed address assignment for dynamic data center networks , 2017, Ann. des Télécommunications.

[22]  Nick McKeown,et al.  OpenFlow: enabling innovation in campus networks , 2008, CCRV.

[23]  Albert G. Greenberg,et al.  Data center TCP (DCTCP) , 2010, SIGCOMM '10.

[24]  Martín Casado,et al.  The Design and Implementation of Open vSwitch , 2015, NSDI.

[25]  Srihari Nelakuditi,et al.  On Selection of Paths for Multipath Routing , 2001, IWQoS.

[26]  Carl A. Sunshine,et al.  Source routing in computer networks , 1977, CCRV.

[27]  Amin Vahdat,et al.  A scalable, commodity data center network architecture , 2008, SIGCOMM '08.