GA3: scalable, distributed address assignment for dynamic data center networks

Deployment and maintenance of current data center networks is costly and prone to errors. In order to avoid manual configuration, many of them require centralized administrators which constitute a clear bottleneck, while distributed approaches do not guarantee sufficient flexibility or robustness. This paper describes and evaluates GA3 (Generalized Automatic Address Assignment), a discovery protocol that assigns multiple unique labels to all the switches in a hierarchical network, without any modification of hosts or the standard Ethernet frames. Labeling is distributed and uses probes. These labels can be leveraged for shortest path routing without tables, as in the case of the Torii protocol, but GA3 also allows other label-based routing protocols (such as PortLand or ALIAS). Additionally, GA3 can detect miswirings in the network. Furthermore, control traffic is only necessary upon network deployment rather than periodically. Simulation results showed a reduced convergence time of less than 2 s and 100 kB/s of bandwidth (to send the GA3 frames) in the worst case for popular data center topologies, which outperforms other similar protocols.

[1]  Xingyu Ma,et al.  Error Tolerant Address Configuration for Data Center Networks with Malfunctioning Devices , 2012, 2012 IEEE 32nd International Conference on Distributed Computing Systems.

[2]  Arturo Azcorra,et al.  Torii: multipath distributed Ethernet fabric protocol for data centres with zero‐loss path repair , 2015, Trans. Emerg. Telecommun. Technol..

[3]  Amin Vahdat,et al.  Brief Announcement: A Randomized Algorithm for Label Assignment in Dynamic Networks , 2011, DISC.

[4]  Jing Yuan,et al.  DAC: Generic and Automatic Address Configuration for Data Center Networks , 2012, IEEE/ACM Transactions on Networking.

[5]  Xin Jin,et al.  Your Data Center Switch is Trying Too Hard , 2016, SOSR.

[6]  Rahul Potharaju,et al.  When the network crumbles: an empirical study of cloud network failures and their impact on services , 2013, SoCC.

[7]  Roberto Rojas-Cessa,et al.  Schemes for Fast Transmission of Flows in Data Center Networks , 2015, IEEE Communications Surveys & Tutorials.

[8]  T. S. Eugene Ng,et al.  Enabling Topological Flexibility for Data Centers Using OmniSwitch , 2015, HotCloud.

[9]  Yonggang Wen,et al.  A Survey on Data Center Networking (DCN): Infrastructure and Operations , 2017, IEEE Communications Surveys & Tutorials.

[10]  Tao Chen,et al.  The features, hardware, and architectures of data center networks: A survey , 2016, J. Parallel Distributed Comput..

[11]  Ramesh Govindan,et al.  A General Approach to Network Configuration Analysis , 2015, NSDI.

[12]  Christian Esteve Rothenberg,et al.  SlickFlow: Resilient source routing in Data Center Networks unlocked by OpenFlow , 2013, 38th Annual IEEE Conference on Local Computer Networks.

[13]  Amin Vahdat,et al.  A scalable, commodity data center network architecture , 2008, SIGCOMM '08.

[14]  Ming Zhu,et al.  GARDEN: Generic Addressing and Routing for Data Center Networks , 2012, 2012 IEEE Fifth International Conference on Cloud Computing.

[15]  Marcos Rogério Salvador,et al.  Automatic Discovery of Physical Topology in Ethernet Networks , 2008, 22nd International Conference on Advanced Information Networking and Applications (aina 2008).

[16]  Hong Liu,et al.  Jupiter Rising: A Decade of Clos Topologies and Centralized Control in Google's Datacenter Network , 2015, Comput. Commun. Rev..

[17]  Hong Wang,et al.  Tree-conf: A fast automatic address configuration method for tree-like data center networks , 2014, Fifth International Conference on Computing, Communications and Networking Technologies (ICCCNT).

[18]  Wei Liang,et al.  MTR: Fault tolerant routing in Clos data center network with miswiring links , 2014, 2014 IEEE 20th International Workshop on Local & Metropolitan Area Networks (LANMAN).

[19]  Alejandro López-Ortiz,et al.  REWIRE: An optimization-based framework for unstructured data center network design , 2012, 2012 Proceedings IEEE INFOCOM.

[20]  Amin Vahdat,et al.  PortLand: a scalable fault-tolerant layer 2 data center network fabric , 2009, SIGCOMM '09.

[21]  Elisa Rojas,et al.  All-Path bridging: Path exploration protocols for data center and campus networks , 2015, Comput. Networks.

[22]  Albert G. Greenberg,et al.  VL2: a scalable and flexible data center network , 2009, SIGCOMM '09.