FTLink: Efficient and flexible link fault tolerance scheme for data plane in Software-Defined Networking

Abstract Link fault tolerance has been a hotspot in Software-Defined Networking (SDN) for many years. Both proactive and reactive schemes are proposed to guarantee network availability and robustness. However, the former wastes plenty of Ternary Content Addressable Memory (TCAM) and bandwidth resources, while the latter faces long fault recovery time. In this paper, we propose FTLink, an efficient and flexible link fault tolerance scheme in SDN. Firstly, through collecting the current network state information, we pre-generate a set of backup links for each link of the primary path of the flow. We formulate the backup links planning as a multi-objective optimization problem that minimizes the required switch TCAMs and link bandwidths. Combining flow characteristics, we develop a two-step heuristic algorithm, namely planning backup links for each link that transmits elephant flows with the greedy tracing method in step 1 and for each link that transmits mice flows with the bidirectional searching method in step 2, to determine the backup links set. Then, we introduce a global matching table in the controller to maintain the generated backup links and additional flow rules. Once monitoring a faulty link, FTLink looks up the matching table entries and enables the backup links via installing the additional flow rules into the switches to realize flexible fault recovery. Simulations show that FTLink is effective in terms of high-efficiency TCAM and bandwidth usage and produces an acceptable recovery time of less than 30 ms, compared with baseline schemes. The prototype tests under realistic network further confirm the efficiency of FTLink.

[1]  Arne Ludwig,et al.  Efficient Loop-Free Rerouting of Multiple SDN Flows , 2018, IEEE/ACM Transactions on Networking.

[2]  Matthew Roughan,et al.  The Internet Topology Zoo , 2011, IEEE Journal on Selected Areas in Communications.

[3]  Zhi-Li Zhang,et al.  On the Feasibility and Efficacy of Protection Routing in IP Networks , 2010, IEEE/ACM Transactions on Networking.

[4]  Wei Zhang,et al.  Link Failure Recovery in SDN: High Efficiency, Strong Scalability and Wide Applicability , 2018, J. Circuits Syst. Comput..

[5]  Didier Colle,et al.  Enabling fast failure recovery in OpenFlow networks , 2011, 2011 8th International Workshop on the Design of Reliable Communication Networks (DRCN).

[6]  Yong Jiang,et al.  BOND: Flexible failure recovery in software defined networks , 2019, Comput. Networks.

[7]  Didier Colle,et al.  OpenFlow: Meeting carrier-grade recovery requirements , 2013, Comput. Commun..

[8]  Fernando M. V. Ramos,et al.  Software-Defined Networking: A Comprehensive Survey , 2014, Proceedings of the IEEE.

[9]  Chun-Wei Wang,et al.  Improving the Fault-Tolerance Under Software-Defined Network Based on New Sight of Agreement Protocol , 2018, IEEE Access.

[10]  Valliyammai Chinnaiah,et al.  Link Failure Recovery Using Shortest Path Fast Rerouting Technique in SDN , 2017, Wirel. Pers. Commun..

[11]  Piero Castoldi,et al.  OpenFlow-based segment protection in Ethernet networks , 2013, IEEE/OSA Journal of Optical Communications and Networking.

[12]  Thierry Turletti,et al.  A Survey of Software-Defined Networking: Past, Present, and Future of Programmable Networks , 2014, IEEE Communications Surveys & Tutorials.

[13]  H. Jonathan Chao,et al.  Balancing flow table occupancy and link utilization in software-defined networks , 2018, Future Gener. Comput. Syst..

[14]  Paulo César da Rocha Fonseca,et al.  A Survey on Fault Management in Software-Defined Networks , 2017, IEEE Communications Surveys & Tutorials.

[15]  Tram Truong-Huu,et al.  A Novel Model for Competition and Cooperation among Cloud Providers , 2014, IEEE Transactions on Cloud Computing.

[16]  Myungjin Lee,et al.  High-Fidelity Per-Flow Delay Measurements With Reference Latency Interpolation , 2013, IEEE/ACM Transactions on Networking.

[17]  Nick McKeown,et al.  OpenFlow: enabling innovation in campus networks , 2008, CCRV.

[18]  P. Castoldi,et al.  Effective flow protection in OpenFlow rings , 2013, 2013 Optical Fiber Communication Conference and Exposition and the National Fiber Optic Engineers Conference (OFC/NFOEC).

[19]  Subhasis Banerjee,et al.  Compact TCAM: Flow Entry Compaction in TCAM for Power Aware SDN , 2013, ICDCN.

[20]  Thar Baker,et al.  Multi-controller Based Software-Defined Networking: A Survey , 2018, IEEE Access.

[21]  Ulas C. Kozat,et al.  On Optimal Topology Verification and Failure Localization for Software Defined Networks , 2016, IEEE/ACM Transactions on Networking.

[22]  Ian F. Akyildiz,et al.  Towards Optimal Network Planning for Software-Defined Networks , 2018, IEEE Transactions on Mobile Computing.

[23]  Song Guo,et al.  Green DataPath for TCAM-Based Software-Defined Networks , 2016, IEEE Communications Magazine.

[24]  Lianshan Yan,et al.  Link failure detection in software defined networks: an active feedback mechanism , 2017 .