Efficient Recovery Path Computation for Fast Reroute in Large-Scale Software-Defined Networks

With an increasing demand for resilience in software-defined networks (SDN), it becomes critical to minimize service recovery delay upon route failures. Fast reroute (FRR) mechanisms are widely used in IP and MPLS networks by computing the recovery path before a failure occurs. The centralized control plane in SDN can potentially enhance path computation, so that FRR path computation can better scale in SDN than in traditional networks. However, the traditional FRR path computation algorithms could lead to a poor performance in large-scale SDN. The problem can become more severe for a highly dynamic network, which often sees dozens of failures or configuration changes in any single day. We propose a new algorithm that exploits pruned searching to quickly compute recovery paths for all-pair switches/hosts upon a link failure. For applications requiring stringent path robustness levels, we also extend this algorithm to quickly find the shortest guaranteed-cost path, which ensures that the recovery path used upon on-path link failures has the minimum cost. Compared with traditional solutions, our evaluations show that our algorithm is about 8 ~ 81 times faster than the practical implementation, 1.93 ~ 3.11 times faster than the state-of-the-art solution. Our results also show that the shortest guaranteed-cost path can reduce the cost of the recovery path significantly. Moreover, we design a prototype to show how to deploy our algorithm in an OpenFlow network.

[1]  Niels L. M. van Adrichem,et al.  Backup rules in Software-Defined Networks , 2016, 2016 IEEE Conference on Network Function Virtualization and Software Defined Networks (NFV-SDN).

[2]  Takuya Akiba,et al.  Efficient Top-k Shortest-Path Distance Queries on Large Networks by Pruned Landmark Labeling , 2015, AAAI.

[3]  Oliver C. Ibe,et al.  A survey of IP and multiprotocol label switching fast reroute schemes , 2007, Comput. Networks.

[4]  Christian Esteve Rothenberg,et al.  SlickFlow: Resilient source routing in Data Center Networks unlocked by OpenFlow , 2013, 38th Annual IEEE Conference on Local Computer Networks.

[5]  Arjan Durresi,et al.  A survey: Control plane scalability issues and approaches in Software-Defined Networking (SDN) , 2017, Comput. Networks.

[6]  Athina Markopoulou,et al.  Characterization of failures in an IP backbone , 2004, IEEE INFOCOM 2004.

[7]  Michael Menth,et al.  Efficient Data Plane Protection for SDN , 2018, 2018 4th IEEE Conference on Network Softwarization and Workshops (NetSoft).

[8]  Antonio Capone,et al.  Detour planning for fast and reliable failure recovery in SDN with OpenState , 2014, 2015 11th International Conference on the Design of Reliable Communication Networks (DRCN).

[9]  Alia Atlas,et al.  Fast Reroute Extensions to RSVP-TE for LSP Tunnels , 2005, RFC.

[10]  Éva Tardos,et al.  Maximizing the Spread of Influence through a Social Network , 2015, Theory Comput..

[11]  Stefan Schmid,et al.  Provable data plane connectivity with local fast failover: introducing openflow graph algorithms , 2014, HotSDN.

[12]  Stewart Bryant,et al.  IP Fast Reroute Framework , 2010, RFC.

[13]  Vijay Srinivasan,et al.  RSVP-TE: Extensions to RSVP for LSP Tunnels , 2001, RFC.

[14]  Min Zhu,et al.  B4: experience with a globally-deployed software defined wan , 2013, SIGCOMM.

[15]  Yuan-Cheng Lai,et al.  Fast failover and switchover for link failures and congestion in software defined networks , 2016, 2016 IEEE International Conference on Communications (ICC).

[16]  Yashar Ganjali,et al.  On scalability of software-defined networking , 2013, IEEE Communications Magazine.

[17]  Alberto Dainotti,et al.  SWIFT: Predictive Fast Reroute , 2017, SIGCOMM.

[18]  Peter Sanders,et al.  [Delta]-stepping: a parallelizable shortest path algorithm , 2003, J. Algorithms.

[19]  Chen-Nee Chuah,et al.  Characterization of Failures in an Operational IP Backbone Network , 2008, IEEE/ACM Transactions on Networking.

[20]  Jan Medved,et al.  OpenDaylight: Towards a Model-Driven SDN Controller architecture , 2014, Proceeding of IEEE International Symposium on a World of Wireless, Mobile and Multimedia Networks 2014.

[21]  Nick McKeown,et al.  OpenFlow: enabling innovation in campus networks , 2008, CCRV.

[22]  Fernando A. Kuipers,et al.  Fast Recovery in Software-Defined Networks , 2014, 2014 Third European Workshop on Software Defined Networks.

[23]  Aubin Jarry Fast reroute paths algorithms , 2013, Telecommun. Syst..

[24]  Paulo César da Rocha Fonseca,et al.  A Survey on Fault Management in Software-Defined Networks , 2017, IEEE Communications Surveys & Tutorials.

[25]  Nick Feamster,et al.  CORONET: Fault tolerance for Software Defined Networks , 2012, 2012 20th IEEE International Conference on Network Protocols (ICNP).

[26]  Mingwei Xu,et al.  IP Fast Reroute: NotVia with Early Decapsulation , 2010, 2010 IEEE Global Telecommunications Conference GLOBECOM 2010.

[27]  Hamid Farhadi,et al.  Software-Defined Networking: A survey , 2015, Comput. Networks.

[28]  Andreas Voellmy,et al.  Scalable software defined network controllers , 2012, SIGCOMM '12.

[29]  Gábor Rétvári,et al.  Optimizing IGP link costs for improving IP-level resilience , 2011, 2011 8th International Workshop on the Design of Reliable Communication Networks (DRCN).

[30]  Alia Atlas,et al.  Basic Specification for IP Fast Reroute: Loop-Free Alternates , 2008, RFC.

[31]  Vishal Sharma,et al.  Framework for Multi-Protocol Label Switching (MPLS)-based Recovery , 2003, RFC.

[32]  Didier Colle,et al.  OpenFlow: Meeting carrier-grade recovery requirements , 2013, Comput. Commun..

[33]  Marco Canini,et al.  FatTire: declarative fault tolerance for software-defined networks , 2013, HotSDN '13.

[34]  Roberto Bifulco,et al.  Towards Scalable SDN Switches: Enabling Faster Flow Table Entries Installation , 2015, Comput. Commun. Rev..

[35]  Alan L. Cox,et al.  Scalable Multi-Failure Fast Failover via Forwarding Table Compression , 2016, SOSR.

[36]  Pavlin Radoslavov,et al.  ONOS: towards an open, distributed SDN OS , 2014, HotSDN.

[37]  Zhi-Li Zhang,et al.  On the Feasibility and Efficacy of Protection Routing in IP Networks , 2010, IEEE/ACM Transactions on Networking.

[38]  Takuya Akiba,et al.  Fast exact shortest-path distance queries on large networks by pruned landmark labeling , 2013, SIGMOD '13.

[39]  Davide Sanvito,et al.  Fast failure detection and recovery in SDN with stateful data plane , 2016, Int. J. Netw. Manag..

[40]  Dennis Cai,et al.  Evolve carrier ethernet architecture with SDN and segment routing , 2014, Proceeding of IEEE International Symposium on a World of Wireless, Mobile and Multimedia Networks 2014.

[41]  Ian F. Akyildiz,et al.  Research challenges for traffic engineering in software defined networks , 2016, IEEE Network.

[42]  Moshe Lewenstein,et al.  Improved algorithms for the k simple shortest paths and the replacement paths problems , 2009, Inf. Process. Lett..

[43]  Koji Okamura,et al.  Fast failover mechanism for software defined networking: OpenFlow based , 2014, CFI '14.

[44]  Subhash Suri,et al.  Vickrey prices and shortest paths: what is an edge worth? , 2001, Proceedings 2001 IEEE International Conference on Cluster Computing.

[45]  Navendu Jain,et al.  Understanding network failures in data centers: measurement, analysis, and implications , 2011, SIGCOMM.