Achieving Sub-50 Milliseconds Recovery Upon BGP Peering Link Failures

Recent measurements show that BGP peering links can fail as frequently as intradomain links and usually for short periods of time. We propose a new fast-reroute technique where routers are prepared to react quickly to interdomain link failures. For each of its interdomain links, a router precomputes a protection tunnel, i.e., an IP tunnel to an alternate nexthop which can reach the same destinations as via the protected link. We propose a BGP- based auto-discovery technique that allows each router to learn the candidate protection tunnels for its links. Each router selects the best protection tunnels for its links and when it detects an interdomain link failure, it immediately encapsulates the packets to send them through the protection tunnel. Our solution is applicable for the links between large transit ISPs and also for the links between multi-homed stub networks and their providers. Furthermore, we show that transient forwarding loops (and thus the corresponding packet losses) can be avoided during the routing convergence that follows the deactivation of a protection tunnel in BGP/MPLS VPNs and in IP networks using encapsulation.

[1]  Nick Feamster,et al.  Measuring the effects of internet path faults on reactive routing , 2003, SIGMETRICS '03.

[2]  Yakov Rekhter,et al.  BGP Extended Communities Attribute , 2006, RFC.

[3]  Danny McPherson,et al.  Practical BGP , 2004 .

[4]  Olivier Bonaventure,et al.  Avoiding transient loops during IGP convergence in IP networks , 2005, Proceedings IEEE 24th Annual Joint Conference of the IEEE Computer and Communications Societies..

[5]  Martina Zitterbart,et al.  Fast scoped rerouting for BGP , 2003, The 11th IEEE International Conference on Networks, 2003. ICON2003..

[6]  Ignacio Goyret,et al.  Layer Two Tunneling Protocol - Version 3 (L2TPv3) , 2005, RFC.

[7]  Piet Demeester,et al.  Network Recovery: Protection and Restoration of Optical, SONET-SDH, IP, and MPLS , 2004 .

[8]  Anja Feldmann,et al.  Locating internet routing instabilities , 2004, SIGCOMM 2004.

[9]  Krishna P. Gummadi,et al.  Improving the Reliability of Internet Paths with One-hop Source Routing , 2004, OSDI.

[10]  Yakov Rekhter,et al.  Application of the Border Gateway Protocol in the Internet , 1995, RFC.

[11]  Stewart Bryant,et al.  Pseudo Wire Emulation Edge-to-Edge (PWE3) Architecture , 2005, RFC.

[12]  Daniel Massey,et al.  BGP-RCN: improving BGP convergence through root cause notification , 2005, Comput. Networks.

[13]  Barry Raveendran Greene,et al.  Cisco ISP Essentials , 2002 .

[14]  Yin Zhang,et al.  BGP routing stability of popular destinations , 2002, IMW '02.

[15]  A. Kirstadter,et al.  Improving the resilience in IP networks , 2003, Workshop on High Performance Switching and Routing, 2003, HPSR..

[16]  Lixin Gao,et al.  A measurement study on the impact of routing events on end-to-end internet path performance , 2006, SIGCOMM.

[17]  Yakov Rekhter,et al.  BGP/MPLS VPNs , 1999, RFC.

[18]  Dave Katz,et al.  Application of the Border Gateway Protocol in the Internet , 1990, RFC.

[19]  Nick Feamster,et al.  BorderGuard: detecting cold potatoes from peers , 2004, IMC '04.

[20]  Olivier Bonaventure,et al.  Achieving Sub-50 Milliseconds Recovery Upon BGP Peering Link Failures , 2007, IEEE/ACM Transactions on Networking.

[21]  Lixin Gao,et al.  On Understanding Transient Interdomain Routing Failures , 2009, IEEE/ACM Transactions on Networking.

[22]  Olivier Bonaventure,et al.  Implications of the topological properties of Internet traffic on traffic engineering , 2004, SAC '04.

[23]  Yakov Rekhter,et al.  Encapsulating MPLS in IP or Generic Routing Encapsulation (GRE) , 2005, RFC.

[24]  Olivier Bonaventure,et al.  Achieving sub-second IGP convergence in large IP networks , 2005, CCRV.

[25]  Clarence Filsfils,et al.  Deploying diffserv in backbone networks for tight SLA control , 2005, IEEE Internet Computing.

[26]  Dan Pei,et al.  BGP convergence in virtual private networks , 2006, IMC '06.

[27]  Athina Markopoulou,et al.  Characterization of failures in an IP backbone , 2004, IEEE INFOCOM 2004.

[28]  Jaideep Chandrashekar,et al.  Limiting path exploration in BGP , 2005, Proceedings IEEE 24th Annual Joint Conference of the IEEE Computer and Communications Societies..

[29]  Yakov Rekhter,et al.  Mpls: Technology and Applications , 2000 .

[30]  Cristel Pelsser,et al.  Protection for inter-AS MPLS tunnels , 2003 .

[31]  Piotr Cholda,et al.  Network Recovery, Protection and Restoration of Optical, SONET-SDH, IP, and MPLS [Book Review] , 2005, IEEE Communications Magazine.

[32]  Jennifer Rexford,et al.  Stable internet routing without global coordination , 2001, TNET.

[33]  Yakov Rekhter Constructing intra-AS path segments for an inter-AS path , 1991, CCRV.

[34]  Christophe Diot,et al.  Detection and analysis of routing loops in packet traces , 2002, IMW '02.

[35]  Timothy G. Griffin,et al.  An experimental analysis of BGP convergence time , 2001, Proceedings Ninth International Conference on Network Protocols. ICNP 2001.

[36]  David Watson,et al.  Experiences with monitoring OSPF on a regional service provider network , 2003, 23rd International Conference on Distributed Computing Systems, 2003. Proceedings..