Improving the reliability of inter-AS economic inferences through a hygiene phase on BGP data

Over the last few years researchers have tried to shed light on the economic features that drive the inter-domain routing of the Internet, by inferring economic inter-AS relationships from raw BGP data collected by research projects such as BGPmon, PCH, RIS and RouteViews. Although this kind of data contains spurious entries mostly caused by router misconfigurations on BGP border routers and showing up during BGP path exploration, none of the methodologies provide an adequate data hygiene phase, thus affecting the accuracy of the inferences drawn. In this paper we outline a new methodology that can purge a large amount of spurious routes from BGP raw data by leveraging on robust statistical concepts rather than on debatable thresholds. To quantify the performance of our methodology we apply an enhanced version of an existing economic tagging algorithm on non-cleaned and cleaned data respectively. We found that 42.01% of different AS paths advertised to BGP route collectors in July 2013 appear only in spurious routes and that, in the absence of an appropriate data hygiene phase, they can affect the accuracy of the economic inferences regarding about 8% of connections found in BGP raw data.

[1]  Anja Feldmann,et al.  Locating internet routing instabilities , 2004, SIGCOMM 2004.

[2]  Marina Thottan,et al.  An Online Mechanism for BGP Instability Detection and Analysis , 2009, IEEE Transactions on Computers.

[3]  ZhangLixia,et al.  Longitudinal study of BGP monitor session failures , 2010 .

[4]  Enrico Gregori,et al.  Inferring geography from BGP raw data , 2012, 2012 Proceedings IEEE INFOCOM Workshops.

[5]  Anja Feldmann,et al.  Building an AS-topology model that captures route diversity , 2006, SIGCOMM 2006.

[6]  Prasant Mohapatra,et al.  Characterization of BGP Recovery Time under Large-Scale Failures , 2006, 2006 IEEE International Conference on Communications.

[7]  Ramesh Govindan,et al.  An analysis of Internet inter-domain topology and route stability , 1997, Proceedings of INFOCOM '97.

[8]  Walter Willinger,et al.  10 Lessons from 10 Years of Measuring and Modeling the Internet's Autonomous Systems , 2011, IEEE Journal on Selected Areas in Communications.

[9]  Balachander Krishnamurthy,et al.  Towards an AS-to-organization map , 2010, IMC '10.

[10]  Ron Larson,et al.  Elementary Statistics: Picturing the World , 1999 .

[11]  Xin Zhao,et al.  Longitudinal study of BGP monitor session failures , 2010, CCRV.

[12]  Daniel Massey,et al.  Identifying BGP routing table transfers , 2005, MineNet '05.

[13]  Lixin Gao,et al.  On inferring autonomous system relationships in the Internet , 2000, Globecom '00 - IEEE. Global Telecommunications Conference. Conference Record (Cat. No.00CH37137).

[14]  Manish Karir,et al.  Multi-Threaded Routing Toolkit (MRT) Routing Information Export Format , 2011, RFC.

[15]  Jennifer Rexford,et al.  Inherently safe backup routing with BGP , 2001, Proceedings IEEE INFOCOM 2001. Conference on Computer Communications. Twentieth Annual Joint Conference of the IEEE Computer and Communications Society (Cat. No.01CH37213).

[16]  Daniel Massey,et al.  An analysis of convergence delay in path vector routing protocols , 2006, Comput. Networks.

[17]  Sergey Gorinsky,et al.  Obscure Giants: Detecting the Provider-Free ASes , 2012, Networking.

[18]  Geoff Huston,et al.  ISP Survival Guide: Strategies for Running a Competitive ISP , 1998 .

[19]  Enrico Gregori,et al.  On the incompleteness of the AS-level graph: a novel methodology for BGP route collector placement , 2012, Internet Measurement Conference.

[20]  Enrico Gregori,et al.  BGP and Inter-AS Economic Relationships , 2011, Networking.

[21]  Benjamin Hummel,et al.  Acyclic type-of-relationship problems on the internet: an experimental analysis , 2007, IMC '07.

[22]  A. Flavel,et al.  CleanBGP: verifying the consistency of BGP data , 2008, 2008 IEEE Internet Network Management Workshop (INM).

[23]  Jennifer Rexford,et al.  There's something about MRAI: Timing diversity can exponentially worsen BGP convergence , 2011, 2011 Proceedings IEEE INFOCOM.

[24]  Yin Zhang,et al.  BGP routing stability of popular destinations , 2002, IMW '02.

[25]  Walter Willinger,et al.  The (In)Completeness of the Observed Internet AS-level Structure , 2010, IEEE/ACM Transactions on Networking.

[26]  Ratul Mahajan,et al.  Understanding BGP misconfiguration , 2002, SIGCOMM 2002.

[27]  Dmitri V. Krioukov,et al.  Inferring AS Relationships: Dead End or Lively Beginning? , 2005, WEA.

[28]  G. Di Battista,et al.  Computing the types of the relationships between autonomous systems , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).

[30]  Daniel Massey,et al.  Collecting the internet AS-level topology , 2005, CCRV.

[31]  Lixia Zhang,et al.  Quantifying Path Exploration in the Internet , 2006, IEEE/ACM Transactions on Networking.

[32]  Grenville J. Armitage,et al.  A Technique for Reducing BGP Update Announcements through Path Exploration Damping , 2010, IEEE Journal on Selected Areas in Communications.

[33]  Olaf Maennel,et al.  Modeling BGP Table Fluctuations , 2007, ITC.

[34]  Abhijit Bose,et al.  Delayed Internet routing convergence , 2000, SIGCOMM.

[35]  Daniel Massey,et al.  Identifying BGP routing table transfers , 2011, Comput. Networks.

[36]  Randy H. Katz,et al.  Characterizing the Internet hierarchy from multiple vantage points , 2002, Proceedings.Twenty-First Annual Joint Conference of the IEEE Computer and Communications Societies.