China's Internet: Topology mapping and geolocating

We perform a large-scale topology mapping and geolocation study of China's Internet. To compensate for the small number of Chinese PlanetLab nodes and looking glass servers, we leverage several features specific to China's Internet, including the hierarchical structure of the major ISPs and the abundance of Internet data centers (IDCs). Using only 15 vantage points, we design a traceroute scheme that finds significantly more interfaces and links than iPlane while using significantly fewer traceroute probes. We then consider the problem of geolocating router interfaces and end hosts in China. We develop a heuristic for clustering the interface topology of a hierarchical ISP and apply it to the major Chinese ISPs. We show that the clustering heuristic geolocates router interfaces in significantly more detail and with greater accuracy than existing geoIP databases used in isolation, and that the resulting clusters expose the major ISPs' provincial structure. Finally, using the clustering heuristic, we propose a methodology for improving commercial geoIP databases.
