k-dense communities in the internet AS-level topology

Extracting a set of well connected subgraphs as communities from the Internet Autonomous System (AS) level topology graph is crucially important for a better understanding of the network structure and for designing new protocols. A huge number of community extraction methods have been proposed in the literature. In this paper we apply the k-dense algorithm as it represents a very interesting compromise between computational efficiency and precision. The paper provides two innovative contributions. The first is the application of the k-dense method to the Internet AS-level topology graph - obtained from the CAIDA, DIMES and IRL datasets - to identify well-connected communities and to analyze how these are connected to the rest of the graph. The second contribution relates to the study of the most well-connected communities with the support of two additional datasets: a geographical dataset (which lists, for each AS, the countries in which it has at least one geographical location) and the Internet eXchange Point dataset (which maintains, for each IXP, its geographical position and the list of its participants). We found that the k-max-dense community holds a central position in the Internet AS-level topology graph structure since its 101 ASes (less than the 0.3% of Internet ASes) are involved in more than 39% of all Internet connections. We also found that those ASes are connected to at least one IXP and have at least one geographical location in Europe (only 70.3% of them have at least one additional geographical location outside Europe).

[1]  Yong Tan,et al.  An Economic Analysis of Interconnection Arrangements Between Internet Backbone Providers , 2006, Oper. Res..

[2]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[3]  Jure Leskovec,et al.  Empirical comparison of algorithms for network community detection , 2010, WWW '10.

[4]  Walter Willinger,et al.  The workshop on internet topology (wit) report , 2006, CCRV.

[5]  Fergal Reid,et al.  Detecting highly overlapping community structure by greedy clique expansion , 2010, KDD 2010.

[6]  Alessandro Vespignani,et al.  K-core decomposition of Internet graphs: hierarchies, self-similarity and measurement biases , 2005, Networks Heterog. Media.

[7]  Stephen B. Seidman,et al.  Network structure and minimum degree , 1983 .

[8]  Cyril Gavoille,et al.  Routing in distributed networks: overview and open problems , 2001, SIGA.

[9]  C. Lee Giles,et al.  Self-Organization and Identification of Web Communities , 2002, Computer.

[10]  Chiara Orsini,et al.  C Consiglio Nazionale delle Ricerche The Impact of IXPs on the AS-level Topology Structure of the Internet , 2010 .

[11]  C. Bron,et al.  Algorithm 457: finding all cliques of an undirected graph , 1973 .

[12]  Michalis Faloutsos,et al.  Jellyfish: A conceptual model for the as Internet topology , 2006, Journal of Communications and Networks.

[13]  Alessandro Vespignani,et al.  Exploring networks with traceroute-like probes: theory and simulations , 2004, Theor. Comput. Sci..

[14]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[15]  Chiara Orsini,et al.  k-dense communities in the internet AS-level topology , 2011, 2011 Third International Conference on Communication Systems and Networks (COMSNETS 2011).

[16]  Cristopher Moore,et al.  On the bias of traceroute sampling: Or, power-law degree distributions in regular graphs , 2005, JACM.

[17]  Lixin Gao On inferring autonomous system relationships in the internet , 2001, TNET.

[18]  Mohammed El-Beltagy,et al.  Analyzing Internet Connectivity Data Using Modified k-shell Analysis , 2008 .

[19]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[20]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[21]  T. Vicsek,et al.  Uncovering the overlapping community structure of complex networks in nature and society , 2005, Nature.

[22]  Peng Xie,et al.  Sampling biases in IP topology measurements , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).

[23]  Xiaowei Yang,et al.  Compact routing on Internet-like graphs , 2003, IEEE INFOCOM 2004.

[24]  Brice Augustin,et al.  IXPs: mapped? , 2009, IMC '09.

[25]  Lenore Cowen,et al.  Compact Routing on Power Law Graphs with Additive Stretch , 2006, ALENEX.

[26]  Lixia Zhang,et al.  The (In)Completeness of the Observed Internet AS-level Structure , 2010, IEEE/ACM Transactions on Networking.

[27]  Yuval Shavitt,et al.  A model of Internet topology using k-shell decomposition , 2007, Proceedings of the National Academy of Sciences.

[28]  A. Arenas,et al.  Community detection in complex networks using extremal optimization. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[29]  Yuval Shavitt,et al.  A Structural Approach for PoP Geo-Location , 2010 .

[30]  Michalis Faloutsos,et al.  Lord of the links: a framework for discovering missing links in the internet topology , 2009, IEEE/ACM Trans. Netw..

[31]  Jari Saramäki,et al.  Characterizing the Community Structure of Complex Networks , 2010, PloS one.

[32]  M. Newman,et al.  Finding community structure in networks using the eigenvectors of matrices. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[33]  Kazumi Saito,et al.  Extracting Communities from Complex Networks by the k-Dense Method , 2008, IEICE Trans. Fundam. Electron. Commun. Comput. Sci..

[34]  Cristopher Moore,et al.  On the bias of traceroute sampling: or, power-law degree distributions in regular graphs , 2005, STOC '05.

[35]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[36]  Mark Newman,et al.  Detecting community structure in networks , 2004 .