Enhanced reconstruction of weighted networks from strengths and degrees

Network topology plays a key role in many phenomena, from the spreading of diseases to that of financial crises. Whenever the whole structure of a network is unknown, one must resort to reconstruction methods that identify the least biased ensemble of networks consistent with the partial information available. A challenging case, frequently encountered due to privacy issues in the analysis of interbank flows and Big Data, is when there is only local (node-specific) aggregate information available. For binary networks, the relevant ensemble is one where the degree (number of links) of each node is constrained to its observed value. However, for weighted networks the problem is much more complicated. While the naive approach prescribes to constrain the strengths (total link weights) of all nodes, recent counter-intuitive results suggest that in weighted networks the degrees are often more informative than the strengths. This implies that the reconstruction of weighted networks would be significantly enhanced by the specification of both strengths and degrees, a computationally hard and bias-prone procedure. Here we solve this problem by introducing an analytical and unbiased maximum-entropy method that works in the shortest possible time and does not require the explicit generation of reconstructed samples. We consider several real-world examples and show that, while the strengths alone give poor results, the additional knowledge of the degrees yields accurately reconstructed networks. Information-theoretic criteria rigorously confirm that the degree sequence, as soon as it is non-trivial, is irreducible to the strength sequence. Our results have strong implications for the analysis of motifs and communities and whenever the reconstructed ensemble is required as a null model to detect higher-order patterns.

[1]  Diego Garlaschelli,et al.  Generalized Bose-Fermi statistics and structural correlations in weighted networks. , 2008, Physical review letters.

[2]  G. Bianconi Entropy of network ensembles. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[3]  G. Caldarelli,et al.  Reconstructing a credit network , 2013, Nature Physics.

[4]  Matteo Marsili,et al.  Reconstruction of financial networks for robust estimation of systemic risk , 2011 .

[5]  Giorgio Fagiolo,et al.  Randomizing world trade. II. A weighted network analysis. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[6]  P. Killworth,et al.  Informant Accuracy in Social Network Data , 1976 .

[7]  D. Garlaschelli,et al.  Maximum likelihood: extracting unbiased information from complex networks. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[8]  Giorgio Fagiolo,et al.  Null models of economic networks: the case of the world trade web , 2011, 1112.2895.

[9]  Alessandro Vespignani,et al.  Dynamical Processes on Complex Networks , 2008 .

[10]  C. Lynch Big data: How do your data grow? , 2008, Nature.

[11]  M. Newman,et al.  Statistical mechanics of networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[12]  Roma,et al.  Fitness model for the Italian interbank money market. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[13]  M. Serrano,et al.  Weighted Configuration Model , 2005, cond-mat/0501750.

[14]  Diego Garlaschelli,et al.  Triadic motifs and dyadic self-organization in the World Trade Network , 2012, IWSOS.

[15]  G. Fagiolo Clustering in complex directed networks. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[16]  E S Roberts,et al.  Unbiased degree-preserving randomization of directed binary networks. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[17]  Diego Garlaschelli,et al.  Analytical maximum-likelihood method to detect patterns in real networks , 2011, 1103.0701.

[18]  G. Caldarelli,et al.  On the rich-club effect in dense and weighted networks , 2008, 0807.0793.

[19]  David R. Anderson,et al.  Model selection and multimodel inference : a practical information-theoretic approach , 2003 .

[20]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[21]  Mauro Gallegati,et al.  Random Digraphs with Given Expected Degree Sequences: A Model for Economic Networks , 2011 .

[22]  P. Killworth,et al.  INFORMANT ACCURACY IN SOCIAL NETWORK DATA II , 1977 .

[23]  A. Vespignani,et al.  The architecture of complex weighted networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[24]  S. Shen-Orr,et al.  Network motifs: simple building blocks of complex networks. , 2002, Science.

[25]  Diego Garlaschelli,et al.  Fitness-dependent topological properties of the world trade web. , 2004, Physical review letters.

[26]  Marián Boguñá,et al.  Correlations in weighted networks. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[27]  Ck Cheng,et al.  The Age of Big Data , 2015 .

[28]  K. Kaski,et al.  The International Trade Network: weighted network analysis and modelling , 2007, 0707.4343.

[29]  Giorgio Fagiolo,et al.  Randomizing world trade. I. A binary network analysis. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[30]  Simon J. Wells,et al.  Financial Interlinkages in the United Kingdom's Interbank Market and the Risk of Contagion , 2004 .

[31]  Guido Caldarelli,et al.  Bootstrapping Topological Properties and Systemic Risk of Complex Networks Using the Fitness Model , 2012, Journal of Statistical Physics.