Different flavors of randomness: comparing random graph models with fixed degree sequences

When a structural characteristic of a network is measured, the observed value needs to be compared to its expected value in a random graph model to assess the statistical significance of its occurrence. The random graph model with which the observed graph is compared is chosen to be structurally similar to the real-world network in some aspects and totally random in all others. To make the analysis of the expected value amenable, the random graph model is also chosen to be as simple as possible. The most common random graph models maintain the degree sequence of the observed graph or at least approximate it. In cases where multi-edges and self-loops are not allowed, typically the fixed degree sequence model (FDSM) is used. Since it is computationally expensive, in this article, we discuss whether one of the following three approximative models can replace it: the configuration model, its simplified version (eCFG), and the mathematical approximation we term simple independence model. While the latter models are more scalable than the FDSM, we show that there are several networks for which they cannot be meaningfully applied. We investigate based on some examples whether, and if so in which cases, these approximating models can replace the computationally more involved FDSM.

[1]  Alan M. Frieze,et al.  Random graphs , 2006, SODA '06.

[2]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994, Structural analysis in the social sciences.

[3]  M. Newman,et al.  Mixing patterns in networks. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[4]  Paul Erdös,et al.  On random graphs, I , 1959 .

[5]  Aristides Gionis,et al.  Assessing data mining results via swap randomization , 2007, TKDD.

[6]  Johan A. Pouwelse,et al.  A network science perspective of a distributed reputation mechanism , 2013, 2013 IFIP Networking Conference.

[7]  Katharina Anna Zweig,et al.  How to Forget the Second Side of the Story: A New Method for the One-Mode Projection of Bipartite Graphs , 2010, 2010 International Conference on Advances in Social Networks Analysis and Mining.

[8]  G. Hooghiemstra,et al.  Universality for distances in power-law random graphs , 2008 .

[9]  Katharina Anna Zweig,et al.  Network Representations of Complex Data , 2014, Encyclopedia of Social Network Analysis and Mining.

[10]  Richard A. Brualdi,et al.  Algorithms for constructing (0, 1)-matrices with prescribed row and column sum vectors , 2006, Discret. Math..

[11]  M. Malumbres,et al.  Molecular Systems Biology 8; Article number 569; doi:10.1038/msb.2012.2 Citation: Molecular Systems Biology 8:569 , 2022 .

[12]  Mark Newman,et al.  Networks: An Introduction , 2010 .

[13]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[14]  Walter Ledermann,et al.  Handbook of applicable mathematics , 1980 .

[15]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[16]  Howard J. Hamilton,et al.  Interestingness measures for data mining: A survey , 2006, CSUR.

[17]  Michael Kaufmann,et al.  A systematic approach to the one-mode projection of bipartite graphs , 2011, Social Network Analysis and Mining.

[18]  R. Milo,et al.  Network motifs in integrated cellular networks of transcription-regulation and protein-protein interaction. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[19]  S. Shen-Orr,et al.  Network motifs in the transcriptional regulation network of Escherichia coli , 2002, Nature Genetics.

[20]  Yung-Pin Chen,et al.  An Application of Markov Chain Monte Carlo to Community Ecology , 2003, Am. Math. Mon..

[21]  M. Newman,et al.  Vertex similarity in networks. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[22]  S H Strogatz,et al.  Random graph models of social networks , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[23]  Bruce A. Reed,et al.  A Critical Point for Random Graphs with a Given Degree Sequence , 1995, Random Struct. Algorithms.

[24]  Remco van der Hofstad,et al.  Random Graphs and Complex Networks , 2016, Cambridge Series in Statistical and Probabilistic Mathematics.

[25]  Leonard M. Freeman,et al.  A set of measures of centrality based upon betweenness , 1977 .

[26]  Mason A. Porter,et al.  Comparing Community Structure to Characteristics in Online Collegiate Social Networks , 2008, SIAM Rev..

[27]  Alessandro Vespignani,et al.  Epidemic spreading in complex networks with degree correlations , 2003, cond-mat/0301149.

[28]  Béla Bollobás,et al.  The Diameter of a Scale-Free Random Graph , 2004, Comb..

[29]  A. Barabasi,et al.  The human disease network , 2007, Proceedings of the National Academy of Sciences.

[30]  S. Shen-Orr,et al.  Network motifs: simple building blocks of complex networks. , 2002, Science.

[31]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[32]  Remco van der Hofstad,et al.  Random Graphs and Complex Networks. Vol. II , 2014 .

[33]  M. DePamphilis,et al.  HUMAN DISEASE , 1957, The Ulster Medical Journal.

[34]  Stefan Wiemann,et al.  Global microRNA level regulation of EGFR-driven cell-cycle protein network in breast cancer , 2012, Molecular systems biology.

[35]  Tore Opsahl,et al.  Clustering in weighted networks , 2009, Soc. Networks.

[36]  John Scott What is social network analysis , 2010 .

[37]  N. Gotelli,et al.  NULL MODELS IN ECOLOGY , 1996 .

[38]  L. Katz,et al.  Measurement of the Tendency Toward Reciprocation of Choice , 1955 .

[39]  Luciano da Fontoura Costa,et al.  The role of centrality for the identification of influential spreaders in complex networks , 2014, Physical review. E, Statistical, nonlinear, and soft matter physics.

[40]  F. Chung,et al.  The average distances in random graphs with given expected degrees , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[41]  Jure Leskovec,et al.  Discovering social circles in ego networks , 2012, ACM Trans. Knowl. Discov. Data.

[42]  Peter Donnelly,et al.  Superfamilies of Evolved and Designed Networks , 2004 .