Engineering Uniform Sampling of Graphs with a Prescribed Power-law Degree Sequence

We consider the following common network analysis problem: given a degree sequence d = (d1, . . . , dn) ∈ N return a uniform sample from the ensemble of all simple graphs with matching degrees. In practice, the problem is typically solved using Markov Chain Monte Carlo approaches, such as Edge-Switching or Curveball, even if no practical useful rigorous bounds are known on their mixing times. In contrast, Arman et al. sketch Inc-Powerlaw, a novel and much more involved algorithm capable of generating graphs for power-law bounded degree sequences with γ ' 2.88 in expected linear time. For the first time, we give a complete description of the algorithm and add novel switchings. To the best of our knowledge, our open-source implementation of Inc-Powerlaw is the first practical generator with rigorous uniformity guarantees for the aforementioned degree sequences. In an empirical investigation, we find that for small average-degrees Inc-Powerlaw is very efficient and generates graphs with one million nodes in less than a second. For larger average-degrees, parallelism can partially mitigate the increased running-time. 2012 ACM Subject Classification Mathematics of computing → Random graphs

[1]  Andrea Lancichinetti,et al.  Benchmarks for testing community detection algorithms on directed and weighted graphs with overlapping communities. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[2]  Alan M. Frieze,et al.  Random graphs , 2006, SODA '06.

[3]  Matthieu Latapy,et al.  Efficient and simple generation of random simple connected graphs with prescribed degree sequence , 2005, J. Complex Networks.

[4]  Edward A. Bender,et al.  The Asymptotic Number of Labeled Graphs with Given Degree Sequences , 1978, J. Comb. Theory A.

[5]  Christian Staudt,et al.  NetworKit: A tool suite for large-scale complex network analysis , 2014, Network Science.

[6]  Isabelle Stanton,et al.  Sampling Graphs with a Prescribed Joint Degree Distribution Using Markov Chains , 2011, ALENEX.

[7]  Daniel Lemire,et al.  Fast Random Integer Generation in an Interval , 2018, ACM Trans. Model. Comput. Simul..

[8]  M. Newman,et al.  On the uniform generation of random graphs with prescribed degree sequences , 2003, cond-mat/0312028.

[9]  Pu Gao,et al.  Uniform Generation of Random Regular Graphs , 2015, 2015 IEEE 56th Annual Symposium on Foundations of Computer Science.

[10]  Madhav V. Marathe,et al.  An Efficient and Scalable Algorithmic Method for Generating Large-Scale Random Graphs , 2016, SC16: International Conference for High Performance Computing, Networking, Storage and Analysis.

[11]  Béla Bollobás,et al.  A Probabilistic Proof of an Asymptotic Formula for the Number of Labelled Regular Graphs , 1980, Eur. J. Comb..

[12]  Nicholas C. Wormald,et al.  Generating Random Regular Graphs Quickly , 1999, Combinatorics, Probability and Computing.

[13]  Priya Mahadevan,et al.  Systematic topology analysis and generation using degree correlations , 2006, SIGCOMM.

[14]  Giovanni Strona,et al.  A fast and unbiased procedure to randomize ecological binary matrices with fixed row and column totals , 2014, Nature Communications.

[15]  Johannes Singler,et al.  The GNU libstdc++ parallel mode: software engineering considerations , 2008, IWMSE '08.

[16]  Mark Newman,et al.  Networks: An Introduction , 2010 .

[17]  N. Gotelli,et al.  NULL MODELS IN ECOLOGY , 1996 .

[18]  Martin E. Dyer,et al.  Sampling regular graphs and a peer-to-peer network , 2005, SODA '05.

[19]  Joel C. Miller,et al.  Efficient Generation of Networks with Given Expected Degrees , 2011, WAW.

[20]  Jennifer Neville,et al.  Scalable and exact sampling method for probabilistic generative graph models , 2018, Data Mining and Knowledge Discovery.

[21]  Ulrik Brandes,et al.  What is network science? , 2013, Network Science.

[22]  Pu Gao,et al.  Uniform generation of random graphs with power-law degree sequences , 2017, SODA.

[23]  F. Chung,et al.  Connected Components in Random Graphs with Given Expected Degree Sequences , 2002 .

[24]  David P. Rodgers,et al.  Improvements in multiprocessor system design , 1985, ISCA '85.

[25]  Svante Janson The Probability That a Random Multigraph is Simple , 2009, Comb. Probab. Comput..

[26]  N. Verhelst An Efficient MCMC Algorithm to Sample Binary Matrices with Fixed Marginals , 2008 .

[27]  Manuel Penschuck,et al.  I/O-Efficient Generation of Massive Graphs Following the LFR Benchmark , 2016, ALENEX.

[28]  Peter Sanders,et al.  Communication-Free Massively Distributed Graph Generation , 2017, 2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS).

[29]  James Y. Zhao Expand and Contract: Sampling graphs with given degrees and other combinatorial families , 2013, ArXiv.

[30]  Amin Saberi,et al.  A Sequential Algorithm for Generating Random Graphs , 2007, Algorithmica.

[31]  Engineering In-place (Shared-memory) Sorting Algorithms , 2020, ACM Transactions on Parallel Computing.

[32]  S. Shen-Orr,et al.  Networks Network Motifs : Simple Building Blocks of Complex , 2002 .

[33]  Madhav V. Marathe,et al.  Fast Parallel Algorithms for Edge-Switching to Achieve a Target Visit Rate in Heterogeneous Graphs , 2014, 2014 43rd International Conference on Parallel Processing.

[34]  Mark Jerrum,et al.  Fast Uniform Generation of Regular Graphs , 1990, Theor. Comput. Sci..

[35]  Pu Gao,et al.  Fast Uniform Generation of Random Graphs with Given Degree Sequences , 2019, 2019 IEEE 60th Annual Symposium on Foundations of Computer Science (FOCS).

[36]  Peter Sanders,et al.  Random Permutations on Distributed, External and Hierarchical Memory , 1998, Inf. Process. Lett..

[37]  Ryan A. Rossi,et al.  The Network Data Repository with Interactive Graph Analytics and Visualization , 2015, AAAI.

[38]  Katharina Anna Zweig,et al.  Different flavors of randomness: comparing random graph models with fixed degree sequences , 2015, Social Network Analysis and Mining.

[39]  S. Shen-Orr,et al.  Network motifs: simple building blocks of complex networks. , 2002, Science.

[40]  Catherine S. Greenhill The switch Markov chain for sampling irregular graphs (Extended Abstract) , 2014, SODA.

[41]  Ulrik Brandes,et al.  Recent Advances in Scalable Network Generation , 2020, ArXiv.

[42]  Ali Pinar,et al.  Are We There Yet? When to Stop a Markov Chain while Generating Random Graphs , 2012, WAW.

[43]  Christos Gkantsidis,et al.  The Markov Chain Simulation Method for Generating Connected Power Law Random Graphs , 2003, ALENEX.

[44]  Brendan D. McKay,et al.  Uniform Generation of Random Regular Graphs of Moderate Degree , 1990, J. Algorithms.

[45]  Ulrich Meyer,et al.  Parallel and I/O-efficient Randomisation of Massive Networks using Global Curveball Trades , 2018, ESA.

[46]  Van H. Vu,et al.  Generating Random Regular Graphs , 2003, STOC '03.

[47]  Prasad Tetali,et al.  Simple Markov-chain algorithms for generating bipartite graphs and tournaments , 1997, SODA '97.