Network comparison and the within-ensemble graph distance

Quantifying the differences between networks is a challenging and ever-present problem in network science. In recent years, a multitude of diverse, ad hoc solutions to this problem have been introduced. Here, we propose that simple and well-understood ensembles of random networks—such as Erdős–Rényi graphs, random geometric graphs, Watts–Strogatz graphs, the configuration model and preferential attachment networks—are natural benchmarks for network comparison methods. Moreover, we show that the expected distance between two networks independently sampled from a generative model is a useful property that encapsulates many key features of that model. To illustrate our results, we calculate this within-ensemble graph distance and related quantities for classic network models (and several parameterizations thereof) using 20 distance measures commonly used to compare graphs. The within-ensemble graph distance provides a new framework for developers of graph distances to better understand their creations and for practitioners to better choose an appropriate tool for their particular task.

[1]  M. Newman,et al.  Statistical mechanics of networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[2]  Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining , 2018, KDD.

[3]  Jianhua Lin,et al.  Divergence measures based on the Shannon entropy , 1991, IEEE Trans. Inf. Theory.

[4]  Elena Deza,et al.  Encyclopedia of Distances , 2014 .

[5]  Giulio Cimini,et al.  The statistical physics of real-world networks , 2018, Nature Reviews Physics.

[6]  Cleve B. Moler,et al.  Nineteen Dubious Ways to Compute the Exponential of a Matrix, Twenty-Five Years Later , 1978, SIAM Rev..

[7]  David K. Hammond,et al.  Graph diffusion distance: A difference measure for weighted graphs based on the graph Laplacian exponential kernel , 2013, 2013 IEEE Global Conference on Signal and Information Processing.

[8]  Z. Duan,et al.  GRAPH THEORY AND COMPLEX NETWORKS , 2008 .

[9]  Panos M. Pardalos,et al.  Quantification of network structural dissimilarities , 2017, Nature Communications.

[10]  J. Herskowitz,et al.  Proceedings of the National Academy of Sciences, USA , 1996, Current Biology.

[11]  M. E. J. Newman,et al.  Power laws, Pareto distributions and Zipf's law , 2005 .

[12]  Thilo Gross,et al.  All scale-free networks are sparse. , 2011, Physical review letters.

[13]  Richard W. Hamming,et al.  Error detecting and error correcting codes , 1950 .

[14]  D. S. Johnson,et al.  Proceedings of the twenty-first annual ACM symposium on Theory of computing , 1989, STOC 1989.

[15]  Petter Holme,et al.  Detecting sequences of system states in temporal networks , 2018, Scientific Reports.

[16]  Cesare Furlanello,et al.  The HIM glocal metric and kernel for network comparison and classification , 2012, 2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA).

[17]  Samir Chowdhury,et al.  Distances and Isomorphism between Networks and the Stability of Network Invariants , 2017, ArXiv.

[18]  M. Meilă Comparing clusterings---an information based distance , 2007 .

[19]  James P. Bagrow,et al.  An information-theoretic, all-scales approach to comparing networks , 2018, Applied Network Science.

[20]  Richard C. Wilson,et al.  Characterizing graph symmetries through quantum Jensen-Shannon divergence. , 2013, Physical review. E, Statistical, nonlinear, and soft matter physics.

[21]  Danai Koutra,et al.  NetSimile: A Scalable Approach to Size-Independent Network Similarity , 2012, ArXiv.

[22]  Miro Kraetzl,et al.  Graph distances using graph union , 2001, Pattern Recognit. Lett..

[23]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[24]  Peter Wills,et al.  Metrics for graph comparison: A practitioner’s guide , 2020, PloS one.

[25]  P. Bonacich Power and Centrality: A Family of Measures , 1987, American Journal of Sociology.

[26]  Ping Zhu,et al.  A study of graph spectra for comparing graphs and trees , 2008, Pattern Recognit..

[27]  George T. Cantwell,et al.  Message passing on networks with loops , 2019, Proceedings of the National Academy of Sciences.

[28]  Christos Faloutsos,et al.  It's who you know: graph mining using recursive structural features , 2011, KDD.

[29]  Bruce A. Reed,et al.  A Critical Point for Random Graphs with a Given Degree Sequence , 1995, Random Struct. Algorithms.

[30]  Martín Gómez Ravetti,et al.  Analyzing complex networks evolution through Information Theory quantifiers , 2011 .

[31]  Andrew Mellor,et al.  Graph Comparison via the Non-backtracking Spectrum , 2018, Physical review. E.

[32]  Manlio De Domenico,et al.  Spectral entropies as information-theoretic tools for complex network comparison , 2016, 1609.01214.

[33]  Cesare Furlanello,et al.  An introduction to spectral distances in networks , 2010, WIRN.

[34]  D. Garlaschelli,et al.  Maximum likelihood: extracting unbiased information from complex networks. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[35]  V. Paquet,et al.  A Practitioner’s Guide , 2018 .

[36]  Timothy LaRock,et al.  netrd: A library for network reconstruction and graph distances , 2020, ArXiv.

[37]  Stratis Ioannidis,et al.  A Family of Tractable Graph Distances , 2018, SDM.

[38]  S. Redner,et al.  Connectivity of growing random networks. , 2000, Physical review letters.

[39]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[40]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[41]  Alexander S Mikhailov,et al.  Evolutionary reconstruction of networks. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[42]  Yongtang Shi,et al.  Fifty years of graph matching, network alignment and network comparison , 2016, Inf. Sci..

[43]  Emmanuel Müller,et al.  NetLSD: Hearing the Shape of a Graph , 2018, KDD.

[44]  Mathew D. Penrose,et al.  Random Geometric Graphs , 2003 .

[45]  S. Holmes,et al.  Tracking network dynamics: A survey using graph distances , 2018, The Annals of Applied Statistics.

[46]  Stratis Ioannidis,et al.  A family of tractable graph metrics , 2019, Appl. Netw. Sci..

[47]  Victor Y. Pan,et al.  The complexity of the matrix eigenproblem , 1999, STOC '99.

[48]  Danai Koutra,et al.  DeltaCon: Principled Massive-Graph Similarity Function with Attribution , 2016, ACM Trans. Knowl. Discov. Data.

[49]  Tina Eliassi-Rad,et al.  Non-backtracking cycles: length spectrum theory and graph mining applications , 2019, Appl. Netw. Sci..

[50]  Dan Chen,et al.  Complex network comparison based on communicability sequence entropy. , 2018, Physical review. E.

[51]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[52]  Edwin R. Hancock,et al.  A quantum Jensen-Shannon graph kernel for unattributed graphs , 2015, Pattern Recognit..

[53]  Maarten van Steen,et al.  Graph Theory and Complex Networks: An Introduction , 2010 .

[54]  François G. Meyer,et al.  The Resistance Perturbation Distance: A Metric for the Analysis of Dynamic Networks , 2016, Discret. Appl. Math..

[55]  Alan M. Frieze,et al.  Random graphs , 2006, SODA '06.

[56]  E. M. Bollt,et al.  Portraits of complex networks , 2008 .

[57]  Béla Bollobás,et al.  A Probabilistic Proof of an Asymptotic Formula for the Number of Labelled Regular Graphs , 1980, Eur. J. Comb..

[58]  Horst Bunke,et al.  A graph distance metric based on the maximal common subgraph , 1998, Pattern Recognit. Lett..

[59]  Wilson A. Sutherland,et al.  Introduction to Metric and Topological Spaces , 1975 .

[60]  Xuelong Li,et al.  A survey of graph edit distance , 2010, Pattern Analysis and Applications.

[61]  Ieee Staff,et al.  2013 IEEE Global Conference on Signal and Information Processing (GlobalSIP) , 2013 .

[62]  J. Dall,et al.  Random geometric graphs. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[63]  M. Newman Power laws, Pareto distributions and Zipf's law , 2005 .