Centrality Measures for Graphons: Accounting for Uncertainty in Networks

As relational datasets modeled as graphs keep increasing in size and their data-acquisition is permeated by uncertainty, graph-based analysis techniques can become computationally and conceptually challenging. In particular, node centrality measures rely on the assumption that the graph is perfectly known – a premise not necessarily fulfilled for large, uncertain networks. Accordingly, centrality measures may fail to faithfully extract the importance of nodes in the presence of uncertainty. To mitigate these problems, we suggest a statistical approach based on graphon theory: we introduce formal definitions of centrality measures for graphons and establish their connections to classical graph centrality measures. A key advantage of this approach is that centrality measures defined at the modeling level of graphons are inherently robust to stochastic variations of specific graph realizations. Using the theory of linear integral operators, we define degree, eigenvector, Katz and PageRank centrality functions for graphons and establish concentration inequalities demonstrating that graphon centrality functions arise naturally as limits of their counterparts defined on sequences of graphs of increasing size. The same concentration inequalities also provide high-probability bounds between the graphon centrality functions and the centrality measures on any sampled graph, thereby establishing a measure of uncertainty of the measured centrality score.

[1]  Chandler Davis The rotation of eigenvectors by a perturbation , 1963 .

[2]  Marco Tomassini,et al.  Worldwide spreading of economic crisis , 2010, 1008.3893.

[3]  Andrei Z. Broder,et al.  Workshop on Algorithms and Models for the Web Graph , 2007, WAW.

[4]  V. Sós,et al.  Convergent Sequences of Dense Graphs II. Multiway Cuts and Statistical Physics , 2012 .

[5]  Noah E. Friedkin,et al.  Theoretical Foundations for Centrality Measures , 1991, American Journal of Sociology.

[6]  Gábor Lugosi,et al.  Concentration Inequalities - A Nonasymptotic Theory of Independence , 2013, Concentration Inequalities.

[7]  D. Whittaker,et al.  A Course in Functional Analysis , 1991, The Mathematical Gazette.

[8]  P. Diaconis,et al.  Graph limits and exchangeable random graphs , 2007, 0712.2749.

[9]  Harry Crane Dynamic random networks and their graph limits , 2016 .

[10]  V. Sós,et al.  Convergent Sequences of Dense Graphs I: Subgraph Frequencies, Metric Properties and Testing , 2007, math/0702004.

[11]  L. Devroye Non-Uniform Random Variate Generation , 1986 .

[12]  Edward R. Scheinerman,et al.  Random Dot Product Graph Models for Social Networks , 2007, WAW.

[13]  T. Snijders,et al.  Estimation and Prediction for Stochastic Blockmodels for Graphs with Latent Block Structure , 1997 .

[14]  R. M. Dudley,et al.  Real Analysis and Probability , 1989 .

[15]  Toka Diagana,et al.  Almost Periodic Stochastic Processes , 2011 .

[16]  P. Wolfe,et al.  Nonparametric graphon estimation , 2013, 1309.5936.

[17]  Kathryn B. Laskey,et al.  Stochastic blockmodels: First steps , 1983 .

[18]  Diana Baader,et al.  Theoretical Numerical Analysis A Functional Analysis Framework , 2016 .

[19]  Martin Bálek,et al.  Lovász , Large Networks and Graph Limits , 2013 .

[20]  Stéphane Robin,et al.  Variational Bayes model averaging for graphon functions and motif frequencies inference in W-graph models , 2013, Statistics and Computing.

[21]  Edoardo M. Airoldi,et al.  Nonparametric estimation and testing of exchangeable graph models , 2014, AISTATS.

[22]  Bal'azs Szegedy,et al.  The automorphism group of a graphon , 2014 .

[23]  Beom Jun Kim,et al.  Attack vulnerability of complex networks. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[24]  Prakash Balachandran,et al.  Inference of Network Summary Statistics Through Network Denoising , 2013, 1310.0423.

[25]  D. Aldous Representations for partially exchangeable arrays of random variables , 1981 .

[26]  Martin G. Everett,et al.  A Graph-theoretic perspective on centrality , 2006, Soc. Networks.

[27]  S. Boucheron,et al.  Concentration inequalities for order statistics , 2012, 1207.7209.

[28]  Balázs Szegedy Limits of kernel operators and the spectral regularity lemma , 2011, Eur. J. Comb..

[29]  E. Levina,et al.  Estimating network edge probabilities by neighborhood smoothing , 2015, 1509.08588.

[30]  Walter Dempsey,et al.  Edge exchangeable models for network data , 2016, ArXiv.

[31]  Michele Benzi,et al.  On the Limiting Behavior of Parameter-Dependent Network Centrality Measures , 2013, SIAM J. Matrix Anal. Appl..

[32]  Daniel M. Roy,et al.  Bayesian Models of Graphs, Arrays and Other Exchangeable Random Structures , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33]  Thomas W. Valente,et al.  The stability of centrality measures when networks are sampled , 2003, Soc. Networks.

[34]  Jari Saramäki,et al.  Path lengths, correlations, and centrality in temporal networks , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[35]  Krishna Dasaratha,et al.  Distributions of Centrality on Networks , 2017, Games Econ. Behav..

[36]  L. Baggett,et al.  Functional Analysis: A Primer , 1991 .

[37]  M. Bálek,et al.  Large Networks and Graph Limits , 2022 .

[38]  C. Borgs,et al.  Moments of Two-Variable Functions and the Uniqueness of Graph Limits , 2008, 0803.1244.

[39]  D. Shah,et al.  Unifying Framework for Crowd-sourcing via Graphon Estimation , 2017 .

[40]  Daniel M. Roy,et al.  The Class of Random Graphs Arising from Exchangeable Random Measures , 2015, ArXiv.

[41]  László Lovász,et al.  Large Networks and Graph Limits , 2012, Colloquium Publications.

[42]  P. Bickel,et al.  A nonparametric view of network models and Newman–Girvan and other modularities , 2009, Proceedings of the National Academy of Sciences.

[43]  A. Tsybakov,et al.  Oracle inequalities for network models and sparse graphon estimation , 2015, 1507.04118.

[44]  Johan Bollen,et al.  Co-authorship networks in the digital library research community , 2005, Inf. Process. Manag..

[45]  Davide Brunelli,et al.  Wireless Sensor Networks , 2012, Lecture Notes in Computer Science.

[46]  X ZhengAlice,et al.  A Survey of Statistical Network Models , 2010 .

[47]  L. Freeman Centrality in social networks conceptual clarification , 1978 .

[48]  Ian F. Akyildiz,et al.  Wireless sensor networks , 2007 .

[49]  S. Janson Graphons, cut norm and distance, couplings and rearrangements , 2010, 1009.2376.

[50]  Harrison H. Zhou,et al.  Rate-optimal graphon estimation , 2014, 1410.5837.

[51]  Edoardo M. Airoldi,et al.  Stochastic blockmodel approximation of a graphon: Theory and consistent estimation , 2013, NIPS.

[52]  Devavrat Shah,et al.  Reducing Crowdsourcing to Graphon Estimation, Statistically , 2017, AISTATS.

[53]  F. Browder Nonlinear functional analysis , 1970 .

[54]  Christian Borgs,et al.  Graphons: A Nonparametric Method to Model, Estimate, and Design Algorithms for Massive Networks , 2017, EC.

[55]  Weichen Wang,et al.  An $\ell_{\infty}$ Eigenvector Perturbation Bound and Its Application , 2017, J. Mach. Learn. Res..

[56]  David F. Gleich,et al.  PageRank beyond the Web , 2014, SIAM Rev..

[57]  László Lovász,et al.  Limits of dense graph sequences , 2004, J. Comb. Theory B.

[58]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[59]  Shuang Gao,et al.  The control of arbitrary size networks of linear systems via graphon limits: An initial investigation , 2017, 2017 IEEE 56th Annual Conference on Decision and Control (CDC).

[60]  Hyunjoong Kim,et al.  Functional Analysis I , 2017 .

[61]  Leonard M. Freeman,et al.  A set of measures of centrality based upon betweenness , 1977 .

[62]  Friedrich Sauvigny Partial Differential Equations 2: Functional Analytic Methods , 2006 .

[63]  Geert Leus,et al.  Signal processing on kernel-based random graphs , 2017, 2017 25th European Signal Processing Conference (EUSIPCO).

[64]  Peter Grindrod,et al.  A Matrix Iteration for Dynamic Network Summaries , 2013, SIAM Rev..

[65]  D. Bu,et al.  Topological structure analysis of the protein-protein interaction network in budding yeast. , 2003, Nucleic acids research.

[66]  Edoardo M. Airoldi,et al.  A Survey of Statistical Network Models , 2009, Found. Trends Mach. Learn..

[67]  P. Bonacich Power and Centrality: A Family of Measures , 1987, American Journal of Sociology.

[68]  P. Bickel,et al.  The method of moments and degree distributions for network models , 2011, 1202.5101.

[69]  Éva Tardos,et al.  Maximizing the Spread of Influence through a Social Network , 2015, Theory Comput..

[70]  Fan Chung Graham,et al.  On the Spectra of General Random Graphs , 2011, Electron. J. Comb..

[71]  Kathleen M. Carley,et al.  On the robustness of centrality measures under conditions of imperfect data , 2006, Soc. Networks.

[72]  W. Kahan,et al.  The Rotation of Eigenvectors by a Perturbation. III , 1970 .

[73]  Chao Gao,et al.  Optimal Estimation and Completion of Matrices with Biclustering Structures , 2016, J. Mach. Learn. Res..

[74]  Thomas L. Griffiths,et al.  Learning Systems of Concepts with an Infinite Relational Model , 2006, AAAI.

[75]  Marianna Pensky,et al.  Dynamic network models and graphon estimation , 2016, The Annals of Statistics.

[76]  Kristina Lerman,et al.  Centrality metric for dynamic networks , 2010, MLG '10.

[77]  Trevor Campbell,et al.  Edge-exchangeable graphs and sparsity , 2016, NIPS.

[78]  Christian Borgs,et al.  Private Graphon Estimation for Sparse Graphs , 2015, NIPS.