Beyond ranking nodes: Predicting epidemic outbreak sizes by network centralities

Identifying important nodes for disease spreading is a central topic in network epidemiology. We investigate how well the position of a node, characterized by standard network measures, can predict its epidemiological importance in any graph of a given number of nodes. This is in contrast to other studies that deal with the easier prediction problem of ranking nodes by their epidemic importance in given graphs. As a benchmark for epidemic importance, we calculate the exact expected outbreak size given a node as the source. We study exhaustively all graphs of a given size, so do not restrict ourselves to certain generative models for graphs, nor to graph data sets. Due to the large number of possible nonisomorphic graphs of a fixed size, we are limited to ten-node graphs. We find that combinations of two or more centralities are predictive (R2 scores of 0.91 or higher) even for the most difficult parameter values of the epidemic simulation. Typically, these successful combinations include one normalized spectral centrality (such as PageRank or Katz centrality) and one measure that is sensitive to the number of edges in the graph.

[1]  Joel C. Miller,et al.  Effective vaccination strategies for realistic social networks , 2007 .

[2]  Dawei Zhao,et al.  Statistical physics of vaccination , 2016, ArXiv.

[3]  Piet Van Mieghem,et al.  Epidemic processes in complex networks , 2014, ArXiv.

[4]  Gemma C. Garriga,et al.  Permutation Tests for Studying Classifier Performance , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[5]  Mile Šikić,et al.  Epidemic centrality — is there an underestimated epidemic impact of network peripheral nodes? , 2011, 1110.2558.

[6]  Dylan B. George,et al.  Using network properties to predict disease dynamics on human contact networks , 2011, Proceedings of the Royal Society B: Biological Sciences.

[7]  Kathleen M. Carley,et al.  On the robustness of centrality measures under conditions of imperfect data , 2006, Soc. Networks.

[8]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[9]  Stefan Richter,et al.  Centrality Indices , 2004, Network Analysis.

[10]  Jari Saramäki,et al.  Ranking influential spreaders is an ill-defined problem , 2017, EPL (Europhysics Letters).

[11]  Petter Holme,et al.  Three faces of node importance in network epidemiology: Exact results for small graphs , 2017, Physical review. E.

[12]  Petter Holme,et al.  Efficient local strategies for vaccination and network attack , 2004, q-bio/0403021.

[13]  A. Barrat,et al.  Dynamical Patterns of Cattle Trade Movements , 2011, PloS one.

[14]  Brendan D. McKay,et al.  Practical graph isomorphism, II , 2013, J. Symb. Comput..

[15]  C. Viboud,et al.  Mathematical models to characterize early epidemic growth: A review. , 2016, Physics of life reviews.

[16]  Kevin G. Stanley,et al.  Design and methods of a social network isolation study for reducing respiratory infection transmission: The eX-FLU cluster randomized trial , 2016, Epidemics.

[17]  Ellen Brooks-Pollock,et al.  Epidemic predictions in an imperfect world: modelling disease spread with partial data , 2015, Proceedings of the Royal Society B: Biological Sciences.

[18]  Caterina De Bacco,et al.  Sampling on networks: estimating eigenvector centrality on incomplete graphs , 2019, ArXiv.

[19]  Caterina De Bacco,et al.  Sampling on Networks: Estimating Eigenvector Centrality on Incomplete Networks , 2019, COMPLEX NETWORKS.

[20]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[21]  Lev Muchnik,et al.  Identifying influential spreaders in complex networks , 2010, 1001.5285.

[22]  Luciano da Fontoura Costa,et al.  The role of centrality for the identification of influential spreaders in complex networks , 2014, Physical review. E, Statistical, nonlinear, and soft matter physics.

[23]  J. Giesecke,et al.  Modern Infectious Disease Epidemiology , 1994 .

[24]  A. J. Hall Infectious diseases of humans: R. M. Anderson & R. M. May. Oxford etc.: Oxford University Press, 1991. viii + 757 pp. Price £50. ISBN 0-19-854599-1 , 1992 .

[25]  N. Christakis,et al.  Social Network Sensors for Early Detection of Contagious Outbreaks , 2010, PloS one.

[26]  Herbert W. Hethcote,et al.  The Mathematics of Infectious Diseases , 2000, SIAM Rev..

[27]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[28]  Éva Tardos,et al.  Maximizing the Spread of Influence through a Social Network , 2015, Theory Comput..

[29]  Alessandro Vespignani,et al.  The role of the airline transportation network in the prediction and predictability of global epidemics , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[30]  Giovanni Petri,et al.  On the predictability of infectious disease outbreaks , 2017, Nature Communications.

[31]  M. Ožana Incipient spanning cluster on small-world networks , 2001 .

[32]  Joel C. Miller,et al.  Mathematics of Epidemics on Networks: From Exact to Approximate Models , 2017 .

[33]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[34]  D. J. Rogers,et al.  Global Transport Networks and Infectious Disease Spread , 2006, Advances in Parasitology.

[35]  Alain Barrat,et al.  Can co-location be used as a proxy for face-to-face contacts? , 2017, EPJ Data Science.

[36]  Claudio Castellano,et al.  Fundamental difference between superblockers and superspreaders in networks , 2016, Physical review. E.

[37]  Reka Albert,et al.  Mean-field theory for scale-free random networks , 1999 .

[38]  Lauren Ancel Meyers,et al.  Network-based vaccination improves prospects for disease control in wild chimpanzees , 2014, Journal of The Royal Society Interface.

[39]  Mark Newman,et al.  Networks: An Introduction , 2010 .

[40]  Bruce D. Spencer,et al.  Estimating network degree distributions under sampling: An inverse problem, with applications to monitoring social media networks , 2013, 1305.4977.

[41]  Dylan B. George,et al.  Big Data Opportunities for Global Infectious Disease Surveillance , 2013, PLoS medicine.

[42]  M. Salathé,et al.  A low-cost method to assess the epidemiological importance of individuals in controlling infectious disease outbreaks , 2013, BMC Medicine.

[43]  Duanbing Chen,et al.  Vital nodes identification in complex networks , 2016, ArXiv.

[44]  Geoffrey L. Chupp,et al.  Pathways Activated during Human Asthma Exacerbation as Revealed by Gene Expression Patterns in Blood , 2011, PloS one.

[45]  Joseph T. Lizier,et al.  Identifying influential spreaders and efficiently estimating infection numbers in epidemic models: A walk counting approach , 2012, 1203.0502.

[46]  Nicole Immorlica,et al.  Uncharted but not Uninfluenced: Influence Maximization with an Uncertain Network , 2017, AAMAS.