Using multiple infection history to infer contact network structure

The structure of host interactions within a population shapes the spread of infectious diseases but contact patterns between hosts are difficult to access. We hypothesised that key properties of these patterns can be inferred by using multiple infections data from individual longitudinal follow-ups. To show this, we simulated multiple infections on a contact network in an unbiased way by implementing a non-Markovian extension of the Gillespie algorithm for a community of parasites spreading on this network. We then analysed the resulting individual infection time series in an original way by introducing the concept of ‘infection barcodes’ to represent the infection history in each host. We find that, depending on infection multiplicity and immunity assumptions, knowledge about the barcode topology makes it possible to recover key properties of the network topology and even of individual nodes. The combination of individual-based simulations and barcode analysis of infection histories opens promising perspectives for the study of infectious disease transmission networks. Significance Statement The way hosts interact with each other is known to shape epidemics spread. However, these interactions are difficult to infer, especially in human populations. Using recent developments in stochastic epidemiological modeling and barcode theory, we show that the diversity of infections each host has undergone over time contains key information about contact network between hosts. This means that longitudinal follow-ups of some individuals in a population can tell us how hosts are in contact with each other. It can also inform us on how connected a particular individual is. This opens new possibilities regarding the use of genetic diversity of infectious diseases in epidemiology.

[1]  S. Wacholder,et al.  Human papillomavirus infection with multiple types: pattern of coinfection and risk of cervical disease. , 2011, The Journal of infectious diseases.

[2]  Frédéric Chazal,et al.  An Introduction to Topological Data Analysis: Fundamental and Practical Aspects for Data Scientists , 2017, Frontiers in Artificial Intelligence.

[3]  C. Scoglio,et al.  Competitive epidemic spreading over arbitrary multilayer networks. , 2014, Physical review. E, Statistical, nonlinear, and soft matter physics.

[4]  Luis Mauricio Bini,et al.  Mantel test in population genetics , 2013, Genetics and molecular biology.

[5]  L. Braitman,et al.  Applied Longitudinal Data Analysis for Epidemiology: A Practical Guide , 2004, Annals of Internal Medicine.

[6]  A. Dobson,et al.  Patterns of macroparasite abundance and aggregation in wildlife populations: a quantitative review , 1995, Parasitology.

[7]  A L Lloyd,et al.  Realistic distributions of infectious periods in epidemic models: changing patterns of persistence and dynamics. , 2001, Theoretical population biology.

[8]  Joel C. Miller,et al.  Percolation and epidemics in random clustered networks. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[9]  M. Sofonea,et al.  Exposing the diversity of multiple infection patterns. , 2017, Journal of theoretical biology.

[10]  Dylan B. George,et al.  Using network properties to predict disease dynamics on human contact networks , 2011, Proceedings of the Royal Society B: Biological Sciences.

[11]  Andrew Read,et al.  Exposing malaria in-host diversity and estimating population diversity by capture-recapture using massively parallel pyrosequencing , 2010, Proceedings of the National Academy of Sciences.

[12]  Frank Diederich,et al.  Mathematical Epidemiology Of Infectious Diseases Model Building Analysis And Interpretation , 2016 .

[13]  Jos W. R. Twisk,et al.  Applied Longitudinal Data Analysis for Epidemiology: References , 2013 .

[14]  T. Day,et al.  Risk factors for the evolutionary emergence of pathogens , 2010, Journal of The Royal Society Interface.

[15]  Shweta Bansal,et al.  Eight challenges for network epidemic models. , 2015, Epidemics.

[16]  I. Kiss,et al.  Disease contact tracing in random and clustered networks , 2005, Proceedings of the Royal Society B: Biological Sciences.

[17]  Carmen Lía Murall,et al.  Detecting within-host interactions from genotype combination prevalence data , 2018, bioRxiv.

[18]  Raúl Toral,et al.  Simulating non-Markovian stochastic processes. , 2013, Physical review. E, Statistical, nonlinear, and soft matter physics.

[19]  Katia Koelle,et al.  The effects of host contact network structure on pathogen diversity and strain structure. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[20]  Sarah C. Goslee,et al.  The ecodist Package for Dissimilarity-based Analysis of Ecological Data , 2007 .

[21]  Aric Hagberg,et al.  Exploring Network Structure, Dynamics, and Function using NetworkX , 2008, Proceedings of the Python in Science Conference.

[22]  K Dietz,et al.  Distribution of survival times of deliberate Plasmodium falciparum infections in tertiary syphilis patients. , 2006, Transactions of the Royal Society of Tropical Medicine and Hygiene.

[23]  Huldrych F. Günthard,et al.  Inferring Epidemic Contact Structure from Phylogenetic Trees , 2012, PLoS Comput. Biol..

[24]  Roy M. Anderson,et al.  Transmission dynamics of HIV infection , 1987, Nature.

[25]  Hui Jiang,et al.  Comparative epidemiology of human infections with avian influenza A H7N9 and H5N1 viruses in China: a population-based study of laboratory-confirmed cases , 2013, The Lancet.

[26]  Herbert Edelsbrunner,et al.  Computational Topology - an Introduction , 2009 .

[27]  Leonidas J. Guibas,et al.  Persistence barcodes for shapes , 2004, SGP '04.

[28]  Gaël Varoquaux,et al.  Proceedings of the 20th Python in Science Conference 2021 (SciPy 2021), Virtual Conference, July 12 - July 18, 2021 , 2008, SciPy.

[29]  N. Maire,et al.  The distribution of Plasmodium falciparum infection durations. , 2011, Epidemics.

[30]  M E J Newman,et al.  Random graphs with clustering. , 2009, Physical review letters.

[31]  David Lazer,et al.  Inferring friendship network structure by using mobile phone data , 2009, Proceedings of the National Academy of Sciences.

[32]  Tanja S. Maier,et al.  Choosing and using diversity indices: insights for ecological applications from the German Biodiversity Exploratories , 2014, Ecology and evolution.

[33]  Jean-Marie Cornuet,et al.  ABC model choice via random forests , 2014, 1406.6288.

[34]  M. Newman Spread of epidemic disease on networks. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[35]  Alessandro Vespignani,et al.  Characterising two-pathogen competition in spatially structured environments , 2014, Scientific Reports.

[36]  Shashank Khandelwal,et al.  Exploring biological network structure with clustered random networks , 2009, BMC Bioinformatics.

[37]  P. Castle,et al.  A joint model of persistent human papilloma virus infection and cervical cancer risk: implications for cervical cancer screening , 2015, Journal of the Royal Statistical Society. Series A,.

[38]  Jacques Ravel,et al.  Interplay between the temporal dynamics of the vaginal microbiota and human papillomavirus detection. , 2014, The Journal of infectious diseases.

[39]  N. Mideo Parasite adaptations to within-host competition. , 2009, Trends in parasitology.

[40]  Martin A. Nowak,et al.  Evolution and emergence of infectious diseases in theoretical and real-world networks , 2015, Nature Communications.

[41]  D. Gillespie Exact Stochastic Simulation of Coupled Chemical Reactions , 1977 .

[42]  Huldrych F. Günthard,et al.  Phylodynamics on local sexual contact networks , 2016, bioRxiv.

[43]  Pierre-Yves Boëlle,et al.  Host contact dynamics shapes richness and dominance of pathogen strains , 2018, bioRxiv.

[44]  M E J Newman Assortative mixing in networks. , 2002, Physical review letters.

[45]  Cornelia Metzig,et al.  Phylogenies from dynamic networks , 2019, PLoS Comput. Biol..

[46]  P. Kaye Infectious diseases of humans: Dynamics and control , 1993 .

[47]  D. Gillespie A General Method for Numerically Simulating the Stochastic Time Evolution of Coupled Chemical Reactions , 1976 .

[48]  Gavin Simpson,et al.  Analogue Methods in Palaeoecology: Using the analogue Package , 2007 .

[49]  Alessandro Vespignani,et al.  Host Mobility Drives Pathogen Competition in Spatially Structured Populations , 2013, PLoS Comput. Biol..

[50]  K. Gough The estimation of latent and infectious periods , 1977 .

[51]  J. Wallinga,et al.  Capturing multiple-type interactions into practical predictors of type replacement following human papillomavirus vaccination , 2019, Philosophical Transactions of the Royal Society B.

[52]  M. Keeling,et al.  Modeling Infectious Diseases in Humans and Animals , 2007 .

[53]  M. Keeling,et al.  The effects of local spatial structure on epidemiological invasions , 1999, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[54]  Lara Savini,et al.  Predicting Epidemic Risk from Past Temporal Contact Data , 2014, PLoS Comput. Biol..

[55]  M. Keeling,et al.  Networks and epidemic models , 2005, Journal of The Royal Society Interface.

[56]  Alessandro Vespignani,et al.  The role of the airline transportation network in the prediction and predictability of global epidemics , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[57]  Norman T. J. Bailey ON ESTIMATING THE LATENT AND INFECTIOUS PERIODS OF MEASLES , 1956 .

[58]  Joel C Miller,et al.  Cocirculation of infectious diseases on networks. , 2012, Physical review. E, Statistical, nonlinear, and soft matter physics.

[59]  B. T. Grenfell,et al.  Disease Extinction and Community Size: Modeling the Persistence of Measles , 1997, Science.

[60]  B. Althouse,et al.  Complex dynamics of synergistic coinfections on realistically clustered networks , 2015, Proceedings of the National Academy of Sciences.

[61]  Carmen Lía Murall,et al.  Detecting within-host interactions from genotype combination prevalence data , 2019, Epidemics.

[62]  Martin Eichner,et al.  Transmission potential of smallpox: estimates based on detailed data from an outbreak. , 2003, American journal of epidemiology.

[63]  K Eames,et al.  Six challenges in measuring contact networks for use in modelling. , 2015, Epidemics.

[64]  T. Day,et al.  Time-varying and state-dependent recovery rates in epidemiological models , 2017, Infectious Disease Modelling.