nosoi: A stochastic agent‐based transmission chain simulation framework in r

The transmission process of an infectious agent creates a connected chain of hosts linked by transmission events, known as a transmission chain. Reconstructing transmission chains remains a challenging endeavor, except in rare cases characterized by intense surveillance and epidemiological inquiry. Inference frameworks attempt to estimate or approximate these transmission chains but the accuracy and validity of such methods generally lack formal assessment on datasets for which the actual transmission chain was observed. We here introduce nosoi, an open-source R package that offers a complete, tunable, and expandable agent-based framework to simulate transmission chains under a wide range of epidemiological scenarios for single-host and dual-host epidemics. nosoi is accessible through GitHub and CRAN, and is accompanied by extensive documentation, providing help and practical examples to assist users in setting up their own simulations. Once infected, each host or agent can undergo a series of events during each time step, such as moving (between locations) or transmitting the infection, all of these being driven by user-specified rules or data, such as travel patterns between locations. nosoi is able to generate a multitude of epidemic scenarios, that can – for example – be used to validate a wide range of reconstruction methods, including epidemic modeling and phylodynamic analyses. nosoi also offers a comprehensive framework to leverage empirically acquired data, allowing the user to explore how variations in parameters can affect epidemic potential. Aside from research questions, nosoi can provide lecturers with a complete teaching tool to offer students a handson exploration of the dynamics of epidemiological processes and the factors that impact it. Because the package does not rely on mathematical formalism but uses a more intuitive algorithmic approach, even extensive changes of the entire model can be easily and quickly implemented.

[1]  Tanja Stadler,et al.  Uncovering epidemiological dynamics in heterogeneous host populations using phylogenetic methods , 2013, Philosophical Transactions of the Royal Society B: Biological Sciences.

[2]  Pedro M. Valero-Mora,et al.  ggplot2: Elegant Graphics for Data Analysis , 2010 .

[3]  Jacco Wallinga,et al.  Relating Phylogenetic Trees to Transmission Trees of Infectious Disease Outbreaks , 2013, Genetics.

[4]  Dong Xie,et al.  BEAST 2: A Software Platform for Bayesian Evolutionary Analysis , 2014, PLoS Comput. Biol..

[5]  Niema Moshiri,et al.  FAVITES: simultaneous simulation of transmission networks, phylogenetic trees and sequences , 2019, Bioinform..

[6]  Sebastián Duchêne,et al.  BEAST 2.5: An advanced software platform for Bayesian evolutionary analysis , 2018, bioRxiv.

[7]  Xavier Didelot,et al.  Bayesian Inference of Infectious Disease Transmission from Whole-Genome Sequence Data , 2014, Molecular biology and evolution.

[8]  Peter Dalgaard,et al.  R Development Core Team (2010): R: A language and environment for statistical computing , 2010 .

[9]  David K. Smith,et al.  ggtree: an r package for visualization and annotation of phylogenetic trees with their covariates and other associated data , 2017 .

[10]  Yi Guan,et al.  treeio: an R package for phylogenetic tree input and output with richly annotated and associated data. , 2019, Molecular biology and evolution.

[11]  Nicola De Maio,et al.  Bayesian reconstruction of transmission within outbreaks using genomic variants , 2017, bioRxiv.

[12]  Alexei J. Drummond,et al.  Bayesian Phylogeography Finds Its Roots , 2009, PLoS Comput. Biol..

[13]  Nicola De Maio,et al.  New Routes to Phylogeography: A Bayesian Structured Coalescent Approximation , 2015, PLoS genetics.

[14]  Thibaut Jombart,et al.  Bayesian inference of transmission chains using timing of symptoms, pathogen genomes and contact data , 2019, PLoS Comput. Biol..

[15]  Frederick Albert Matsen IV,et al.  A Format for Phylogenetic Placements , 2012, PloS one.

[16]  A. Nyamathi,et al.  A Current Review of Ebola Virus: Pathogenesis, Clinical Presentation, and Diagnostic Assessment , 2003, Biological research for nursing.

[17]  Daniel L. Ayres,et al.  Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10 , 2018, Virus evolution.

[18]  Christl A. Donnelly,et al.  A review of epidemiological parameters from Ebola outbreaks to inform early public health decision-making , 2015, Scientific Data.

[19]  Colin J. Worby,et al.  'SEEDY' (Simulation of Evolutionary and Epidemiological Dynamics): An R Package to Follow Accumulation of Within-Host Mutation in Pathogens , 2015, PloS one.

[20]  Moritz U G Kraemer,et al.  Emergence of the Asian lineage of Zika virus in Angola: an outbreak investigation , 2019, The Lancet. Infectious diseases.

[21]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[22]  Theodore Kypraios,et al.  Reconstructing transmission trees for communicable diseases using densely sampled genetic data. , 2014, The annals of applied statistics.

[23]  M. Suchard,et al.  Phylogeography takes a relaxed random walk in continuous space and time. , 2010, Molecular biology and evolution.

[24]  Guy Baele,et al.  πBUSS: a parallel BEAST/BEAGLE utility for sequence simulation under complex evolutionary scenarios , 2013, BMC Bioinformatics.

[25]  Kristof Theys,et al.  SANTA-SIM: simulating viral sequence evolution dynamics under selection and recombination , 2018, bioRxiv.

[26]  W. O. Kermack,et al.  A contribution to the mathematical theory of epidemics , 1927 .

[27]  Tanja Stadler,et al.  The Structured Coalescent and Its Approximations , 2016, bioRxiv.

[28]  Emily H. Chan,et al.  Global capacity for emerging infectious disease detection , 2010, Proceedings of the National Academy of Sciences.

[29]  Sebastian Lequime,et al.  Epidemiological significance of dengue virus genetic variation in mosquito infection dynamics , 2018, PLoS pathogens.

[30]  Ethan Romero-Severson,et al.  Timing and order of transmission events is not directly reflected in a pathogen phylogeny. , 2014, Molecular biology and evolution.

[31]  Olivier Gascuel,et al.  A Fast Likelihood Method to Reconstruct and Visualize Ancestral Scenarios , 2018, bioRxiv.

[32]  Trevor Bedford,et al.  MERS-CoV spillover at the camel-human interface , 2017, bioRxiv.

[33]  Rebecca Rose,et al.  SERAPHIM: studying environmental rasters and phylogenetically informed movements , 2016, Bioinform..

[34]  Thibaut Jombart,et al.  outbreaker2: a modular platform for outbreak reconstruction , 2018, BMC Bioinformatics.

[35]  Pieter Libin,et al.  SANTA-SIM: Simulating Viral Sequence Evolution Dynamics Under Selection and Recombination , 2018 .

[36]  Yi Guan,et al.  Two Methods for Mapping and Visualizing Associated Data on Phylogeny Using Ggtree. , 2018, Molecular biology and evolution.

[37]  Guy Baele,et al.  Emerging Concepts of Data Integration in Pathogen Phylodynamics , 2016, Systematic biology.

[38]  Guy Baele,et al.  Travel Surveillance and Genomics Uncover a Hidden Zika Outbreak during the Waning Epidemic , 2019, Cell.

[39]  Samuel Soubeyrand,et al.  A Bayesian approach for inferring the dynamics of partially observed endemic infectious diseases from space-time-genetic data , 2014, Proceedings of the Royal Society B: Biological Sciences.

[40]  Alison P Galvani,et al.  Characterizing risk of Ebola transmission based on frequency and type of case–contact exposures , 2017, Philosophical Transactions of the Royal Society B: Biological Sciences.

[41]  Guy Baele,et al.  Phylodynamic assessment of intervention strategies for the West African Ebola virus outbreak , 2018, Nature Communications.

[42]  Xavier Didelot,et al.  Genomic Infectious Disease Epidemiology in Partially Sampled and Ongoing Outbreaks , 2016, bioRxiv.