outbreaker2: a modular platform for outbreak reconstruction

BackgroundReconstructing individual transmission events in an infectious disease outbreak can provide valuable information and help inform infection control policy. Recent years have seen considerable progress in the development of methodologies for reconstructing transmission chains using both epidemiological and genetic data. However, only a few of these methods have been implemented in software packages, and with little consideration for customisability and interoperability. Users are therefore limited to a small number of alternatives, incompatible tools with fixed functionality, or forced to develop their own algorithms at considerable personal effort.ResultsHere we present outbreaker2, a flexible framework for outbreak reconstruction. This R package re-implements and extends the original model introduced with outbreaker, but most importantly also provides a modular platform allowing users to specify custom models within an optimised inferential framework. As a proof of concept, we implement the within-host evolutionary model introduced with TransPhylo, which is very distinct from the original genetic model in outbreaker, and demonstrate how even complex model results can be successfully included with minimal effort.Conclusionsoutbreaker2 provides a valuable starting point for future outbreak reconstruction tools, and represents a unifying platform that promotes customisability and interoperability. Implemented in the R software, outbreaker2 joins a growing body of tools for outbreak analysis.

[1]  P. E. Kopp,et al.  Superspreading and the effect of individual variation on disease emergence , 2005, Nature.

[2]  Mirjam Kretzschmar,et al.  Infectious disease transmission as a forensic problem: who infected whom? , 2013, Journal of The Royal Society Interface.

[3]  Marc Lipsitch,et al.  Shared Genomic Variants: Identification of Transmission Routes Using Pathogen Deep-Sequence Data , 2017, American journal of epidemiology.

[4]  Christl A. Donnelly,et al.  Transmission intensity and impact of control policies on the foot and mouth epidemic in Great Britain , 2001, Nature.

[5]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[6]  Jukka Corander,et al.  Two-phase importance sampling for inference about transmission trees , 2014, Proceedings of the Royal Society B: Biological Sciences.

[7]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  J Wallinga,et al.  Unravelling transmission trees of infectious diseases by combining genetic and epidemiological data , 2012, Proceedings of the Royal Society B: Biological Sciences.

[9]  Tanja Stadler,et al.  Uncovering epidemiological dynamics in heterogeneous host populations using phylogenetic methods , 2013, Philosophical Transactions of the Royal Society B: Biological Sciences.

[10]  Simon Cauchemez,et al.  Chains of transmission and control of Ebola virus disease in Conakry, Guinea, in 2014: an observational study. , 2015, The Lancet. Infectious diseases.

[11]  Samuel Soubeyrand,et al.  A Bayesian approach for inferring the dynamics of partially observed endemic infectious diseases from space-time-genetic data , 2014, Proceedings of the Royal Society B: Biological Sciences.

[12]  Samuel Soubeyrand,et al.  A Bayesian Inference Framework to Reconstruct Transmission Trees Using Epidemiological and Genetic Data , 2012, PLoS Comput. Biol..

[13]  W. Team,et al.  West African Ebola Epidemic after One Year — Slowing but Not Yet under Control , 2015 .

[14]  S. Günther,et al.  Complete Genome Sequences of Three Ebola Virus Isolates from the 2014 Outbreak in West Africa , 2014, Genome Announcements.

[15]  Nicola De Maio,et al.  SCOTTI: Efficient Reconstruction of Transmission within Outbreaks with the Structured Coalescent , 2016, PLoS Comput. Biol..

[16]  Xavier Didelot,et al.  Genomic Infectious Disease Epidemiology in Partially Sampled and Ongoing Outbreaks , 2016, bioRxiv.

[17]  Xavier Didelot,et al.  Bayesian Inference of Infectious Disease Transmission from Whole-Genome Sequence Data , 2014, Molecular biology and evolution.

[18]  Katharina T. Huber,et al.  ape 3.0: New tools for distance-based phylogenetics and evolutionary analysis in R , 2012, Bioinform..

[19]  Theodore Kypraios,et al.  Reconstructing transmission trees for communicable diseases using densely sampled genetic data. , 2014, The annals of applied statistics.

[20]  T Jombart,et al.  Reconstructing disease outbreaks from genetic data: a graph approach , 2010, Heredity.

[21]  Jacco Wallinga,et al.  Relating Phylogenetic Trees to Transmission Trees of Infectious Disease Outbreaks , 2013, Genetics.

[22]  Stephan Günther,et al.  Emergence of Zaire Ebola virus disease in Guinea. , 2014, The New England journal of medicine.

[23]  Tom Britton,et al.  Molecular Infectious Disease Epidemiology: Survival Analysis and Algorithms Linking Phylogenies to Transmission Trees , 2015, PLoS Comput. Biol..

[24]  Xavier Didelot,et al.  Simultaneous inference of phylogenetic and transmission trees in infectious disease outbreaks , 2017, PLoS Comput. Biol..

[25]  Thibaut Jombart,et al.  outbreaker2: Bayesian Reconstruction of Disease Outbreaks by Combining Epidemiologic and Genomic Data , 2018 .

[26]  Rachel S. G. Sealfon,et al.  Genomic surveillance elucidates Ebola virus origin and transmission during the 2014 outbreak , 2014, Science.

[27]  Samuel Soubeyrand,et al.  Construction of semi-Markov genetic-space-time SEIR models and inference , 2016 .

[28]  Matthew Hall,et al.  Epidemic Reconstruction in a Phylogenetics Framework: Transmission Trees as Partitions of the Node Set , 2014, PLoS Comput. Biol..

[29]  W. Team Ebola Virus Disease in West Africa — The First 9 Months of the Epidemic and Forward Projections , 2014 .

[30]  W. K. Hastings,et al.  Monte Carlo Sampling Methods Using Markov Chains and Their Applications , 1970 .

[31]  Gaël Thébaud,et al.  Integrating genetic and epidemiological data to determine transmission pathways of foot-and-mouth disease virus , 2008, Proceedings of the Royal Society B: Biological Sciences.

[32]  Gavin J. Gibson,et al.  A Systematic Bayesian Integration of Epidemiological and Genetic Data , 2015, PLoS Comput. Biol..

[33]  Dirk Eddelbuettel,et al.  Rcpp: Seamless R and C++ Integration , 2011 .

[34]  Di Liu,et al.  Genetic diversity and evolutionary dynamics of Ebola virus in Sierra Leone , 2015, Nature.

[35]  J. Wallinga,et al.  Different Epidemic Curves for Severe Acute Respiratory Syndrome Reveal Similar Impacts of Control Measures , 2004, American journal of epidemiology.

[36]  M. Aldrin,et al.  Modelling the spread of infectious salmon anaemia among salmon farms based on seaway distances between farms and genetic relationships between infectious salmon anaemia virus isolates , 2011, Journal of The Royal Society Interface.

[37]  C. Althaus,et al.  Ebola superspreading , 2015, The Lancet Infectious Diseases.

[38]  W. Wong,et al.  The calculation of posterior distributions by data augmentation , 1987 .

[39]  Mikiko Senga,et al.  Ebola virus disease in West Africa--the first 9 months of the epidemic and forward projections. , 2014, The New England journal of medicine.