OutbreakTools: A new platform for disease outbreak analysis using the R software

The investigation of infectious disease outbreaks relies on the analysis of increasingly complex and diverse data, which offer new prospects for gaining insights into disease transmission processes and informing public health policies. However, the potential of such data can only be harnessed using a number of different, complementary approaches and tools, and a unified platform for the analysis of disease outbreaks is still lacking. In this paper, we present the new R package OutbreakTools, which aims to provide a basis for outbreak data management and analysis in R. OutbreakTools is developed by a community of epidemiologists, statisticians, modellers and bioinformaticians, and implements classes and methods for storing, handling and visualizing outbreak data. It includes real and simulated outbreak datasets. Together with a number of tools for infectious disease epidemiology recently made available in R, OutbreakTools contributes to the emergence of a new, free and open-source platform for the analysis of disease outbreaks.

[1]  Emmanuel Paradis,et al.  pegas: an R package for population genetics with an integrated-modular approach , 2010, Bioinform..

[2]  Thibaut Jombart,et al.  adegenet: a R package for the multivariate analysis of genetic markers , 2008, Bioinform..

[3]  P. Cowpertwait,et al.  Introductory Time Series with R , 2009 .

[4]  J. Wallinga,et al.  Robust reconstruction and analysis of outbreak data: influenza A(H1N1)v transmission in a school-based population. , 2012, American journal of epidemiology.

[5]  Jeffrey W. Eaton,et al.  HIV Treatment as Prevention: Systematic Comparison of Mathematical Models of the Potential Impact of Antiretroviral Therapy on HIV Incidence in South Africa , 2012, PLoS medicine.

[6]  Carter T. Butts,et al.  network: A Package for Managing Relational Data in R , 2008 .

[7]  N. Ferguson,et al.  Transmission Parameters of the 2001 Foot and Mouth Epidemic in Great Britain , 2007, PloS one.

[8]  N. Ferguson,et al.  Role of social networks in shaping disease transmission during a community outbreak of 2009 H1N1 pandemic influenza , 2011, Proceedings of the National Academy of Sciences.

[9]  Thibaut Jombart,et al.  outbreaker2: Bayesian Reconstruction of Disease Outbreaks by Combining Epidemiologic and Genomic Data , 2018 .

[10]  J. Faraway Extending the Linear Model with R: Generalized Linear, Mixed Effects and Nonparametric Regression Models , 2005 .

[11]  R. Pebody,et al.  Pandemic (H1N1) 2009 virus outbreak in a school in London, April–May 2009: an observational study , 2009, Epidemiology and Infection.

[12]  T Jombart,et al.  Reconstructing disease outbreaks from genetic data: a graph approach , 2010, Heredity.

[13]  J Wallinga,et al.  Unravelling transmission trees of infectious diseases by combining genetic and epidemiological data , 2012, Proceedings of the Royal Society B: Biological Sciences.

[14]  Samuel Soubeyrand,et al.  A Bayesian Inference Framework to Reconstruct Transmission Trees Using Epidemiological and Genetic Data , 2012, PLoS Comput. Biol..

[15]  Julian Parkhill,et al.  Whole-genome sequencing for analysis of an outbreak of meticillin-resistant Staphylococcus aureus: a descriptive study , 2013, The Lancet. Infectious Diseases.

[16]  Pierre-Yves Boëlle,et al.  The R0 package: a toolbox to estimate reproduction numbers for epidemic outbreaks , 2012, BMC Medical Informatics and Decision Making.

[17]  Edzer Pebesma,et al.  Applied Spatial Data Analysis with R. Springer , 2008 .

[18]  R Core Team,et al.  Read Data Stored by Minitab, S, SAS, SPSS, Stata, Systat, Weka,dBase, ... , 2015 .

[19]  E. Holmes,et al.  Transmission of Equine Influenza Virus during an Outbreak Is Characterized by Frequent Mixed Infections and Loose Transmission Bottlenecks , 2012, PLoS pathogens.

[20]  Tao Zhang,et al.  Mutational dynamics of the SARS coronavirus in cell culture and human populations isolated in 2003 , 2004, BMC infectious diseases.

[21]  Thibaut Jombart,et al.  adegenet 1.3-1: new tools for the analysis of genome-wide SNP data , 2011, Bioinform..

[22]  Hadley Wickham,et al.  ggplot2 - Elegant Graphics for Data Analysis (2nd Edition) , 2017 .

[23]  J. Wallinga,et al.  Different Epidemic Curves for Severe Acute Respiratory Syndrome Reveal Similar Impacts of Control Measures , 2004, American journal of epidemiology.

[24]  Edzer J. Pebesma,et al.  Applied Spatial Data Analysis with R - Second Edition , 2008, Use R!.

[25]  Tanja Stadler,et al.  Uncovering epidemiological dynamics in heterogeneous host populations using phylogenetic methods , 2013, Philosophical Transactions of the Royal Society B: Biological Sciences.

[26]  Kurt Hornik,et al.  kernlab - An S4 Package for Kernel Methods in R , 2004 .

[27]  Mirjam Kretzschmar,et al.  Infectious disease transmission as a forensic problem: who infected whom? , 2013, Journal of The Royal Society Interface.

[28]  Korbinian Strimmer,et al.  APE: Analyses of Phylogenetics and Evolution in R language , 2004, Bioinform..

[29]  Anne-Béatrice Dufour,et al.  The ade4 Package: Implementing the Duality Diagram for Ecologists , 2007 .

[30]  James Truscott,et al.  Control of a highly pathogenic H5N1 avian influenza outbreak in the GB poultry flock , 2007, Proceedings of the Royal Society B: Biological Sciences.

[31]  Michael Höhle,et al.  surveillance: An R package for the monitoring of infectious diseases , 2007, Comput. Stat..

[32]  C. Fraser,et al.  A New Framework and Software to Estimate Time-Varying Reproduction Numbers During Epidemics , 2013, American journal of epidemiology.

[33]  Samuel Soubeyrand,et al.  A Bayesian approach for inferring the dynamics of partially observed endemic infectious diseases from space-time-genetic data , 2014, Proceedings of the Royal Society B: Biological Sciences.

[34]  Niel Hens,et al.  Practice of Epidemiology Robust Reconstruction and Analysis of Outbreak Data : Influenza A ( H 1 N 1 ) v Transmission in a School-based Population , 2012 .

[35]  Chris Beeley,et al.  Web Application Development with R Using Shiny , 2013 .

[36]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .