Reconstructing transmission trees for communicable diseases using densely sampled genetic data.

Whole genome sequencing of pathogens from multiple hosts in an epidemic offers the potential to investigate who infected whom with unparalleled resolution, potentially yielding important insights into disease dynamics and the impact of control measures. We considered disease outbreaks in a setting with dense genomic sampling, and formulated stochastic epidemic models to investigate person-to-person transmission, based on observed genomic and epidemiological data. We constructed models in which the genetic distance between sampled genotypes depends on the epidemiological relationship between the hosts. A data augmented Markov chain Monte Carlo algorithm was used to sample over the transmission trees, providing a posterior probability for any given transmission route. We investigated the predictive performance of our methodology using simulated data, demonstrating high sensitivity and specificity, particularly for rapidly mutating pathogens with low transmissibility. We then analyzed data collected during an outbreak of methicillin-resistant Staphylococcus aureus in a hospital, identifying probable transmission routes and estimating epidemiological parameters. Our approach overcomes limitations of previous methods, providing a framework with the flexibility to allow for unobserved infection times, multiple independent introductions of the pathogen, and within-host genetic diversity, as well as allowing forward simulation.

[1]  Samuel Soubeyrand,et al.  A Bayesian Inference Framework to Reconstruct Transmission Trees Using Epidemiological and Genetic Data , 2012, PLoS Comput. Biol..

[2]  Julian Parkhill,et al.  Whole-genome sequencing for analysis of an outbreak of meticillin-resistant Staphylococcus aureus: a descriptive study , 2013, The Lancet. Infectious Diseases.

[3]  G. Roberts,et al.  Bayesian inference for partially observed stochastic epidemics , 1999 .

[4]  W. Wong,et al.  The calculation of posterior distributions by data augmentation , 1987 .

[5]  Stephan Harbarth,et al.  Health-care workers: source, vector, or victim of MRSA? , 2008, The Lancet. Infectious diseases.

[6]  Colin J. Worby,et al.  The Distribution of Pairwise Genetic Distances: A Tool for Investigating Disease Transmission , 2014, Genetics.

[7]  Jacco Wallinga,et al.  Relating Phylogenetic Trees to Transmission Trees of Infectious Disease Outbreaks , 2013, Genetics.

[8]  Samuel Soubeyrand,et al.  A Bayesian approach for inferring the dynamics of partially observed endemic infectious diseases from space-time-genetic data , 2014, Proceedings of the Royal Society B: Biological Sciences.

[9]  David J. Hand,et al.  ROC Curves for Continuous Data , 2009 .

[10]  Andrew Rambaut,et al.  Evolutionary analysis of the dynamics of viral infectious disease , 2009, Nature Reviews Genetics.

[11]  P. O’Neill,et al.  Estimating the Effectiveness of Isolation and Decolonization Measures in Reducing Transmission of Methicillin-resistant Staphylococcus aureus in Hospital General Wards , 2013, American journal of epidemiology.

[12]  B. Cooper,et al.  Preliminary analysis of the transmission dynamics of nosocomial infections: stochastic and management effects. , 1999, The Journal of hospital infection.

[13]  Thibaut Jombart,et al.  outbreaker2: Bayesian Reconstruction of Disease Outbreaks by Combining Epidemiologic and Genomic Data , 2018 .

[14]  Evan S Snitkin,et al.  Tracking a Hospital Outbreak of Carbapenem-Resistant Klebsiella pneumoniae with Whole-Genome Sequencing , 2012, Science Translational Medicine.

[15]  T Jombart,et al.  Reconstructing disease outbreaks from genetic data: a graph approach , 2010, Heredity.

[16]  Susan S. Huang,et al.  Assessing the role of undetected colonization and isolation precautions in reducing Methicillin-Resistant Staphylococcus aureus transmission in intensive care units , 2010, BMC infectious diseases.

[17]  Julian Parkhill,et al.  Evolution of MRSA During Hospital Transmission and Intercontinental Spread , 2010, Science.

[18]  Colin J. Worby,et al.  Within-Host Bacterial Diversity Hinders Accurate Reconstruction of Transmission Networks from Genomic Distance Data , 2014, PLoS Comput. Biol..

[19]  Sergei L. Kosakovsky Pond,et al.  Phylodynamics of Infectious Disease Epidemics , 2009, Genetics.

[20]  F. K. Gould,et al.  Development and Evaluation of a Chromogenic Agar Medium for Methicillin-Resistant Staphylococcus aureus , 2004, Journal of Clinical Microbiology.

[21]  G. A. Watterson On the number of segregating sites in genetical models without recombination. , 1975, Theoretical population biology.

[22]  G. Dziekan,et al.  Infection control as a major World Health Organization priority for developing countries. , 2008, The Journal of hospital infection.

[23]  O. Pybus,et al.  The Epidemic Behavior of the Hepatitis C Virus , 2001, Science.

[24]  Julian Parkhill,et al.  Rapid whole-genome sequencing for investigation of a neonatal MRSA outbreak. , 2012, The New England journal of medicine.

[25]  Jukka Corander,et al.  Two-phase importance sampling for inference about transmission trees , 2014, Proceedings of the Royal Society B: Biological Sciences.

[26]  Gaël Thébaud,et al.  Integrating genetic and epidemiological data to determine transmission pathways of foot-and-mouth disease virus , 2008, Proceedings of the Royal Society B: Biological Sciences.

[27]  Julian Parkhill,et al.  Inferring patient to patient transmission of Mycobacterium tuberculosis from whole genome sequencing data , 2013, BMC Infectious Diseases.

[28]  Susan A. Murphy,et al.  Monographs on statistics and applied probability , 1990 .

[29]  Steven J. M. Jones,et al.  Whole-genome sequencing and social-network analysis of a tuberculosis outbreak. , 2011, The New England journal of medicine.

[30]  J Wallinga,et al.  Unravelling transmission trees of infectious diseases by combining genetic and epidemiological data , 2012, Proceedings of the Royal Society B: Biological Sciences.

[31]  N. Khardori Rapid Whole-Genome Sequencing for Investigation of a Neonatal MRSA Outbreak , 2012 .

[32]  David A. Rasmussen,et al.  Inference for Nonlinear Epidemiological Models Using Genealogies and Time Series , 2011, PLoS Comput. Biol..

[33]  Ethan Romero-Severson,et al.  Timing and order of transmission events is not directly reflected in a pathogen phylogeny. , 2014, Molecular biology and evolution.