Shared Genomic Variants: Identification of Transmission Routes Using Pathogen Deep-Sequence Data

Abstract Sequencing pathogen samples during a communicable disease outbreak is becoming an increasingly common procedure in epidemiologic investigations. Identifying who infected whom sheds considerable light on transmission patterns, high-risk settings and subpopulations, and the effectiveness of infection control. Genomic data shed new light on transmission dynamics and can be used to identify clusters of individuals likely to be linked by direct transmission. However, identification of individual routes of infection via single genome samples typically remains uncertain. We investigated the potential of deep sequence data to provide greater resolution on transmission routes, via the identification of shared genomic variants. We assessed several easily implemented methods to identify transmission routes using both shared variants and genetic distance, demonstrating that shared variants can provide considerable additional information in most scenarios. While shared-variant approaches identify relatively few links in the presence of a small transmission bottleneck, these links are highly accurate. Furthermore, we propose a hybrid approach that also incorporates phylogenetic distance to provide greater resolution. We applied our methods to data collected during the 2014 Ebola outbreak, identifying several likely routes of transmission. Our study highlights the power of data from deep sequencing of pathogens as a component of outbreak investigation and epidemiologic analyses.

[1]  Freddy Hidalgo,et al.  Ebola Virus , 2018, Definitions.

[2]  Q. Songsheng,et al.  Microcalorimetric study of bacterial growth , 1988 .

[3]  W. Team Ebola Virus Disease in West Africa — The First 9 Months of the Epidemic and Forward Projections , 2014 .

[4]  Tim E A Peto,et al.  Assessment of Mycobacterium tuberculosis transmission in Oxfordshire, UK, 2007–12, with whole pathogen genome sequences: an observational study , 2014, The Lancet. Respiratory medicine.

[5]  Marshall Crumiller,et al.  Influenza A virus transmission bottlenecks are defined by infection route and recipient host. , 2014, Cell host & microbe.

[6]  Colin J. Worby,et al.  'SEEDY' (Simulation of Evolutionary and Epidemiological Dynamics): An R Package to Follow Accumulation of Within-Host Mutation in Pathogens , 2015, PloS one.

[7]  K. C. Zoon,et al.  Mutation rate and genotype variation of Ebola virus from Mali case sequences , 2015, Science.

[8]  Gaël Thébaud,et al.  Integrating genetic and epidemiological data to determine transmission pathways of foot-and-mouth disease virus , 2008, Proceedings of the Royal Society B: Biological Sciences.

[9]  Jacco Wallinga,et al.  Relating Phylogenetic Trees to Transmission Trees of Infectious Disease Outbreaks , 2013, Genetics.

[10]  Xavier Didelot,et al.  Bayesian Inference of Infectious Disease Transmission from Whole-Genome Sequence Data , 2014, Molecular biology and evolution.

[11]  Anna Rosa Garbuglia,et al.  Use of the Minimum Spanning Tree Model for Molecular Epidemiological Investigation of a Nosocomial Outbreak of Hepatitis C Virus Infection , 2004, Journal of Clinical Microbiology.

[12]  Timothy B. Stockwell,et al.  Quantifying influenza virus diversity and transmission in humans , 2016, Nature Genetics.

[13]  High-resolution genomic surveillance of 2014 ebolavirus using shared subclonal variants , 2014 .

[14]  M. Struelens,et al.  Epidemiologic typing and delineation of genetic relatedness of methicillin-resistant Staphylococcus aureus by macrorestriction analysis of genomic DNA by using pulsed-field gel electrophoresis , 1992, Journal of clinical microbiology.

[15]  E. Holmes,et al.  Inferring the inter-host transmission of influenza A virus using patterns of intra-host genetic variation , 2013, Proceedings of the Royal Society B: Biological Sciences.

[16]  Mikiko Senga,et al.  Ebola virus disease in West Africa--the first 9 months of the epidemic and forward projections. , 2014, The New England journal of medicine.

[17]  E. Dengremont,et al.  Statistical approach for comparison of the growth rates of five strains of Staphylococcus aureus , 1995, Applied and environmental microbiology.

[18]  Sergei L. Kosakovsky Pond,et al.  The global transmission network of HIV-1. , 2014, The Journal of infectious diseases.

[19]  Thibaut Jombart,et al.  outbreaker2: Bayesian Reconstruction of Disease Outbreaks by Combining Epidemiologic and Genomic Data , 2018 .

[20]  T Jombart,et al.  Reconstructing disease outbreaks from genetic data: a graph approach , 2010, Heredity.

[21]  Daniel Falush,et al.  Bacterial Population Genetics in Infectious Disease , 1994 .

[22]  Colin J. Worby,et al.  Within-Host Bacterial Diversity Hinders Accurate Reconstruction of Transmission Networks from Genomic Distance Data , 2014, PLoS Comput. Biol..

[23]  F. Balloux Demographic Influences on Bacterial Population Structure , 2010 .

[24]  Rachel S. G. Sealfon,et al.  Genomic surveillance elucidates Ebola virus origin and transmission during the 2014 outbreak , 2014, Science.

[25]  Peter Donnelly,et al.  Evolutionary dynamics of Staphylococcus aureus during progression from carriage to disease , 2012, Proceedings of the National Academy of Sciences.

[26]  Julian Parkhill,et al.  Capturing the cloud of diversity reveals complexity and heterogeneity of MRSA carriage, infection and transmission , 2015, Nature Communications.

[27]  James M. Musser,et al.  spa Typing Method for Discriminating among Staphylococcus aureus Isolates: Implications for Use of a Single Marker To Detect Genetic Micro- and Macrovariation , 2004, Journal of Clinical Microbiology.

[28]  Colin J. Worby,et al.  The Distribution of Pairwise Genetic Distances: A Tool for Investigating Disease Transmission , 2014, Genetics.

[29]  Tanja Stadler,et al.  Insights into the Early Epidemic Spread of Ebola in Sierra Leone Provided by Viral Sequence Data , 2014, PLoS currents.

[30]  U. Nübel,et al.  spa Typing of Staphylococcus aureus as a Frontline Tool in Epidemiological Typing , 2007, Journal of Clinical Microbiology.

[31]  Marc Lipsitch,et al.  Epidemiologic data and pathogen genome sequences: a powerful synergy for public health , 2014, Genome Biology.

[32]  Gábor Csárdi,et al.  The igraph software package for complex network research , 2006 .

[33]  Paolo Piazza,et al.  Microevolutionary analysis of Clostridium difficile genomes to investigate transmission , 2012, Genome Biology.

[34]  Evan S Snitkin,et al.  Tracking a Hospital Outbreak of Carbapenem-Resistant Klebsiella pneumoniae with Whole-Genome Sequencing , 2012, Science Translational Medicine.

[35]  Julian Parkhill,et al.  Evolution of MRSA During Hospital Transmission and Intercontinental Spread , 2010, Science.

[36]  Hui Li,et al.  Wide Variation in the Multiplicity of HIV-1 Infection among Injection Drug Users , 2010, Journal of Virology.

[37]  Eduardo P C Rocha,et al.  Comparisons of dN/dS are time dependent for closely related bacterial genomes. , 2006, Journal of theoretical biology.

[38]  Steven J. M. Jones,et al.  Whole-genome sequencing and social-network analysis of a tuberculosis outbreak. , 2011, The New England journal of medicine.

[39]  J Wallinga,et al.  Unravelling transmission trees of infectious diseases by combining genetic and epidemiological data , 2012, Proceedings of the Royal Society B: Biological Sciences.

[40]  Trevor Bedford,et al.  Ebola Virus Epidemiology, Transmission, and Evolution during Seven Months in Sierra Leone , 2015, Cell.

[41]  E. Holmes,et al.  Transmission of Equine Influenza Virus during an Outbreak Is Characterized by Frequent Mixed Infections and Loose Transmission Bottlenecks , 2012, PLoS pathogens.

[42]  Julian Parkhill,et al.  Inferring patient to patient transmission of Mycobacterium tuberculosis from whole genome sequencing data , 2013, BMC Infectious Diseases.

[43]  Kevin J. Emmett,et al.  High-resolution Genomic Surveillance of 2014 Ebolavirus Using Shared Subclonal Variants , 2014, bioRxiv.

[44]  Andrew Rambaut,et al.  Evolutionary analysis of the dynamics of viral infectious disease , 2009, Nature Reviews Genetics.

[45]  R. Swanstrom,et al.  Bottlenecks in HIV-1 transmission: insights from the study of founder viruses , 2015, Nature Reviews Microbiology.

[46]  J. Peiris,et al.  Viral genetic sequence variations in pandemic H1N1/2009 and seasonal H3N2 influenza viruses within an individual, a household and a community. , 2011, Journal of clinical virology : the official publication of the Pan American Society for Clinical Virology.

[47]  S. West,et al.  Mechanisms of Pathogenesis, Infective Dose and Virulence in Human Parasites , 2012, PLoS pathogens.

[48]  E. Holmes,et al.  Evolution of an Eurasian Avian-like Influenza Virus in Naïve and Vaccinated Pigs , 2012, PLoS pathogens.