Identifying likely transmission pairs with pathogen sequence data using Kolmogorov Forward Equations; an application to M.bovis in cattle and badgers

Established methods for whole-genome-sequencing (WGS) technology allow for the detection of single-nucleotide polymorphisms (SNPs) in the pathogen genomes sourced from host samples. The information obtained can be used to track the pathogen’s evolution in time and potentially identify ‘who-infected-whom’ with unprecedented accuracy. Successful methods include ‘phylodynamic approaches’ that integrate evolutionary and epidemiological data. However, they are typically computationally intensive, require extensive data, and are best applied when there is a strong molecular clock signal and substantial pathogen diversity. To determine how much transmission information can be inferred when pathogen genetic diversity is low and metadata limited, we propose an analytical approach that combines pathogen WGS data and sampling times from infected hosts. It accounts for ‘between-scale’ processes, in particular within-host pathogen evolution and between-host transmission. We applied this to a well-characterised population with an endemic Mycobacterium bovis (the causative agent of bovine/zoonotic tuberculosis, bTB) infection. Our results show that, even with such limited data and low diversity, the computation of the transmission probability between host pairs can help discriminate between likely and unlikely infection pathways and therefore help to identify potential transmission networks, but can be sensitive to assumptions about within-host evolution.

[1]  Daniel J. Wilson,et al.  Combining genomics and epidemiology to analyse bi-directional transmission of Mycobacterium bovis in a multi-host system , 2019, eLife.

[2]  I. Comas,et al.  Large genomics datasets shed light on the evolution of the Mycobacterium tuberculosis complex. , 2019, Infection, genetics and evolution : journal of molecular epidemiology and evolutionary genetics in infectious diseases.

[3]  Soon-Chul Choi,et al.  A retrospective observational study , 2019 .

[4]  R. Kao,et al.  Disease management at the wildlife‐livestock interface: Using whole‐genome sequencing to study the role of elk in Mycobacterium bovis transmission in Michigan, USA , 2019, Molecular ecology.

[5]  Simon M Firestone,et al.  Reconstructing foot-and-mouth disease outbreaks: a methods comparison of transmission network models , 2019, Scientific Reports.

[6]  Thibaut Jombart,et al.  Bayesian inference of transmission chains using timing of symptoms, pathogen genomes and contact data , 2019, PLoS Comput. Biol..

[7]  C. Dubé,et al.  The spread of bovine tuberculosis in Canadian shared pastures: Data, model, and simulations , 2018, Transboundary and emerging diseases.

[8]  D. Mellor,et al.  “Feelings and Fitness” Not “Feelings or Fitness”–The Raison d'être of Conservation Welfare, Which Aligns Conservation and Animal Welfare Objectives , 2018, Front. Vet. Sci..

[9]  M. Schoenbaum,et al.  Whole Genome Sequencing of Mycobacterium bovis Isolated From Livestock in the United States, 1989–2018 , 2018, Front. Vet. Sci..

[10]  Ayman Habib,et al.  OpenSim: Simulating musculoskeletal dynamics and neuromuscular control to study human and animal movement , 2018, PLoS Comput. Biol..

[11]  Thibaut Jombart,et al.  When are pathogen genome sequences informative of transmission events? , 2018, PLoS pathogens.

[12]  Astrid Gall,et al.  PHYLOSCANNER: Inferring Transmission from Within- and Between-Host Pathogen Genetic Diversity , 2017, bioRxiv.

[13]  Nicola De Maio,et al.  Bayesian reconstruction of transmission within outbreaks using genomic variants , 2017, bioRxiv.

[14]  Christophe Fraser,et al.  Quantifying Transmission Heterogeneity Using Both Pathogen Phylogenies and Incidence Time Series , 2017, Molecular biology and evolution.

[15]  S. Harris,et al.  Using whole genome sequencing to investigate transmission in a multi-host system: bovine tuberculosis in New Zealand , 2017, BMC Genomics.

[16]  Ryan R. Wick,et al.  Unicycler: Resolving bacterial genome assemblies from short and long sequencing reads , 2016, bioRxiv.

[17]  Huldrych F. Günthard,et al.  Phylodynamics on local sexual contact networks , 2016, bioRxiv.

[18]  Lu Lu,et al.  Role for migratory wild birds in the global spread of avian influenza H5N8 , 2016, Science.

[19]  Xavier Didelot,et al.  Genomic Infectious Disease Epidemiology in Partially Sampled and Ongoing Outbreaks , 2016, bioRxiv.

[20]  N. Smith,et al.  Bovine TB surveillance in Great Britain in 2014 , 2016, Veterinary Record.

[21]  R. Kao,et al.  Use of bacterial whole-genome sequencing to investigate local persistence and spread in bovine tuberculosis , 2016, Epidemics.

[22]  Ingo Bulla,et al.  Phylogenetically resolving epidemiologic linkage , 2016, Proceedings of the National Academy of Sciences.

[23]  Roland R. Regoes,et al.  Investigating the Consequences of Interference between Multiple CD8+ T Cell Escape Mutations in Early HIV Infection , 2016, PLoS Comput. Biol..

[24]  Daniel A. Grear,et al.  Descriptive Epidemiology and Whole Genome Sequencing Analysis for an Outbreak of Bovine Tuberculosis in Beef Cattle and White-Tailed Deer in Northwestern Minnesota , 2016, PloS one.

[25]  Anders Lansner,et al.  Computing the Local Field Potential (LFP) from Integrate-and-Fire Network Models , 2015, PLoS Comput. Biol..

[26]  Gavin J. Gibson,et al.  A Systematic Bayesian Integration of Epidemiological and Genetic Data , 2015, PLoS Comput. Biol..

[27]  Luca Bolzoni,et al.  Epidemiological modelling for the assessment of bovine tuberculosis surveillance in the dairy farm network in Emilia-Romagna (Italy). , 2014, Epidemics.

[28]  O. Pybus,et al.  Measurably evolving pathogens in the genomic era. , 2015, Trends in ecology & evolution.

[29]  M. Craft Infectious disease transmission and contact networks in wildlife and livestock , 2015, Philosophical Transactions of the Royal Society B: Biological Sciences.

[30]  J. Álvarez,et al.  Bovine tuberculosis: within-herd transmission models to support and direct the decision-making process. , 2014, Research in veterinary science.

[31]  Ellen Brooks-Pollock,et al.  A dynamic model of bovine tuberculosis spread and control in Great Britain , 2014, Nature.

[32]  R. Kao,et al.  Estimating epidemiological parameters for bovine tuberculosis in British cattle using a Bayesian partial-likelihood approach , 2014, Proceedings of the Royal Society B: Biological Sciences.

[33]  David A. Rasmussen,et al.  Phylodynamic Inference for Structured Epidemiological Models , 2014, PLoS Comput. Biol..

[34]  Rowland R Kao,et al.  Supersize me: how whole-genome sequencing and big data are transforming epidemiology , 2014, Trends in Microbiology.

[35]  V. Arcus,et al.  Whole Genome Sequencing of Mycobacterium tuberculosis Reveals Slow Growth and Low Mutation Rates during Latent Infections in Humans , 2014, PloS one.

[36]  F. De Filippis,et al.  A Selected Core Microbiome Drives the Early Stages of Three Popular Italian Cheese Manufactures , 2014, PloS one.

[37]  Jacco Wallinga,et al.  Relating Phylogenetic Trees to Transmission Trees of Infectious Disease Outbreaks , 2013, Genetics.

[38]  A. Conlan,et al.  Age-dependent patterns of bovine tuberculosis in cattle , 2013, Veterinary Research.

[39]  M. Chambers,et al.  Long-term temporal trends and estimated transmission rates for Mycobacterium bovis infection in an undisturbed high-density badger (Meles meles) population , 2013, Epidemiology and Infection.

[40]  Stefan Niemann,et al.  Whole Genome Sequencing versus Traditional Genotyping for Investigation of a Mycobacterium tuberculosis Outbreak: A Longitudinal Molecular Epidemiological Study , 2013, PLoS medicine.

[41]  J. S. Batista,et al.  Pathogenesis of reproductive failure induced by Trypanosoma vivax in experimentally infected pregnant ewes , 2013, Veterinary Research.

[42]  R. Kao,et al.  Whole Genome Sequencing Reveals Local Transmission Patterns of Mycobacterium bovis in Sympatric Cattle and Badger Populations , 2012, PLoS pathogens.

[43]  Samuel Soubeyrand,et al.  A Bayesian Inference Framework to Reconstruct Transmission Trees Using Epidemiological and Genetic Data , 2012, PLoS Comput. Biol..

[44]  Y. Michalakis,et al.  Virus population bottlenecks during within-host progression and host-to-host transmission. , 2012, Current opinion in virology.

[45]  Ellen Brooks-Pollock,et al.  Estimating the Hidden Burden of Bovine Tuberculosis in Great Britain , 2012, PLoS Comput. Biol..

[46]  Julian Parkhill,et al.  Rapid whole-genome sequencing for investigation of a neonatal MRSA outbreak. , 2012, The New England journal of medicine.

[47]  J Wallinga,et al.  Unravelling transmission trees of infectious diseases by combining genetic and epidemiological data , 2012, Proceedings of the Royal Society B: Biological Sciences.

[48]  R. Agarwal,et al.  Chemotherapy and human chorionic gonadotropin concentrations 6 months after uterine evacuation of molar pregnancy: a retrospective cohort study , 2012, The Lancet.

[49]  David A. Rasmussen,et al.  Inference for Nonlinear Epidemiological Models Using Genealogies and Time Series , 2011, PLoS Comput. Biol..

[50]  M. Chase,et al.  Use of whole genome sequencing to estimate the mutation rate of Mycobacterium tuberculosis during latent infection , 2011, Nature Genetics.

[51]  N. Walker,et al.  Diagnostic Accuracy and Optimal Use of Three Tests for Tuberculosis in Live Badgers , 2010, PloS one.

[52]  Andrew Rambaut,et al.  Evolutionary analysis of the dynamics of viral infectious disease , 2009, Nature Reviews Genetics.

[53]  Craig Packer,et al.  Distinguishing epidemic waves from disease spillover in a wildlife population , 2009, Proceedings of the Royal Society B: Biological Sciences.

[54]  Matt J. Keeling,et al.  Representing the UK's cattle herd as static and dynamic networks , 2008, Proceedings of the Royal Society B: Biological Sciences.

[55]  Gaël Thébaud,et al.  Integrating genetic and epidemiological data to determine transmission pathways of foot-and-mouth disease virus , 2008, Proceedings of the Royal Society B: Biological Sciences.

[56]  K. Sharkey Deterministic epidemiological models at the individual level , 2008, Journal of mathematical biology.

[57]  M. Keeling,et al.  On methods for studying stochastic disease dynamics , 2008, Journal of The Royal Society Interface.

[58]  Rowland R Kao,et al.  Disease dynamics over very different time-scales: foot-and-mouth disease and scrapie on the network of livestock movements in the UK , 2007, Journal of The Royal Society Interface.

[59]  Alessandro Vespignani,et al.  Epidemic modeling in complex realities. , 2007, Comptes rendus biologies.

[60]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[61]  Katia Koelle,et al.  The effects of host contact network structure on pathogen diversity and strain structure. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[62]  A. Vespignani,et al.  The architecture of complex weighted networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[63]  Nico Stollenwerk,et al.  Meningitis, pathogenicity near criticality: the epidemiology of meningococcal disease as a model for accidental pathogens. , 2003, Journal of theoretical biology.

[64]  R R Kao,et al.  A model of bovine tuberculosis control in domesticated cattle herds , 1997, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[65]  N. Khardori Rapid Whole-Genome Sequencing for Investigation of a Neonatal MRSA Outbreak , 2012 .

[66]  Evon M. O. Abu-Taieh,et al.  Comparative Study , 2020, Definitions.

[67]  S. Hamilton,et al.  Epidemiological models. , 1976, Indian journal of public health.