Construction of semi-Markov genetic-space-time SEIR models and inference

Identifying transmission links of an infectious disease through a host population is critical to understanding its epidemiology and informing measures for its control. Infected hosts close together in their locations and timings are often thought to be linked, but timing and locations alone are usually consistent with many different scenarios of who infected whom. To infer more reliably who-transmitted-to-whom over the course of a disease outbreak caused by a fast-evolving pathogen, pathogen genomic data have been combined with spatial and temporal data. However, the manner to combine these data remains today a modeling and statistical challenge. One of the approaches recently proposed is based on an extension of stochastic Susceptible-Exposed-Infectious-Removed (SEIR) models. In this article, we present this extension that combines (i) an individual-based, spatial, semi-Markov SEIR model for the spatio-temporal dynamics of the pathogen, and (ii) a Markovian evolutionary model for the temporal evolution of genetic sequences of the pathogen. The resulting model is a state-space model including latent vectors of high dimension. Then, we describe a new algorithm that allows an approximate Bayesian inference of model parameters and latent variables. Finally, the capacity of the estimation algorithm to reconstruct transmission trees (i.e. who infected whom) is assessed with a simulation study. We especially investigate how the inference method performs when only a fraction of pathogen genomic data is available.

[1]  Rowland R. Kao,et al.  The UK foot-and-mouth disease outbreak — the aftermath , 2004, Nature Reviews Microbiology.

[2]  N. Capote,et al.  Plum pox virus and the estimated costs associated with sharka disease , 2006 .

[3]  Nikolaos Limnios,et al.  Semi-Markov Chains and Hidden Semi-Markov Models toward Applications: Their Use in Reliability and DNA Analysis , 2008 .

[4]  C. Viboud,et al.  Explorer The genomic and epidemiological dynamics of human influenza A virus , 2016 .

[5]  J. Grandell Doubly stochastic Poisson processes , 1976 .

[6]  D. Brooks,et al.  Quantitative Phylogenetic Analysis in the 21st Century , 2007 .

[7]  M. Kimura,et al.  A model of evolutionary base substitutions and its application with special reference to rapid change of pseudogenes. , 1981, Genetics.

[8]  L. Brocchieri,et al.  Phylogenetic inferences from molecular sequences: review and critique. , 2001, Theoretical population biology.

[9]  Bryan D. Kolaczkowski,et al.  Robustness of Ancestral Sequence Reconstruction to Phylogenetic Uncertainty , 2010, Molecular biology and evolution.

[10]  David A. Rasmussen,et al.  Inference for Nonlinear Epidemiological Models Using Genealogies and Time Series , 2011, PLoS Comput. Biol..

[11]  Alexei J. Drummond,et al.  Bayesian Phylogeography Finds Its Roots , 2009, PLoS Comput. Biol..

[12]  Tanja Stadler,et al.  Uncovering epidemiological dynamics in heterogeneous host populations using phylogenetic methods , 2013, Philosophical Transactions of the Royal Society B: Biological Sciences.

[13]  Mike Steel,et al.  Links between maximum likelihood and maximum parsimony under a simple model of site substitution , 1997 .

[14]  C. Donnelly,et al.  Transmission dynamics and epidemiology of BSE in British cattle , 1996, Nature.

[15]  Byron Hall Bayesian Inference , 2011 .

[16]  Jonathan P. Bollback,et al.  Empirical and hierarchical Bayesian estimation of ancestral states. , 2001, Systematic biology.

[17]  J. Hartigan MINIMUM MUTATION FITS TO A GIVEN TREE , 1973 .

[18]  David Sankoff,et al.  Locating the vertices of a steiner tree in an arbitrary metric space , 1975, Math. Program..

[19]  M. Suchard,et al.  Phylogeography takes a relaxed random walk in continuous space and time. , 2010, Molecular biology and evolution.

[20]  Thibaut Jombart,et al.  outbreaker2: Bayesian Reconstruction of Disease Outbreaks by Combining Epidemiologic and Genomic Data , 2018 .

[21]  Tom Britton,et al.  Introduction to statistical inference for infectious diseases , 2014, 1411.3138.

[22]  J Wallinga,et al.  Unravelling transmission trees of infectious diseases by combining genetic and epidemiological data , 2012, Proceedings of the Royal Society B: Biological Sciences.

[23]  Matthew Hall,et al.  Epidemic Reconstruction in a Phylogenetics Framework: Transmission Trees as Partitions of the Node Set , 2014, PLoS Comput. Biol..

[24]  Craig Packer,et al.  Transmission Dynamics and Prospects for the Elimination of Canine Rabies , 2009, PLoS biology.

[25]  Forrest W. Crawford,et al.  Unifying the spatial epidemiology and molecular evolution of emerging epidemics , 2012, Proceedings of the National Academy of Sciences.

[26]  T. Gojobori,et al.  A Large Variation in the Rates of Synonymous Substitution for RNA Viruses and Its Relationship to a Diversity of Viral Infection and Transmission Modes , 2004, Molecular biology and evolution.

[27]  Marion Koopmans,et al.  Genetic data provide evidence for wind-mediated transmission of highly pathogenic avian influenza. , 2013, The Journal of infectious diseases.

[28]  Ulrike Goldschmidt,et al.  An Introduction To The Theory Of Point Processes , 2016 .

[29]  Samuel Soubeyrand,et al.  A Bayesian approach for inferring the dynamics of partially observed endemic infectious diseases from space-time-genetic data , 2014, Proceedings of the Royal Society B: Biological Sciences.

[30]  Samuel Soubeyrand,et al.  A Bayesian Inference Framework to Reconstruct Transmission Trees Using Epidemiological and Genetic Data , 2012, PLoS Comput. Biol..

[31]  C. Donnelly,et al.  Transmission dynamics and epidemiology of BSE in British cattle , 1997, Nature.

[32]  M. Kimura Estimation of evolutionary distances between homologous nucleotide sequences. , 1981, Proceedings of the National Academy of Sciences of the United States of America.

[33]  Edward C. Holmes,et al.  Rates of Molecular Evolution in RNA Viruses: A Quantitative Phylogenetic Analysis , 2002, Journal of Molecular Evolution.

[34]  H. Linder,et al.  Model uncertainty in ancestral area reconstruction: A parsimonious solution? , 2012 .

[35]  W. Fitch Toward Defining the Course of Evolution: Minimum Change for a Specific Tree Topology , 1971 .

[36]  Bryan Kolaczkowski,et al.  Performance of maximum parsimony and likelihood phylogenetics when evolution is heterogeneous , 2004, Nature.

[37]  A. Dassios,et al.  The Distribution of the Interval between Events of a Cox Process with Shot Noise Intensity , 2008 .