Phylodynamic Inference for Structured Epidemiological Models

Coalescent theory is routinely used to estimate past population dynamics and demographic parameters from genealogies. While early work in coalescent theory only considered simple demographic models, advances in theory have allowed for increasingly complex demographic scenarios to be considered. The success of this approach has lead to coalescent-based inference methods being applied to populations with rapidly changing population dynamics, including pathogens like RNA viruses. However, fitting epidemiological models to genealogies via coalescent models remains a challenging task, because pathogen populations often exhibit complex, nonlinear dynamics and are structured by multiple factors. Moreover, it often becomes necessary to consider stochastic variation in population dynamics when fitting such complex models to real data. Using recently developed structured coalescent models that accommodate complex population dynamics and population structure, we develop a statistical framework for fitting stochastic epidemiological models to genealogies. By combining particle filtering methods with Bayesian Markov chain Monte Carlo methods, we are able to fit a wide class of stochastic, nonlinear epidemiological models with different forms of population structure to genealogies. We demonstrate our framework using two structured epidemiological models: a model with disease progression between multiple stages of infection and a two-population model reflecting spatial structure. We apply the multi-stage model to HIV genealogies and show that the proposed method can be used to estimate the stage-specific transmission rates and prevalence of HIV. Finally, using the two-population model we explore how much information about population structure is contained in genealogies and what sample sizes are necessary to reliably infer parameters like migration rates.

[1]  O. Pybus,et al.  An integrated framework for the inference of viral population history from reconstructed genealogies. , 2000, Genetics.

[2]  Zih E N G Ya N,et al.  On the Best Evolutionary Rate for Phylogenetic Analysis , 1998 .

[3]  C. Andrieu,et al.  The pseudo-marginal approach for efficient Monte Carlo computations , 2009, 0903.5480.

[4]  M. Notohara,et al.  The coalescent and the genealogical process in geographically structured population , 1990, Journal of mathematical biology.

[5]  A. Doucet,et al.  Particle Markov chain Monte Carlo methods , 2010 .

[6]  Jacco Wallinga,et al.  Relating Phylogenetic Trees to Transmission Trees of Infectious Disease Outbreaks , 2013, Genetics.

[7]  Peter Beerli,et al.  Maximum likelihood estimation of a migration matrix and effective population sizes in n subpopulations by using a coalescent approach , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[8]  P Donnelly,et al.  Coalescents and genealogical structure under neutrality. , 1995, Annual review of genetics.

[9]  J. Felsenstein,et al.  Maximum-likelihood estimation of migration rates and effective population numbers in two populations using a coalescent approach. , 1999, Genetics.

[10]  Mary K. Kuhner,et al.  LAMARC 2.0: maximum likelihood and Bayesian estimation of population parameters , 2006, Bioinform..

[11]  E L Ionides,et al.  Inference for nonlinear dynamical systems , 2006, Proceedings of the National Academy of Sciences.

[12]  Erik M. Volz,et al.  Viral phylodynamics and the search for an ‘effective number of infections’ , 2010, Philosophical Transactions of the Royal Society B: Biological Sciences.

[13]  Forrest W. Crawford,et al.  Unifying the spatial epidemiology and molecular evolution of emerging epidemics , 2012, Proceedings of the National Academy of Sciences.

[14]  Katia Koelle,et al.  Rates of coalescence for common epidemiological models at equilibrium , 2012, Journal of The Royal Society Interface.

[15]  M. Keeling,et al.  Estimating spatial coupling in epidemiological systems: a mechanistic approach , 2002 .

[16]  Christopher Dye,et al.  Universal voluntary HIV testing with immediate antiretroviral therapy as a strategy for elimination of HIV transmission: a mathematical model , 2009, The Lancet.

[17]  Erik M. Volz,et al.  Simple Epidemiological Dynamics Explain Phylogenetic Clustering of HIV from Patients with Recent Infection , 2012, PLoS Comput. Biol..

[18]  A. Doucet,et al.  A Tutorial on Particle Filtering and Smoothing: Fifteen years later , 2008 .

[19]  Alexei J. Drummond,et al.  Bayesian Phylogeography Finds Its Roots , 2009, PLoS Comput. Biol..

[20]  P. Kaye Infectious diseases of humans: Dynamics and control , 1993 .

[21]  O. Pybus,et al.  Bayesian coalescent inference of past population dynamics from molecular sequences. , 2005, Molecular biology and evolution.

[22]  A. Rambaut,et al.  Episodic Sexual Transmission of HIV Revealed by Molecular Phylodynamics , 2008, PLoS medicine.

[23]  J. Kingman On the genealogy of large populations , 1982, Journal of Applied Probability.

[24]  M Slatkin,et al.  Genealogy of neutral genes in two partially isolated populations. , 1990, Theoretical population biology.

[25]  Christopher D Pilcher,et al.  Brief but efficient: acute HIV infection and the sexual transmission of HIV. , 2004, The Journal of infectious diseases.

[26]  Erik M. Volz,et al.  HIV-1 Transmission during Early Infection in Men Who Have Sex with Men: A Phylodynamic Analysis , 2013, PLoS medicine.

[27]  Katia Koelle,et al.  Reconciling Phylodynamics with Epidemiology: The Case of Dengue Virus in Southern Vietnam , 2013, Molecular biology and evolution.

[28]  J. Wakeley Coalescent Theory: An Introduction , 2008 .

[29]  E. Holmes,et al.  Inferring population history from molecular phylogenies. , 1995, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[30]  K. Strimmer,et al.  Exploring the demographic history of DNA sequences using the generalized skyline plot. , 2001, Molecular biology and evolution.

[31]  Daniel T Gillespie,et al.  Stochastic simulation of chemical kinetics. , 2007, Annual review of physical chemistry.

[32]  Neil J. Gordon,et al.  Editors: Sequential Monte Carlo Methods in Practice , 2001 .

[33]  M. Keeling,et al.  Modeling Infectious Diseases in Humans and Animals , 2007 .

[34]  Timothy J. Robinson,et al.  Sequential Monte Carlo Methods in Practice , 2003 .

[35]  O. Pybus,et al.  Unifying the Epidemiological and Evolutionary Dynamics of Pathogens , 2004, Science.

[36]  William C Miller,et al.  The role of acute and early HIV infection in the spread of HIV and implications for transmission prevention strategies in Lilongwe, Malawi: a modelling study , 2011, The Lancet.

[37]  Simon J. Godsill,et al.  An Overview of Existing Methods and Recent Advances in Sequential Monte Carlo , 2007, Proceedings of the IEEE.

[38]  Esther Fearnhill,et al.  Transmission Network Parameters Estimated From HIV Sequences for a Nationwide Epidemic , 2011, The Journal of infectious diseases.

[39]  Erik M. Volz,et al.  Complex Population Dynamics and the Coalescent Under Neutrality , 2012, Genetics.

[40]  Paul J. Birrell,et al.  Prospects of elimination of HIV with test-and-treat strategy , 2013, Proceedings of the National Academy of Sciences.

[41]  David A. Rasmussen,et al.  Inference for Nonlinear Epidemiological Models Using Genealogies and Time Series , 2011, PLoS Comput. Biol..

[42]  Brian G. Williams,et al.  HIV Treatment as Prevention: Debate and Commentary—Will Early Infection Compromise Treatment-as-Prevention Strategies? , 2012, PLoS medicine.

[43]  Sergei L. Kosakovsky Pond,et al.  Phylodynamics of Infectious Disease Epidemics , 2009, Genetics.

[44]  Daniel J. Wilson,et al.  Coalescent inference for infectious disease: meta-analysis of hepatitis C , 2013, Philosophical Transactions of the Royal Society B: Biological Sciences.

[45]  Jon A Yamato,et al.  Maximum likelihood estimation of population growth rates based on the coalescent. , 1998, Genetics.

[46]  Fan Zhang,et al.  Using HIV Diagnostic Data to Estimate HIV Incidence: Method and Simulation , 2011 .

[47]  Christophe Fraser,et al.  HIV-1 transmission, by stage of infection. , 2008, The Journal of infectious diseases.

[48]  A. Rambaut,et al.  BEAST: Bayesian evolutionary analysis by sampling trees , 2007, BMC Evolutionary Biology.

[49]  James O Lloyd-Smith,et al.  The Potential Impact of Male Circumcision on HIV in Sub-Saharan Africa , 2006, PLoS medicine.

[50]  Mirjam Kretzschmar,et al.  Joint Modeling of HCV and HIV Co-Infection among Injecting Drug Users in Italy and Spain Using Individual Cross-Sectional Data , 2011 .

[51]  Beda Joos,et al.  Estimating the basic reproductive number from viral sequence data. , 2012, Molecular biology and evolution.