Resolving outbreak dynamics using Approximate Bayesian Computation for stochastic birth-death models

Earlier research has suggested that Approximate Bayesian Computation (ABC) makes it possible to fit simulator-based intractable birth-death models to investigate communicable disease outbreak dynamics with accuracy comparable to that of exact Bayesian methods. However, recent findings have indicated that key parameters such as the reproductive number R may remain poorly identifiable. Here we show that the identifiability issue can be resolved by taking into account disease-specific characteristics of the transmission process in closer detail. Using tuberculosis (TB) in the San Francisco Bay area as a case-study, we consider the situation where the genotype data are generated as a mixture of three stochastic processes, each with their distinct dynamics and clear epidemiological interpretation. The ABC inference yields stable and accurate posterior inferences about outbreak dynamics from aggregated annual case data with genotype information. We also show that under the proposed model, the infectious population size can be reliably inferred from the data. The estimate is approximately two orders of magnitude smaller compared to assumptions made in the earlier ABC studies, and is much better aligned with epidemiological knowledge about active TB prevalence. Similarly, the reproductive number R related to the primary underlying transmission process is estimated to be nearly three-fold compared with the previous estimates, which has a substantial impact on the interpretation of the fitted outbreak model.

[1]  M. Blum Approximate Bayesian Computation: A Nonparametric Perspective , 2009, 0904.0635.

[2]  Andrew R. Francis,et al.  Using Approximate Bayesian Computation to Estimate Tuberculosis Transmission Parameters From Genotype Data , 2006, Genetics.

[3]  Hans R. Künsch,et al.  A simulated annealing approach to approximate Bayes computations , 2012, Statistics and Computing.

[4]  Mark M. Tanaka,et al.  Sequential Monte Carlo without likelihoods , 2007, Proceedings of the National Academy of Sciences.

[5]  R. May,et al.  Infectious Diseases of Humans: Dynamics and Control , 1991, Annals of Internal Medicine.

[6]  D. Balding,et al.  Statistical Applications in Genetics and Molecular Biology On Optimal Selection of Summary Statistics for Approximate Bayesian Computation , 2011 .

[7]  Andrew R. Francis,et al.  The epidemiological fitness cost of drug resistance in Mycobacterium tuberculosis , 2009, Proceedings of the National Academy of Sciences.

[8]  J. Møller Discussion on the paper by Feranhead and Prangle , 2012 .

[9]  L. Excoffier,et al.  Efficient Approximate Bayesian Computation Coupled With Markov Chain Monte Carlo Without Likelihood , 2009, Genetics.

[10]  Denys Pommeret,et al.  Likelihood-free parallel tempering , 2011, Stat. Comput..

[11]  T. Stadler Inferring Epidemiological Parameters on the Basis of Allele Frequencies , 2011, Genetics.

[12]  G. Schoolnik,et al.  The epidemiology of tuberculosis in San Francisco. A population-based study using conventional and molecular methods. , 1994, The New England journal of medicine.

[13]  M. Gutmann,et al.  Fundamentals and Recent Developments in Approximate Bayesian Computation , 2016, Systematic biology.

[14]  Jukka Corander,et al.  On the Identifiability of Transmission Dynamic Models for Infectious Diseases , 2015, Genetics.

[15]  Paul Fearnhead,et al.  Constructing summary statistics for approximate Bayesian computation: semi‐automatic approximate Bayesian computation , 2012 .

[16]  C. Sreeramareddy,et al.  Time delays in diagnosis of pulmonary tuberculosis: a systematic review of literature , 2009, BMC infectious diseases.

[17]  Tanja Stadler,et al.  Exact vs. Approximate Computation: Reconciling Different Estimates of Mycobacterium tuberculosis Epidemiological Parameters , 2014, Genetics.

[18]  Aki Vehtari,et al.  ELFI: Engine for Likelihood Free Inference , 2016, J. Mach. Learn. Res..

[19]  Arnaud Doucet,et al.  An adaptive sequential Monte Carlo method for approximate Bayesian computation , 2011, Statistics and Computing.

[20]  ELFI ELFI: Engine for Likelihood-Free Inference , 2018 .