Forward Simulation Markov Chain Monte Carlo with Applications to Stochastic Epidemic Models

type="main" xml:id="sjos12111-abs-0001"> For many stochastic models, it is difficult to make inference about the model parameters because it is impossible to write down a tractable likelihood given the observed data. A common solution is data augmentation in a Markov chain Monte Carlo (MCMC) framework. However, there are statistical problems where this approach has proved infeasible but where simulation from the model is straightforward leading to the popularity of the approximate Bayesian computation algorithm. We introduce a forward simulation MCMC (fsMCMC) algorithm, which is primarily based upon simulation from the model. The fsMCMC algorithm formulates the simulation of the process explicitly as a data augmentation problem. By exploiting non-centred parameterizations, an efficient MCMC updating schema for the parameters and augmented data is introduced, whilst maintaining straightforward simulation from the model. The fsMCMC algorithm is successfully applied to two distinct epidemic models including a birth–death–mutation model that has only previously been analysed using approximate Bayesian computation methods.

[1]  A. Rényi On Measures of Entropy and Information , 1961 .

[2]  M Juchniewicz,et al.  [Epidemiology of tuberculosis]. , 1970, Pielegniarka i polozna.

[3]  M. Beaumont Estimation of population growth or decline in genetically monitored populations. , 2003, Genetics.

[4]  Ochs Cw The Epidemiology of Tuberculosis. , 1962 .

[5]  N G Becker,et al.  Inference for an epidemic when susceptibility varies. , 2001, Biostatistics.

[6]  Mark M. Tanaka,et al.  Sequential Monte Carlo without likelihoods , 2007, Proceedings of the National Academy of Sciences.

[7]  Simon R. White,et al.  Fast Approximate Bayesian Computation for discretely observed Markov models using a factorised posterior distribution , 2013, 1301.2975.

[8]  D. Gillespie A General Method for Numerically Simulating the Stochastic Time Evolution of Coupled Chemical Reactions , 1976 .

[9]  Peter Neal,et al.  Efficient likelihood-free Bayesian Computation for household epidemics , 2012, Stat. Comput..

[10]  Arnaud Doucet,et al.  An adaptive sequential Monte Carlo method for approximate Bayesian computation , 2011, Statistics and Computing.

[11]  J. Møller Discussion on the paper by Feranhead and Prangle , 2012 .

[12]  G. Schoolnik,et al.  The epidemiology of tuberculosis in San Francisco. A population-based study using conventional and molecular methods. , 1994, The New England journal of medicine.

[13]  N. Bailey,et al.  The mathematical theory of infectious diseases and its applications. 2nd edition. , 1975 .

[14]  Philip D. O'Neill,et al.  Computation of final outcome probabilities for the generalised stochastic epidemic , 2006, Stat. Comput..

[15]  D. Balding,et al.  Approximate Bayesian computation in population genetics. , 2002, Genetics.

[16]  A. Barbour Networks of queues and the method of stages , 1976, Advances in Applied Probability.

[17]  Paul Fearnhead,et al.  Constructing summary statistics for approximate Bayesian computation: semi‐automatic approximate Bayesian computation , 2012 .

[18]  Anthony N. Pettitt,et al.  Discussion of : constructing summary statistics for approximate Bayesian computation: semi-automatic approximate Bayesian computation , 2012 .

[19]  Andrew R. Francis,et al.  Using Approximate Bayesian Computation to Estimate Tuberculosis Transmission Parameters From Genotype Data , 2006, Genetics.

[20]  W. Ewens The sampling theory of selectively neutral alleles. , 1972, Theoretical population biology.

[21]  T. Kurtz Limit theorems for sequences of jump Markov processes approximating ordinary differential processes , 1971, Journal of Applied Probability.

[22]  P. Donnelly,et al.  Inferring coalescence times from DNA sequence data. , 1997, Genetics.

[23]  N. Ling The Mathematical Theory of Infectious Diseases and its applications , 1978 .

[24]  Paul Marjoram,et al.  Markov chain Monte Carlo without likelihoods , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[25]  C. Andrieu,et al.  The pseudo-marginal approach for efficient Monte Carlo computations , 2009, 0903.5480.

[26]  Darren J. Wilkinson,et al.  Bayesian inference for a discretely observed stochastic kinetic model , 2008, Stat. Comput..

[27]  Rob Deardon,et al.  Computational Statistics and Data Analysis Simulation-based Bayesian Inference for Epidemic Models , 2022 .

[28]  A. Gelman,et al.  Weak convergence and optimal scaling of random walk Metropolis algorithms , 1997 .

[29]  M. J. Bayarri,et al.  Non-Centered Parameterisations for Hierarchical Models and Data Augmentation , 2003 .

[30]  Thomas Sellke,et al.  On the asymptotic distribution of the size of a stochastic epidemic , 1983, Journal of Applied Probability.

[31]  W. K. Yuen,et al.  Optimal scaling of random walk Metropolis algorithms with discontinuous target densities , 2012, 1210.5090.

[32]  PETER NEAL,et al.  A case study in non-centering for data augmentation: Stochastic epidemics , 2005, Stat. Comput..

[33]  Gareth O. Roberts,et al.  Non-centred parameterisations for hierarchical models and data augmentation. , 2003 .