Forward simulation MCMC with applications to stochastic epidemic models

For many stochastic models, it is difficult to make inference about the model parameters because it is impossible to write down a tractable likelihood given the observed data. A common solution is data augmentation in a Markov chain Monte Carlo (MCMC) framework. However, there are statistical problems where this approach has proved infeasible but where simulation from the model is straightforward leading to the popularity of the approximate Bayesian computation algorithm. We introduce a forward simulation MCMC (fsMCMC) algorithm, which is primarily based upon simulation from the model. The fsMCMC algorithm formulates the simulation of the process explicitly as a data augmentation problem. By exploiting non-centred parameterizations, an efficient MCMC updating schema for the parameters and augmented data is introduced, whilst maintaining straightforward simulation from the model. The fsMCMC algorithm is successfully applied to two distinct epidemic models including a birth–death–mutation model that has only previously been analysed using approximate Bayesian computation methods.

[1]  G. Schoolnik,et al.  The epidemiology of tuberculosis in San Francisco. A population-based study using conventional and molecular methods. , 1994, The New England journal of medicine.

[2]  Arnaud Doucet,et al.  An adaptive sequential Monte Carlo method for approximate Bayesian computation , 2011, Statistics and Computing.

[3]  C. Andrieu,et al.  The pseudo-marginal approach for efficient Monte Carlo computations , 2009, 0903.5480.

[4]  M. Beaumont Estimation of population growth or decline in genetically monitored populations. , 2003, Genetics.

[5]  PETER NEAL,et al.  A case study in non-centering for data augmentation: Stochastic epidemics , 2005, Stat. Comput..

[6]  N G Becker,et al.  Inference for an epidemic when susceptibility varies. , 2001, Biostatistics.

[7]  Philip D. O'Neill,et al.  Computation of final outcome probabilities for the generalised stochastic epidemic , 2006, Stat. Comput..

[8]  Andrew R. Francis,et al.  Using Approximate Bayesian Computation to Estimate Tuberculosis Transmission Parameters From Genotype Data , 2006, Genetics.

[9]  Anthony N. Pettitt,et al.  Discussion of : constructing summary statistics for approximate Bayesian computation: semi-automatic approximate Bayesian computation , 2012 .

[10]  W. K. Yuen,et al.  Optimal scaling of random walk Metropolis algorithms with discontinuous target densities , 2012, 1210.5090.

[11]  D. Gillespie A General Method for Numerically Simulating the Stochastic Time Evolution of Coupled Chemical Reactions , 1976 .

[12]  Mark M. Tanaka,et al.  Sequential Monte Carlo without likelihoods , 2007, Proceedings of the National Academy of Sciences.

[13]  M. Behr,et al.  Sex differences in the epidemiology of tuberculosis in San Francisco. , 2000, The international journal of tuberculosis and lung disease : the official journal of the International Union against Tuberculosis and Lung Disease.

[14]  A. Gelman,et al.  Weak convergence and optimal scaling of random walk Metropolis algorithms , 1997 .

[15]  Darren J. Wilkinson,et al.  Bayesian inference for a discretely observed stochastic kinetic model , 2008, Stat. Comput..

[16]  D. Balding,et al.  Approximate Bayesian computation in population genetics. , 2002, Genetics.

[17]  Paul Fearnhead,et al.  Constructing summary statistics for approximate Bayesian computation: semi‐automatic approximate Bayesian computation , 2012 .

[18]  Rob Deardon,et al.  Computational Statistics and Data Analysis Simulation-based Bayesian Inference for Epidemic Models , 2022 .

[19]  H. L. Le Roy,et al.  Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability; Vol. IV , 1969 .

[20]  M. J. Bayarri,et al.  Non-Centered Parameterisations for Hierarchical Models and Data Augmentation , 2003 .

[21]  Thomas Sellke,et al.  On the asymptotic distribution of the size of a stochastic epidemic , 1983, Journal of Applied Probability.

[22]  Peter Neal,et al.  Efficient likelihood-free Bayesian Computation for household epidemics , 2012, Stat. Comput..

[23]  N. Ling The Mathematical Theory of Infectious Diseases and its applications , 1978 .

[24]  A. Barbour Networks of queues and the method of stages , 1976, Advances in Applied Probability.

[25]  Ching-fan Sheu,et al.  Simulation-based bayesian inference using BUGS , 1998 .

[26]  T. Kurtz Limit theorems for sequences of jump Markov processes approximating ordinary differential processes , 1971, Journal of Applied Probability.

[27]  G. Roberts,et al.  Optimal Scaling of Random Walk Metropolis Algorithms with Non-Gaussian Proposals , 2011 .

[28]  W. Ewens The sampling theory of selectively neutral alleles. , 1972, Theoretical population biology.

[29]  P. Donnelly,et al.  Inferring coalescence times from DNA sequence data. , 1997, Genetics.

[30]  Simon R. White,et al.  Fast Approximate Bayesian Computation for discretely observed Markov models using a factorised posterior distribution , 2013, 1301.2975.

[31]  Alexander Grey,et al.  The Mathematical Theory of Infectious Diseases and Its Applications , 1977 .

[32]  Paul Marjoram,et al.  Markov chain Monte Carlo without likelihoods , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[33]  Gareth O. Roberts,et al.  Non-centred parameterisations for hierarchical models and data augmentation. , 2003 .