Efficient Data Augmentation for Fitting Stochastic Epidemic Models to Prevalence Data

ABSTRACT Stochastic epidemic models describe the dynamics of an epidemic as a disease spreads through a population. Typically, only a fraction of cases are observed, and only at a set of discrete times. This incomplete information about the time evolution of an epidemic gives rise to a complicated latent variable problem in which the state space of the epidemic grows with the population size, making it infeasible to integrate analytically over the missing data for populations of even moderate size. We present a data augmentation Markov chain Monte Carlo (MCMC) framework for Bayesian estimation of stochastic epidemic model parameters, in which measurements are augmented with subject-level disease histories. In our MCMC algorithm, we propose each new subject-level path, conditional on the data, using a time-inhomogeneous continuous-time Markov process with rates determined by the infection histories of other individuals. The method is general and may be applied to a broad class of epidemic models with only minimal modifications to the model dynamics and/or emission distribution. We present our algorithm in the context of multiple stochastic epidemic models in which the data are binomially sampled prevalence counts, and apply our method to data from an outbreak of influenza in a British boarding school. Supplementary material for this article is available online.
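
To make the observation setting concrete, the following is a minimal sketch of the kind of data the abstract describes: a stochastic SIR epidemic simulated with Gillespie's direct method, with the number of infected individuals observed at discrete times through binomial sampling. It is not the article's inference algorithm. The function name and the parameterization (per-contact infection rate `beta`, recovery rate `mu`, detection probability `rho`) are illustrative assumptions and are not taken from the abstract.

```python
import random


def simulate_sir_prevalence(pop_size, init_infected, beta, mu, rho, obs_times, seed=0):
    """Simulate an SIR Markov jump process and binomially sampled prevalence.

    Hypothetical parameterization (an assumption, not from the abstract):
      beta: per-contact infection rate, mu: recovery rate,
      rho:  probability that each infected individual is detected.
    Returns (true_prevalence, observed_counts), one entry per observation time.
    """
    rng = random.Random(seed)
    s, i = pop_size - init_infected, init_infected

    def next_event_time(now, s, i):
        # Total event rate is infection rate plus recovery rate.
        total_rate = beta * s * i + mu * i
        return now + rng.expovariate(total_rate) if total_rate > 0 else float("inf")

    t_next = next_event_time(0.0, s, i)
    true_prevalence, observed_counts = [], []
    for t_obs in sorted(obs_times):
        # Apply every event that occurs before this observation time.
        while t_next <= t_obs:
            infection_rate = beta * s * i
            recovery_rate = mu * i
            if rng.random() * (infection_rate + recovery_rate) < infection_rate:
                s, i = s - 1, i + 1   # infection: S -> I
            else:
                i -= 1                # recovery: I -> R
            t_next = next_event_time(t_next, s, i)
        true_prevalence.append(i)
        # Binomial emission: each infected individual is detected with prob. rho.
        observed_counts.append(sum(rng.random() < rho for _ in range(i)))
    return true_prevalence, observed_counts


if __name__ == "__main__":
    truth, observed = simulate_sir_prevalence(
        pop_size=200, init_infected=3, beta=0.004, mu=0.4, rho=0.3,
        obs_times=range(1, 15), seed=42,
    )
    print("true prevalence:    ", truth)
    print("observed prevalence:", observed)
```

Only `observed_counts` would be available to the analyst in this setting. The full sequence of infection and recovery events between observation times is the missing data that a data augmentation scheme would have to impute.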
