Time series modeling of pathogen‐specific disease probabilities with subsampled data

Many diseases arise due to exposure to one of multiple possible pathogens. We consider the situation in which disease counts are available over time from a study region, along with a measure of clinical disease severity, for example, mild or severe. In addition, we suppose a subset of the cases are lab tested in order to determine the pathogen responsible for disease. In such a context, we focus interest on modeling the probabilities of disease incidence given pathogen type. The time course of these probabilities is of great interest as is the association with time-varying covariates such as meteorological variables. In this set up, a natural Bayesian approach would be based on imputation of the unsampled pathogen information using Markov Chain Monte Carlo but this is computationally challenging. We describe a practical approach to inference that is easy to implement. We use an empirical Bayes procedure in a first step to estimate summary statistics. We then treat these summary statistics as the observed data and develop a Bayesian generalized additive model. We analyze data on hand, foot, and mouth disease (HFMD) in China in which there are two pathogens of primary interest, enterovirus 71 (EV71) and Coxackie A16 (CA16). We find that both EV71 and CA16 are associated with temperature, relative humidity, and wind speed, with reasonably similar functional forms for both pathogens. The important issue of confounding by time is modeled using a penalized B-spline model with a random effects representation. The level of smoothing is addressed by a careful choice of the prior on the tuning variance.

[1]  G. Roberts,et al.  Bayesian inference for partially observed stochastic epidemics , 1999 .

[2]  A. Gelman,et al.  Weak convergence and optimal scaling of random walk Metropolis algorithms , 1997 .

[3]  B. Hoadley The Compound Multinomial Distribution and Bayesian Analysis of Categorical Data from Finite Populations , 1969 .

[4]  M. Wand,et al.  Semiparametric Regression: Parametric Regression , 2003 .

[5]  P. Diaconis,et al.  Algebraic algorithms for sampling from conditional distributions , 1998 .

[6]  Y. Hao,et al.  Effect of meteorological variables on the incidence of hand, foot, and mouth disease in children: a time-series analysis in Guangzhou, China , 2013, BMC Infectious Diseases.

[7]  J. Wakefield,et al.  Hand, Foot, and Mouth Disease in China: Patterns of Spread and Transmissibility , 2011, Epidemiology.

[8]  Nicholas G. Polson,et al.  Tracking Epidemics With Google Flu Trends Data and a State-Space SEIR Model , 2012, Journal of the American Statistical Association.

[9]  N. Hens,et al.  Contribution of respiratory pathogens to influenza-like illness consultations , 2012, Epidemiology and Infection.

[10]  J. Wakefield,et al.  eAppendix Hand , Foot and Mouth Disease in China : Patterns of Spread and Transmissibility during 2008-2009 , 2011 .

[11]  Cici Xi Chen Bauer,et al.  Bayesian Modeling of Health Data in Space and Time , 2012 .

[12]  Cecile Viboud,et al.  Mortality due to influenza in the United States--an annualized regression approach using multiple-cause mortality data. , 2006, American journal of epidemiology.

[13]  Gavin J. Gibson,et al.  Estimating parameters in stochastic compartmental models using Markov chain methods , 1998 .

[14]  H. Rue,et al.  Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations , 2009 .

[15]  A. Cook,et al.  Inference in Epidemic Models without Likelihoods , 2009 .

[16]  C. Tong,et al.  Global epidemiology of Enterovirus 71 , 2009 .

[17]  B. Finkenstädt,et al.  Statistical Inference in a Stochastic Epidemic SEIR Model with Control Intervention: Ebola as a Case Study , 2006, Biometrics.

[18]  Jianguo Wu,et al.  Characterization of an Outbreak of Hand, Foot, and Mouth Disease in Nanchang, China in 2010 , 2011, PLoS ONE.

[19]  J. Wakefield,et al.  Bayesian approaches to disease mapping , 2001 .

[20]  E L Ionides,et al.  Inference for nonlinear dynamical systems , 2006, Proceedings of the National Academy of Sciences.

[21]  T. Louis,et al.  Model choice in time series studies of air pollution and mortality , 2006 .

[22]  Hualiang Lin,et al.  The effect of meteorological factors on adolescent hand, foot, and mouth disease and associated effect modifiers. , 2014, Global health action.

[23]  A. Doucet,et al.  Particle Markov chain Monte Carlo methods , 2010 .

[24]  Kow-Tong Chen,et al.  The Association between Enterovirus 71 Infections and Meteorological Parameters in Taiwan , 2012, PloS one.

[25]  A. Melegaro,et al.  Assessing the burden of influenza and other respiratory infections in England and Wales. , 2007, The Journal of infection.

[26]  T. Hastie,et al.  Improved Semiparametric Time Series Models of Air Pollution and Mortality , 2004 .

[27]  J. Wakefield,et al.  Spatial epidemiology: methods and applications. , 2000 .

[28]  David A. Rasmussen,et al.  Inference for Nonlinear Epidemiological Models Using Genealogies and Time Series , 2011, PLoS Comput. Biol..

[29]  Leonhard Held,et al.  A statistical framework for the analysis of multivariate infectious disease surveillance counts , 2005 .

[30]  S. Chuang,et al.  Is hand, foot and mouth disease associated with meteorological parameters? , 2010, Epidemiology and Infection.

[31]  M. Clyde,et al.  Health Effects of Air Pollution: A Statistical Review , 2003 .

[32]  M. Hashizume,et al.  The influence of temperature and humidity on the incidence of hand, foot, and mouth disease in Japan. , 2011, The Science of the total environment.