Predictive assessment of a non‐linear random effects model for multivariate time series of infectious disease counts

Infectious disease counts from surveillance systems are typically observed in several administrative geographical areas. In this paper, a non-linear model for the analysis of such multiple time series of counts is discussed. To account for heterogeneous incidence levels or varying transmission of a pathogen across regions, region-specific and possibly spatially correlated random effects are introduced. Inference is based on penalized likelihood methodology for mixed models. Since the use of classical model choice criteria such as AIC or BIC can be problematic in the presence of random effects, models are compared by means of one-step-ahead predictions and proper scoring rules. In a case study, the model is applied to monthly counts of meningococcal disease cases in 94 departments of France (excluding Corsica) and weekly counts of influenza cases in 140 administrative districts of Southern Germany. The predictive performance improves if existing heterogeneity is accounted for by random effects.

[1]  F. Diebold,et al.  Comparing Predictive Accuracy , 1994, Business Cycles.

[2]  P. McCullagh,et al.  Bias Correction in Generalized Linear Models , 1991 .

[3]  J. Giesecke,et al.  Modern Infectious Disease Epidemiology , 1994 .

[4]  Guoqi Qian,et al.  A random effects model for diseases with heterogeneous rates of infection , 2003 .

[5]  David R. Anderson,et al.  Model selection and multimodel inference : a practical information-theoretic approach , 2003 .

[6]  M. Plummer Penalized loss functions for Bayesian model comparison. , 2008, Biostatistics.

[7]  L. Fahrmeir,et al.  A Mixed Model Approach for Geoadditive Hazard Regression , 2007 .

[8]  Hua Liang,et al.  A Note on Conditional AIC for Linear Mixed-Effects Models. , 2008, Biometrika.

[9]  Nick Andrews,et al.  A Statistical Algorithm for the Early Detection of Outbreaks of Infectious Disease , 1996 .

[10]  D. Brigo,et al.  Parameterizing correlations: a geometric interpretation , 2007 .

[11]  J. Besag,et al.  Bayesian image restoration, with two applications in spatial statistics , 1991 .

[12]  L. Held,et al.  Multivariate modelling of infectious disease surveillance data , 2008, Statistics in medicine.

[13]  Volker Schmid,et al.  A two-component model for counts of infectious diseases. , 2005, Biostatistics.

[14]  P. Diggle,et al.  Spatiotemporal prediction for log‐Gaussian Cox processes , 2001 .

[15]  A. Raftery,et al.  Strictly Proper Scoring Rules, Prediction, and Estimation , 2007 .

[16]  J. Copas,et al.  Using regression models for prediction: shrinkage and regression to the mean , 1997, Statistical methods in medical research.

[17]  S. Greven,et al.  On the behaviour of marginal and conditional AIC in linear mixed models , 2010 .

[18]  Nando de Freitas,et al.  Sequential Monte Carlo Methods in Practice , 2001, Statistics for Engineering and Information Science.

[19]  John Ludbrook,et al.  Why Permutation Tests are Superior to t and F Tests in Biomedical Research , 1998 .

[20]  T. Britton,et al.  Statistical studies of infectious disease incidence , 1999 .

[21]  L. Tierney,et al.  Accurate Approximations for Posterior Moments and Marginal Densities , 1986 .

[22]  Sylvia Richardson,et al.  A hierarchical model for space–time surveillance data on meningococcal disease incidence , 2003 .

[23]  Edward S. Epstein,et al.  A Scoring System for Probability Forecasts of Ranked Categories , 1969 .

[24]  Ronald A. Thisted,et al.  Elements of statistical computing , 1986 .

[25]  L. Waller,et al.  Estimating Vaccine Efficacy from Outbreak Size Household Data in the Presence of Heterogeneous Transmission Probabilities , 2006, Journal of biopharmaceutical statistics.

[26]  R. Schall Estimation in generalized linear models with random effects , 1991 .

[27]  Douglas M. Bates,et al.  Unconstrained parametrizations for variance-covariance matrices , 1996, Stat. Comput..

[28]  J. S. Rao,et al.  Fence methods for mixed model selection , 2008, 0808.0985.

[29]  I. Jolliffe Uncertainty and Inference for Verification Measures , 2007 .

[30]  Dag Tjøstheim,et al.  Poisson Autoregression , 2008 .

[31]  John E. Dennis,et al.  Numerical methods for unconstrained optimization and nonlinear equations , 1983, Prentice Hall series in computational mathematics.

[32]  A. P. Dawid,et al.  Present position and potential developments: some personal views , 1984 .

[33]  D. Harville Matrix Algebra From a Statistician's Perspective , 1998 .

[34]  Refik Soyer,et al.  Bayesian Methods for Nonlinear Classification and Regression , 2004, Technometrics.

[35]  Rob J Hyndman,et al.  Mixed Model-Based Hazard Estimation , 2002 .

[36]  F. Vaida,et al.  Conditional Akaike information for mixed-effects models , 2005 .

[37]  D. Pauler The Schwarz criterion and related methods for normal linear models , 1998 .

[38]  P. Diggle Time Series: A Biostatistical Introduction , 1990 .

[39]  J. Copas Regression, Prediction and Shrinkage , 1983 .

[40]  Thomas Kneib,et al.  Mixed model based inference in structured additive regression , 2006 .

[41]  Leonhard Held,et al.  A statistical framework for the analysis of multivariate infectious disease surveillance counts , 2005 .

[42]  Claudia Czado,et al.  Predictive Model Assessment for Count Data , 2009, Biometrics.

[43]  A. Lambert Branching Processes: Variation, Growth and Extinction of Populations , 2006 .

[44]  C P Farrington,et al.  Branching process models for surveillance of infectious diseases controlled by mass vaccination. , 2003, Biostatistics.

[45]  N. Breslow,et al.  Approximate inference in generalized linear mixed models , 1993 .

[46]  M. Karim Generalized Linear Models With Random Effects , 1991 .

[47]  R. Rigby,et al.  Generalized Autoregressive Moving Average Models , 2003 .

[48]  Marginal and Conditional Akaike Information Criteria in Linear Mixed Models , 2009 .

[49]  Michael Höhle,et al.  surveillance: An R package for the monitoring of infectious diseases , 2007, Comput. Stat..

[50]  G. Verbeke,et al.  Statistical inference in generalized linear mixed models: a review. , 2006, The British journal of mathematical and statistical psychology.

[51]  Mathias W. Hofmann Statistical Models for Infectious Disease Surveillance Counts , 2007 .

[52]  Y. Ogata Evaluation of spatial Bayesian models—Two computational methods☆ , 1996 .

[53]  Qing Liu,et al.  A note on Gauss—Hermite quadrature , 1994 .

[54]  N. Breslow,et al.  Bias Correction in Generalized Linear Mixed Models with Multiple Components of Dispersion , 1996 .

[55]  UsingSmoothing SplinesbyXihong Liny,et al.  Inference in Generalized Additive Mixed Models , 1999 .

[56]  S. Zeger,et al.  Markov regression models for time series: a quasi-likelihood approach. , 1988, Biometrics.

[57]  Jim Q. Smith,et al.  Diagnostic checks of non‐standard time series models , 1985 .

[58]  Bradley P. Carlin,et al.  Bayesian measures of model complexity and fit , 2002 .