Disease Mapping With Generative Models

ABSTRACT Disease mapping focuses on learning about areal units presenting high relative risk. Disease mapping models assume that the disease counts are distributed as Poisson random variables with the respective means typically specified as the product of the relative risk and the expected count. These models usually incorporate spatial random effects to accomplish spatial smoothing of the relative risks. Fitting of these models often computes expected disease counts via internal standardization. This places the data on both sides of the model, that is, the counts are on the left side but they are also used to obtain the expected counts on the right side. As a result, these internally standardized models are incoherent and not generative; probabilistically, they could not produce the data we observe. Here, we argue for adopting the direct generative model for disease counts, modeling disease incidence rates instead of relative risks, using a generalized logistic regression. Then, the relative risks are then extracted post model fitting. We first demonstrate the benefit of the generative model without incorporating spatial smoothing using simulation. Then, spatial smoothing is introduced using the customary conditionally autoregressive model. We also extend the generative model to dynamic settings. The generative models are compared with internally standardized models, again through simulated datasets but also through a well-examined lung cancer morbidity dataset in Ohio. Both models are spatial and both smooth the data similarly with regard to relative risks. However, the generative coherent models tend to provide tighter credible intervals. Since the generative specification is coherent, is at least as good inferentially, and is no more difficult to fit, we suggest that it should be the model of choice for spatial disease mapping.

[1]  M. Held Towards Joint Disease , 2005 .

[2]  P. Diggle,et al.  Bivariate Binomial Spatial Modeling of Loa loa Prevalence in Tropical Africa , 2008 .

[3]  Nicola G. Best,et al.  A shared component model for detecting joint and selective clustering of two diseases , 2001 .

[4]  D. Clayton,et al.  Empirical Bayes estimates of age-standardized relative risks for use in disease mapping. , 1987, Biometrics.

[5]  C B Dean,et al.  Parametric bootstrap and penalized quasi-likelihood inference in conditional autoregressive models. , 2000, Statistics in medicine.

[6]  L Knorr-Held,et al.  Bayesian modelling of inseparable space-time variation in disease risk. , 2000, Statistics in medicine.

[7]  Tom Koch Cartographies of Disease: Maps, Mapping, and Medicine , 2004 .

[8]  Norman E. Breslow,et al.  Estimation of Disease Rates in Small Areas: A new Mixed Model for Spatial Dependence , 2000 .

[9]  L. Waller,et al.  Applied Spatial Statistics for Public Health Data: Waller/Applied Spatial Statistics , 2004 .

[10]  L. Held,et al.  Towards joint disease mapping , 2005, Statistical methods in medical research.

[11]  L Knorr-Held,et al.  Modelling risk from a disease in time and space. , 1998, Statistics in medicine.

[12]  Bradley P. Carlin,et al.  Hierarchical Multivarite CAR Models for Spatio-Temporally Correlated Survival Data , 2002 .

[13]  Gentry White,et al.  A stochastic neighborhood conditional autoregressive model for spatial data , 2009, Comput. Stat. Data Anal..

[14]  Y. MacNab,et al.  Autoregressive Spatial Smoothing and Temporal Spline Smoothing for Mapping Rates , 2001, Biometrics.

[15]  A. Gelfand,et al.  Proper multivariate conditional autoregressive models for spatial data analysis. , 2003, Biostatistics.

[16]  A. Raftery,et al.  Strictly Proper Scoring Rules, Prediction, and Estimation , 2007 .

[17]  Fabio Divino,et al.  Disease mapping models: an empirical evaluation , 2000 .

[18]  J. Besag,et al.  Bayesian image restoration, with two applications in spatial statistics , 1991 .

[19]  Thomas A. Louis,et al.  Empirical Bayes Methods , 2006 .

[20]  Ronald P. Barry,et al.  A method for spatial–temporal forecasting with an application to real estate prices , 2000 .

[21]  D. Clayton,et al.  Statistical Models in Epidemiology , 1993 .

[22]  Sylvia Richardson,et al.  A comparison of Bayesian spatial models for disease mapping , 2005, Statistical methods in medical research.

[23]  B. Efron,et al.  Stein's Estimation Rule and Its Competitors- An Empirical Bayes Approach , 1973 .

[24]  A. Leyland,et al.  Empirical Bayes methods for disease mapping , 2005, Statistical methods in medical research.

[25]  Sw. Banerjee,et al.  Hierarchical Modeling and Analysis for Spatial Data , 2003 .

[26]  Leah Blau,et al.  Statistical Models In Epidemiology , 2016 .

[27]  Bradley P. Carlin,et al.  Hierarchical models for mapping Ohio lung cancer rates , 1997 .

[28]  A. Gelfand,et al.  Hierarchical Spatio-Temporal Mapping of Disease , 2007 .

[29]  Bradley P. Carlin,et al.  Hierarchical Spatio-Temporal Mapping of Disease Rates , 1997 .

[30]  B. Efron,et al.  Stein's Paradox in Statistics , 1977 .

[31]  D. Berry,et al.  Statistical models in epidemiology, the environment, and clinical trials , 2000 .

[32]  Dongchu Sun,et al.  Spatio-temporal interaction with disease mapping. , 2000, Statistics in medicine.

[33]  Jason A. Duan,et al.  Modeling Disease Incidence Data with Spatial and Spatio Temporal Dirichlet Process Mixtures , 2008, Biometrical journal. Biometrische Zeitschrift.

[34]  D. Clayton,et al.  Bayesian analysis of space-time variation in disease risk. , 1995, Statistics in medicine.