Residual Spatial Correlation Between Geographically Referenced Observations: A Bayesian Hierarchical Modeling Approach

Background: Analytic methods commonly used in epidemiology do not account for spatial correlation between observations. In regression analyses, this omission can bias parameter estimates and yield incorrect standard error estimates. We present a Bayesian hierarchical model (BHM) approach that accounts for spatial correlation, and illustrate its strengths and weaknesses by applying this modeling approach to data on Wuchereria bancrofti infection in Haiti. Methods: A program to eliminate lymphatic filariasis in Haiti assessed prevalence of W. bancrofti infection in 57 schools across Leogane Commune. We analyzed the spatial pattern in the prevalence data using semi-variograms and correlograms. We then modeled the data using (1) standard logistic regression (GLM); (2) non-Bayesian logistic generalized linear mixed models (GLMMs) with school-specific nonspatial random effects; (3) BHMs with school-specific nonspatial random effects; and (4) BHMs with spatial random effects. Results: An exponential semi-variogram with an effective range of 2.15 km best fit the data. GLMM and nonspatial BHM point estimates were comparable and also were generally similar with the marginal GLM point estimates. In contrast, compared with the nonspatial mixed model results, spatial BHM point estimates were markedly attenuated. Discussion: The clear spatial pattern evident in the Haitian W. bancrofti prevalence data and the observation that point estimates and standard errors differed depending on the modeling approach indicate that it is important to account for residual spatial correlation in analyses of W. bancrofti infection data. Bayesian hierarchical models provide a flexible, readily implementable approach to modeling spatially correlated data. However, our results also illustrate that spatial smoothing must be applied with care.

[1]  P. Diggle,et al.  Model‐based geostatistics , 2007 .

[2]  L. Waller,et al.  Community- and individual-level determinants of Wuchereria bancrofti infection in Leogane Commune, Haiti. , 2004, The American journal of tropical medicine and hygiene.

[3]  R. Moyeed,et al.  Spatial variation of Anopheles-transmitted Wuchereria bancrofti and Plasmodium falciparum infection densities in Papua New Guinea , 2003, Filaria journal.

[4]  A. Gelfand,et al.  Proper multivariate conditional autoregressive models for spatial data analysis. , 2003, Biostatistics.

[5]  E. Michael,et al.  A geostatistical analysis of the geographic distribution of lymphatic filariasis prevalence in southern India. , 2002, The American journal of tropical medicine and hygiene.

[6]  P. Diggle,et al.  Childhood malaria in the Gambia: a case-study in model-based geostatistics. , 2002 .

[7]  Eric R. Ziegel,et al.  Generalized Linear Models , 2002, Technometrics.

[8]  Penelope Vounatsou,et al.  Rise in malaria incidence rates in South Africa: a small-area spatial analysis of variation in time trends. , 2002, American journal of epidemiology.

[9]  I. Kleinschmidt,et al.  Use of generalized linear mixed models in the spatial analysis of small-area malaria incidence rates in Kwazulu Natal, South Africa. , 2001, American journal of epidemiology.

[10]  T. Louis,et al.  Empirical Bayes: Past, Present and Future , 2000 .

[11]  J. Booth,et al.  2. Random-Effects Modeling of Categorical Response Data , 2000 .

[12]  M C Thomson,et al.  Predicting malaria infection in Gambian children from satellite data and bed net use surveys: the importance of spatial correlation in the interpretation of results. , 1999, The American journal of tropical medicine and hygiene.

[13]  Andrew Gelman,et al.  General methods for monitoring convergence of iterative simulations , 1998 .

[14]  S+ Spatial Stats , 1998 .

[15]  Stephen P. Kaluzny,et al.  S+SpatialStats: User’s Manual for Windows® and UNIX® , 1998 .

[16]  J. Roberts,et al.  Wuchereria bancrofti in Kwale District, Coastal Kenya: patterns of focal distribution of infection, clinical manifestations and anti-filarial IgG responsiveness , 1998, Parasitology.

[17]  T. C. Haas,et al.  Model-based geostatistics - Discussion , 1998 .

[18]  N. G. Best,et al.  Disease mapping with errors in covariates. , 1997, Statistics in medicine.

[19]  Bradley P. Carlin,et al.  Hierarchical models for mapping Ohio lung cancer rates , 1997 .

[20]  R. Littell SAS System for Mixed Models , 1996 .

[21]  J. Besag,et al.  On conditional and intrinsic autoregressions , 1995 .

[22]  P. Diggle Analysis of Longitudinal Data , 1995 .

[23]  A. Buck,et al.  Efficient assessment of filariasis endemicity by screening for filarial antigenaemia in a sentinel population. , 1994, Transactions of the Royal Society of Tropical Medicine and Hygiene.

[24]  Robert Haining,et al.  Statistics for spatial data: by Noel Cressie, 1991, John Wiley & Sons, New York, 900 p., ISBN 0-471-84336-9, US $89.95 , 1993 .

[25]  N. Breslow,et al.  Approximate inference in generalized linear mixed models , 1993 .

[26]  Mike Rees,et al.  5. Statistics for Spatial Data , 1993 .

[27]  D. Rubin,et al.  Inference from Iterative Simulation Using Multiple Sequences , 1992 .

[28]  J. Besag,et al.  Bayesian image restoration, with two applications in spatial statistics , 1991 .

[29]  L. Anselin,et al.  Spatial Econometrics: Methods and Models , 1988 .

[30]  D. Clayton,et al.  Empirical Bayes estimates of age-standardized relative risks for use in disease mapping. , 1987, Biometrics.

[31]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[32]  D. Cook,et al.  Multiple regression in geographical mortality studies, with allowance for spatially correlated errors. , 1983, Biometrics.

[33]  Sirpa Mäki,et al.  The computer program , 1980 .