Variable selection for spatial latent predictors under Bayesian spatial model

The problem of variable selection is encountered in model fitting with unobserved spatial predictors at the sites where outcomes are measured. The variability of the interpolated predictors at outcome sites results in potential problems of variable selection and averaging the results across different datasets. A Bayesian spatial model is developed to tackle this issue. By sampling the latent spatial predictors and selecting the spatial and non-spatial predictors using stochastic search variable selection Gibbs sampling algorithm, our approach allows for uncertainty of the predictors including the interpolated predictors. The approach is evaluated and illustrated through a simulated data example and an application to mental retardation and developmental delay in a Medicaid population in South Carolina with samples of soil chemistry.

[1]  Eduardo Ley,et al.  On the Effect of Prior Assumptions in Bayesian Model Averaging With Applications to Growth Regression , 2007 .

[2]  M. Hutchinson,et al.  Splines — more than just a smooth interpolator , 1994 .

[3]  Ying C. MacNab,et al.  Spline smoothing in Bayesian disease mapping , 2007 .

[4]  W. Gilks,et al.  Adaptive Rejection Metropolis Sampling Within Gibbs Sampling , 1995 .

[5]  Marc Voltz,et al.  A comparison of kriging, cubic splines and classification for predicting soil properties from sample information , 1990 .

[6]  B. Ripley,et al.  Semiparametric Regression: Preface , 2003 .

[7]  Douglas W. Nychka,et al.  FUNFITS: data analysis and statistical tools for estimating functions , 2008 .

[8]  Seymour Geisser,et al.  On Prior Distributions for Binary Trials , 1984 .

[9]  Nicole A. Lazar,et al.  Statistical Analysis With Missing Data , 2003, Technometrics.

[10]  Raymond J. Carroll,et al.  Measurement error in nonlinear models: a modern perspective , 2006 .

[11]  James G. Scott,et al.  Multiple Testing , Empirical Bayes , and the Variable-Selection Problem , 2008 .

[12]  Roderick J. A. Little,et al.  Statistical Analysis with Missing Data: Little/Statistical Analysis with Missing Data , 2002 .

[13]  Edward I. George,et al.  Empirical Bayes vs. Fully Bayes Variable Selection , 2008 .

[14]  Geoffrey M. Laslett,et al.  Kriging and Splines: An Empirical Comparison of their Predictive Performance in Some Applications , 1994 .

[15]  E. George,et al.  Journal of the American Statistical Association is currently published by American Statistical Association. , 2007 .

[16]  M. Wand,et al.  Generalized additive models for cancer mapping with incomplete covariates. , 2004, Biostatistics.

[17]  Alex B. McBratney,et al.  Further Comparison of Spatial Methods for Predicting Soil pH , 1990 .

[18]  B. Carlin,et al.  Bayesian Model Choice Via Markov Chain Monte Carlo Methods , 1995 .

[19]  Andrew B Lawson,et al.  Bayesian spatial modeling of disease risk in relation to multivariate environmental risk fields , 2010, Statistics in medicine.

[20]  Andrew B. Lawson,et al.  Bayesian Disease Mapping: Hierarchical Modeling in Spatial Epidemiology , 2008 .

[21]  L R Goldman,et al.  Chemicals in the environment and developmental toxicity to children: a public health and policy perspective. , 2000, Environmental health perspectives.

[22]  Robert Haining,et al.  Statistics for spatial data: by Noel Cressie, 1991, John Wiley & Sons, New York, 900 p., ISBN 0-471-84336-9, US $89.95 , 1993 .

[23]  Montserrat Fuentes,et al.  Spatial Association between Speciated Fine Particles and Mortality , 2006, Biometrics.

[24]  Petros Dellaportas,et al.  On Bayesian model and variable selection using MCMC , 2002, Stat. Comput..

[25]  Gary R. Krieger,et al.  Clinical Environmental Health and Toxic Exposures , 2001 .

[26]  Refik Soyer,et al.  Bayesian Methods for Nonlinear Classification and Regression , 2004, Technometrics.

[27]  P. Green Reversible jump Markov chain Monte Carlo computation and Bayesian model determination , 1995 .

[28]  David Bellinger,et al.  Human Developmental Neurotoxicology , 2006 .

[29]  Sw. Banerjee,et al.  Hierarchical Modeling and Analysis for Spatial Data , 2003 .

[30]  Andrew Thomas,et al.  WinBUGS - A Bayesian modelling framework: Concepts, structure, and extensibility , 2000, Stat. Comput..

[31]  Bradley P Carlin,et al.  spBayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models. , 2007, Journal of statistical software.