Preferential sampling and Bayesian geostatistics: Statistical modeling and examples

Preferential sampling refers to any situation in which the spatial process and the sampling locations are not stochastically independent. In this paper, we present two examples of geostatistical analysis in which the usual assumption of stochastic independence between the point process and the measurement process is violated. To account for preferential sampling, we specify a flexible and general Bayesian geostatistical model that includes a shared spatial random component. We apply the proposed model to two different case studies that allow us to highlight three different modeling and inferential aspects of geostatistical modeling under preferential sampling: (1) continuous or finite spatial sampling frame; (2) underlying causal model and relevant covariates; and (3) inferential goals related to mean prediction surface or prediction uncertainty.

[1]  Peter J. Diggle,et al.  Combining data from multiple spatially referenced prevalence surveys using generalized linear geostatistical models , 2013, 1308.2790.

[2]  D. Catelan,et al.  A Bayesian kriging model for estimating residential exposure to air pollution of children living in a high-risk area in Italy. , 2013, Geospatial health.

[3]  D. Catelan,et al.  Preferential sampling in veterinary parasitological surveillance. , 2016, Geospatial health.

[4]  P. Diggle,et al.  Geostatistical inference under preferential sampling , 2010 .

[5]  Montserrat Fuentes,et al.  Model Evaluation and Spatial Interpolation by Bayesian Combination of Observations with Outputs from Numerical Models , 2005, Biometrics.

[6]  Noel A Cressie,et al.  Statistics for Spatial Data. , 1992 .

[7]  Peter J. Diggle,et al.  Bayesian Geostatistical Design , 2006 .

[8]  L. Held,et al.  Towards joint disease mapping , 2005, Statistical methods in medical research.

[9]  Sw. Banerjee,et al.  Hierarchical Modeling and Analysis for Spatial Data , 2003 .

[10]  G. Shaddick,et al.  Unbiasing estimates from preferentially sampled spatial data ∗ , 2012 .

[11]  D. Dunson,et al.  Bayesian geostatistical modelling with informative sampling locations. , 2011, Biometrika.

[12]  A. Szpiro,et al.  Impact of preferential sampling on exposure prediction and health effect inference in the context of air pollution epidemiology , 2015, Environmetrics.

[13]  M. Held Towards Joint Disease , 2005 .

[14]  J. Avorn,et al.  Variable selection for propensity score models. , 2006, American journal of epidemiology.

[15]  P. Diggle,et al.  Model‐based geostatistics , 2007 .

[16]  Alan E Gelfand,et al.  A Spatio-Temporal Downscaler for Output From Numerical Models , 2010, Journal of agricultural, biological, and environmental statistics.

[17]  Peter J. Diggle,et al.  Spatial and spatio-temporal Log-Gaussian Cox processes:extending the geostatistical paradigm , 2013, 1312.6536.

[18]  G. Fossati,et al.  Modelling of PM10 concentrations over Milano urban area using two aerosol modules , 2008, Environ. Model. Softw..

[19]  P. Vounatsou,et al.  Bayesian spatio-temporal modelling of tobacco-related cancer mortality in Switzerland. , 2013, Geospatial health.

[20]  D. Catelan,et al.  Geostatistical integration and uncertainty in pollutant concentration surface under preferential sampling. , 2016, Geospatial health.

[21]  D. Catelan,et al.  Sheep and Fasciola hepatica in Europe: the GLOWORM experience. , 2015, Geospatial health.

[22]  Alan E Gelfand,et al.  On the effect of preferential sampling in spatial prediction , 2012, Environmetrics.

[23]  J. Besag,et al.  Bayesian image restoration, with two applications in spatial statistics , 1991 .