Consequences of kriging and land use regression for PM2.5 predictions in epidemiologic analyses: insights into spatial variability using high-resolution satellite data

Many epidemiological studies use predicted air pollution exposures as surrogates for true air pollution levels. These predicted exposures contain exposure measurement error, yet simulation studies have typically found negligible bias in resulting health effect estimates. However, previous studies typically assumed a statistical spatial model for air pollution exposure, which may be oversimplified. We address this shortcoming by assuming a realistic, complex exposure surface derived from fine-scale (1 km × 1 km) remote-sensing satellite data. Using simulation, we evaluate the accuracy of epidemiological health effect estimates in linear and logistic regression when using spatial air pollution predictions from kriging and land use regression models. We examined chronic (long-term) and acute (short-term) exposure to air pollution. Results varied substantially across different scenarios. Exposure models with low out-of-sample R2 yielded severe biases in the health effect estimates of some models, ranging from 60% upward bias to 70% downward bias. One land use regression exposure model with >0.9 out-of-sample R2 yielded upward biases up to 13% for acute health effect estimates. Almost all models drastically underestimated the SEs. Land use regression models performed better in chronic effect simulations. These results can help researchers when interpreting health effect estimates in these types of studies.

[1]  Yujie Wang,et al.  Multiangle implementation of atmospheric correction (MAIAC): 2. Aerosol algorithm , 2011 .

[2]  Bert Brunekreef,et al.  Estimating Long-Term Average Particulate Air Pollution Concentrations: Application of Traffic Indicators and Geographic Information Systems , 2003, Epidemiology.

[3]  David Ruppert,et al.  Regression with spatially misaligned data , 2008 .

[4]  Joel Schwartz,et al.  Measurement error caused by spatial misalignment in environmental epidemiology. , 2009, Biostatistics.

[5]  J. Schwartz,et al.  Incorporating local land use regression and satellite aerosol optical depth in a hybrid model of spatiotemporal PM2.5 exposures in the Mid-Atlantic states. , 2012, Environmental science & technology.

[6]  Raymond J. Carroll,et al.  Measurement error in nonlinear models: a modern perspective , 2006 .

[7]  J. Schwartz,et al.  Assessing temporally and spatially resolved PM2.5 exposures for epidemiological studies using satellite aerosol optical depth measurements , 2011 .

[8]  S L Zeger,et al.  Exposure measurement error in time-series studies of air pollution: concepts and consequences. , 2000, Environmental health perspectives.

[9]  E. Vermote,et al.  The MODIS Aerosol Algorithm, Products, and Validation , 2005 .

[10]  G. Heiss,et al.  GIS APPROACHES FOR ESTIMATION OF RESIDENTIAL-LEVEL AMBIENT PM CONCENTRATIONS , 2005, Environmental health perspectives.

[11]  Yujie Wang,et al.  An automatic cloud mask algorithm based on time series of MODIS measurements , 2008 .

[12]  Itai Kloog,et al.  Using new satellite based exposure methods to study the association between pregnancy pm2.5 exposure, premature birth and birth weight in Massachusetts , 2012, Environmental Health.

[13]  Yujie Wang,et al.  Multiangle implementation of atmospheric correction (MAIAC): 1. Radiative transfer basis and look-up tables , 2011 .

[14]  Alexei Lyapustin,et al.  A critical assessment of high-resolution aerosol optical depth retrievals for fine particulate matter predictions , 2013 .

[15]  Lianne Sheppard,et al.  Efficient measurement error correction with spatially misaligned data. , 2011, Biostatistics.

[16]  Christopher J Paciorek,et al.  Measurement error in two‐stage analyses, with application to air pollution epidemiology , 2012, Environmetrics.

[17]  Jonathan I Levy,et al.  Land use regression modeling of intra-urban residential variability in multiple traffic-related air pollutants , 2008, Environmental health : a global access science source.

[18]  Lianne Sheppard,et al.  Does more accurate exposure prediction necessarily improve health effect estimates? , 2011, Epidemiology.

[19]  R. Burnett,et al.  Spatial Analysis of Air Pollution and Mortality in Los Angeles , 2005, Epidemiology.

[20]  Marcela Rivera,et al.  Measurement error in epidemiologic studies of air pollution based on land-use regression models. , 2013, American journal of epidemiology.

[21]  Linda J Young,et al.  A comparison of errors in variables methods for use in regression models with spatially misaligned data , 2011, Statistical methods in medical research.

[22]  A. Peters,et al.  Particulate Matter Air Pollution and Cardiovascular Disease: An Update to the Scientific Statement From the American Heart Association , 2010, Circulation.