A Review of Spatial Causal Inference Methods for Environmental and Epidemiological Applications

The scientific rigor and computational methods of causal inference have had great impacts on many disciplines, but have only recently begun to take hold in spatial applications. Spatial casual inference poses analytic challenges due to complex correlation structures and interference between the treatment at one location and the outcomes at others. In this paper, we review the current literature on spatial causal inference and identify areas of future work. We first discuss methods that exploit spatial structure to account for unmeasured confounding variables. We then discuss causal analysis in the presence of spatial interference including several common assumptions used to reduce the complexity of the interference patterns under consideration. These methods are extended to the spatiotemporal case where we compare and contrast the potential outcomes framework with Granger causality, and to geostatistical analyses involving spatial random fields of treatments and responses. The methods are introduced in the context of observational environmental and epidemiological studies, and are compared using both a simulation study and analysis of the effect of ambient air pollution on COVID-19 mortality rate. Code to implement many of the methods using the popular Bayesian software OpenBUGS is provided.

[1]  G. Imbens,et al.  Matching on the Estimated Propensity Score , 2009 .

[2]  H. White,et al.  Granger Causality and Dynamic Structural Systems , 2010 .

[3]  M. Fuentes,et al.  Journal of the American Statistical Association Bayesian Spatial Quantile Regression Bayesian Spatial Quantile Regression , 2022 .

[4]  Corwin M. Zigler,et al.  The Central Role of Bayes’ Theorem for Joint Estimation of Causal Effects and Propensity Scores , 2013, The American statistician.

[5]  P. Cross,et al.  Confronting models with data: the challenges of estimating disease spillover , 2019, Philosophical Transactions of the Royal Society B.

[6]  Michael S. Delgado,et al.  Difference-in-differences techniques for spatial data: Local autocorrelation and spatial interaction ☆ , 2015 .

[7]  Till Bärnighausen,et al.  Regression Discontinuity Designs in Epidemiology: Causal Inference Without Randomized Trials , 2014 .

[8]  Corwin M Zigler,et al.  Causal inference with interfering units for cluster and population level treatment allocation programs , 2017, Biometrics.

[9]  P. Holland Statistics and Causal Inference , 1985 .

[10]  Sw. Banerjee,et al.  Hierarchical Modeling and Analysis for Spatial Data , 2003 .

[11]  Sebastian Schutte,et al.  Matched Wake Analysis: Finding Causal Relationships in Spatiotemporal Event Data , 2014 .

[12]  C. F. Sirmans,et al.  Spatial Modeling With Spatially Varying Coefficient Processes , 2003 .

[13]  Petra E. Todd,et al.  Matching As An Econometric Evaluation Estimator: Evidence from Evaluating a Job Training Programme , 1997 .

[14]  Donald B. Rubin,et al.  Bayesian Inference for Causal Effects: The Role of Randomization , 1978 .

[15]  Christopher J Paciorek,et al.  The importance of scale for spatial-confounding bias and precision of spatial regression estimators. , 2010, Statistical science : a review journal of the Institute of Mathematical Statistics.

[16]  Léo R. Belzile,et al.  A Bayesian view of doubly robust causal inference , 2016, 1701.04093.

[17]  J. Robins,et al.  Estimation of Regression Coefficients When Some Regressors are not Always Observed , 1994 .

[18]  B. Hansen Full Matching in an Observational Study of Coaching for the SAT , 2004 .

[19]  D. Rubin,et al.  Assessing Sensitivity to an Unobserved Binary Covariate in an Observational Study with Binary Outcome , 1983 .

[20]  G. Imbens,et al.  Large Sample Properties of Matching Estimators for Average Treatment Effects , 2004 .

[21]  J. Møller,et al.  Handbook of Spatial Statistics , 2008 .

[22]  Elizabeth A Stuart,et al.  Matching methods for causal inference: A review and a look forward. , 2010, Statistical science : a review journal of the Institute of Mathematical Statistics.

[23]  Edoardo M. Airoldi,et al.  Identification and Estimation of Treatment and Interference Effects in Observational Studies on Networks , 2016, Journal of the American Statistical Association.

[24]  V. Zadnik,et al.  Effects of Residual Smoothing on the Posterior of the Fixed Effects in Disease‐Mapping Models , 2006, Biometrics.

[25]  David L. Kaplan Causal Inference for Observational Studies. , 2018, The Journal of infectious diseases.

[26]  M. Hudgens,et al.  Toward Causal Inference With Interference , 2008, Journal of the American Statistical Association.

[27]  Marie-Abèle Bind,et al.  Causal Modeling in Environmental Health. , 2019, Annual review of public health.

[28]  Eric J Tchetgen Tchetgen,et al.  Interference and Sensitivity Analysis. , 2014, Statistical science : a review journal of the Institute of Mathematical Statistics.

[29]  Brian J. Reich,et al.  A spatiotemporal recommendation engine for malaria control. , 2020, Biostatistics.

[30]  M Elizabeth Halloran,et al.  The Minicommunity Design to Assess Indirect Effects of Vaccination , 2012, Epidemiologic methods.

[31]  B. Reich Spatiotemporal quantile regression for detecting distributional changes in environmental processes , 2012, Journal of the Royal Statistical Society. Series C, Applied statistics.

[32]  Francis W. Zwiers,et al.  Use of models in detection and attribution of climate change , 2011 .

[33]  John M Drake,et al.  Optimal treatment allocations in space and time for on‐line control of an emerging infectious disease , 2018, Journal of the Royal Statistical Society. Series C, Applied statistics.

[34]  P. Diggle,et al.  Geostatistical inference under preferential sampling , 2010 .

[35]  Virgilio Gómez-Rubio,et al.  Spatial Point Patterns: Methodology and Applications with R , 2016 .

[36]  Tyler J VanderWeele,et al.  On causal inference in the presence of interference , 2012, Statistical methods in medical research.

[37]  J. Angrist,et al.  Identification and Estimation of Local Average Treatment Effects , 1995 .

[38]  Addressing geographic confounding through spatial propensity scores: a study of racial disparities in diabetes , 2019, Statistical methods in medical research.

[39]  Francesca Dominici,et al.  Exposure to air pollution and COVID-19 mortality in the United States: A nationwide cross-sectional study , 2020, medRxiv.

[40]  Peter J. Diggle,et al.  Estimation of Spatial Variation in Risk Using Matched Case‐control Data , 2002 .

[41]  M. Hudgens,et al.  On inverse probability-weighted estimators in the presence of interference , 2016, Biometrika.

[42]  Orley Ashenfelter,et al.  Using the Longitudinal Structure of Earnings to Estimate the Effect of Training Programs , 1984 .

[43]  Y. LindaJ. Combining Incompatible Spatial Data , 2003 .

[44]  M. Halloran,et al.  Causal Inference in Infectious Diseases , 1995, Epidemiology.

[45]  Shu Yang,et al.  Generalized propensity score approach to causal inference with spatial interference , 2020, Biometrics.

[46]  D. Rubin Estimating causal effects of treatments in randomized and nonrandomized studies. , 1974 .

[47]  Thomas Kneib,et al.  Structural Equation Models for Dealing With Spatial Confounding , 2018 .

[48]  C. Granger Investigating causal relations by econometric models and cross-spectral methods , 1969 .

[49]  I NICOLETTI,et al.  The Planning of Experiments , 1936, Rivista di clinica pediatrica.

[50]  Corwin M Zigler,et al.  Model Feedback in Bayesian Propensity Score Estimation , 2013, Biometrics.

[51]  Corwin M Zigler,et al.  Adjusting for unmeasured spatial confounding with distance adjusted propensity score matching , 2016, Biostatistics.

[52]  B. Carlin,et al.  Spatial Analyses of Periodontal Data Using Conditionally Autoregressive Priors Having Two Classes of Neighbor Relations , 2007 .

[53]  M. Wall A close look at the spatial structure implied by the CAR and SAR models , 2004 .

[54]  Brian J. Reich,et al.  Spatial Bayesian Nonparametric Methods , 2015 .

[55]  Mevin B. Hooten,et al.  Restricted spatial regression in practice: geostatistical models, confounding, and robustness under model misspecification , 2015 .

[56]  Georgia Papadogeorgou,et al.  Mitigating unobserved spatial confounding when estimating the effect of supermarket access on cardiovascular disease deaths , 2019 .

[57]  J. Aislinn Bohren,et al.  Optimal Design of Experiments in the Presence of Interference , 2017, Review of Economics and Statistics.

[58]  F. Dominici,et al.  Trends in Air Pollution and Mortality: An Approach to the Assessment of Unmeasured Confounding , 2007, Epidemiology.

[59]  Corwin M Zigler,et al.  Bipartite Causal Inference with Interference. , 2018, Statistical science : a review journal of the Institute of Mathematical Statistics.

[60]  P. Schnell,et al.  Mitigating unobserved spatial confounding bias with mixed models , 2019, 1907.12150.

[61]  M. Hudgens,et al.  Assessing effects of cholera vaccination in the presence of interference , 2014, Biometrics.

[62]  G. Imbens,et al.  Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score , 2002 .

[63]  M. Davidian,et al.  Improving efficiency and robustness of the doubly robust estimator for a population mean with incomplete data , 2009, Biometrika.

[64]  G. Imbens,et al.  Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score , 2000 .

[65]  D. Rubin Matched Sampling for Causal Effects , 2006 .

[66]  L. Keele,et al.  Geographic Boundaries as Regression Discontinuities , 2015, Political Analysis.

[67]  J. Hodges,et al.  Adding Spatially-Correlated Errors Can Mess Up the Fixed Effect You Love , 2010 .

[68]  Eric J. Tchetgen Tchetgen,et al.  Auto-G-Computation of Causal Effects on a Network , 2017, Journal of the American Statistical Association.

[69]  Raja Jurdak,et al.  Modeling stochastic processes in disease spread across a heterogeneous social system , 2018, Proceedings of the National Academy of Sciences.

[70]  Shu Yang,et al.  A SPATIAL CAUSAL ANALYSIS OF WILDLAND FIRE-CONTRIBUTED PM2.5 USING NUMERICAL MODEL OUTPUT. , 2020, The annals of applied statistics.

[71]  D. Dunson,et al.  Bayesian geostatistical modelling with informative sampling locations. , 2011, Biometrika.

[72]  Lawrence C McCandless,et al.  The International Journal of Biostatistics CAUSAL INFERENCE Cutting Feedback in Bayesian Regression Adjustment for the Propensity Score , 2011 .

[73]  J. Robins,et al.  Adjusting for differential rates of prophylaxis therapy for PCP in high- versus low-dose AZT treatment arms in an AIDS randomized trial , 1994 .

[74]  Paul R. Rosenbaum,et al.  Optimal Matching for Observational Studies , 1989 .

[75]  J. Robins,et al.  Doubly Robust Estimation in Missing Data and Causal Inference Models , 2005, Biometrics.

[76]  M E Halloran,et al.  Study designs for dependent happenings. , 1991, Epidemiology.

[77]  H. Rue,et al.  An explicit link between Gaussian fields and Gaussian Markov random fields: the stochastic partial differential equation approach , 2011 .

[78]  Corwin M Zigler,et al.  Estimating causal effects of air quality regulations using principal stratification for spatially correlated multivariate intermediate outcomes. , 2012, Biostatistics.

[79]  Zhulin He Inverse Conditional Probability Weighting with Clustered Data in Causal Inference. , 2018, 1808.01647.

[80]  Michael E. Sobel,et al.  What Do Randomized Studies of Housing Mobility Demonstrate? , 2006 .

[81]  G. Imbens,et al.  The Propensity Score with Continuous Treatments , 2005 .

[82]  Murali Haran,et al.  Dimension reduction and alleviation of confounding for spatial generalized linear mixed models , 2010, 1011.6649.

[83]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .

[84]  Andrew Gelman,et al.  Applied Bayesian Modeling And Causal Inference From Incomplete-Data Perspectives , 2005 .

[85]  J. Angrist,et al.  Identification and Estimation of Local Average Treatment Effects , 1994 .

[86]  Olli Saarela,et al.  On Bayesian estimation of marginal structural models , 2015, Biometrics.

[87]  Beat Neuenschwander,et al.  Combining MCMC with ‘sequential’ PKPD modelling , 2009, Journal of Pharmacokinetics and Pharmacodynamics.

[88]  G. Hegerl,et al.  Detection and attribution of climate change: from global to regional , 2013 .

[89]  Natalya Verbitsky-Savitz,et al.  Causal Inference Under Interference in Spatial Settings: A Case Study Evaluating Community Policing Program in Chicago , 2012 .

[90]  A. Gelfand,et al.  Spatial Quantile Multiple Regression Using the Asymmetric Laplace Process , 2012 .

[91]  F. Mealli,et al.  Bipartite Interference and Air Pollution Transport: Estimating Health Effects of Power Plant Interventions. , 2020, 2012.04831.

[92]  Peter M. Aronow,et al.  Estimating Average Causal Effects Under Interference Between Units , 2013, 1305.6156.

[93]  Raja Jurdak,et al.  Modeling stochastic processes in disease spread across a heterogeneous social system , 2018, Proceedings of the National Academy of Sciences.

[94]  M. Eichler Causal inference in time series analysis , 2012 .

[95]  M. Hudgens,et al.  Causal inference from observational studies with clustered interference, with application to a cholera vaccine study , 2017, 1711.04834.