Estimating Population Average Causal Effects in the Presence of Non-Overlap: A Bayesian Approach

Most causal inference studies rely on the assumption of overlap to estimate population or sample average causal effects. When data suffer from non-overlap, estimation of these estimands requires reliance on model specifications, due to poor data support. All existing methods to address non-overlap, such as trimming or down-weighting data in regions of poor data support, change the estimand so that inference cannot be made on the sample or the underlying population. In environmental health research settings, where study results are often intended to influence policy, population-level inference may be critical, and changes in the estimand can diminish the impact of the study results, because estimates may not be representative of effects in the population of interest to policymakers. Researchers may be willing to make additional, minimal modeling assumptions in order to preserve the ability to estimate population average causal effects. We seek to make two contributions on this topic. First, we propose a flexible, data-driven definition of propensity score overlap and non-overlap regions. Second, we develop a novel Bayesian framework to estimate population average causal effects with minor model dependence and appropriately large uncertainties in the presence of non-overlap and causal effect heterogeneity. In this approach, the tasks of estimating causal effects in the overlap and non-overlap regions are delegated to two distinct models, suited to the degree of data support in each region. Tree ensembles are used to non-parametrically estimate individual causal effects in the overlap region, where the data can speak for themselves. In the non-overlap region, where insufficient data support means reliance on model specification is necessary, individual causal effects are estimated by extrapolating trends from the overlap region via a spline model. The promising performance of our method is demonstrated in simulations. Finally, we utilize our method to perform a novel investigation of the causal effect of natural gas compressor station exposure on cancer outcomes. Code and data to implement the method and reproduce all simulations and analyses is available on Github (https://github.com/rachelnethery/overlap).

[1]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[2]  Martyn T. Smith,et al.  The mechanism of benzene-induced leukemia: a hypothesis and speculations on the causes of leukemia. , 1996, Environmental health perspectives.

[3]  J. Friedman Special Invited Paper-Additive logistic regression: A statistical view of boosting , 2000 .

[4]  Giovanni Parmigiani,et al.  Bayesian Effect Estimation Accounting for Adjustment Uncertainty , 2012, Biometrics.

[5]  Stephen R Cole,et al.  Constructing inverse probability weights for marginal structural models. , 2008, American journal of epidemiology.

[6]  Mark Lunt,et al.  Selecting an Appropriate Caliper Can Be Essential for Achieving Good Balance With Propensity Score Matching , 2013, American journal of epidemiology.

[7]  Yi Lin,et al.  Random Forests and Adaptive Nearest Neighbors , 2006 .

[8]  Antonella Zanobetti,et al.  Association of Short-term Exposure to Air Pollution With Mortality in Older Adults , 2017, JAMA.

[9]  Jennifer L. Hill,et al.  Assessing lack of common support in causal inference using bayesian nonparametrics: Implications for evaluating the effect of breastfeeding on children's cognitive outcomes , 2013, 1311.7244.

[10]  Raluca Mihăescu,et al.  RE: ‘‘TRENDS IN ASTHMA PREVALENCE AND INCIDENCE IN ONTARIO, , 2011 .

[11]  P. Austin An introduction to propensity-score methods for reducing confounding in observational studies , 2011 .

[12]  Jun S. Liu,et al.  Extracting sequence features to predict protein–DNA interactions: a comparative study , 2008, Nucleic acids research.

[13]  F. Frasca,et al.  Worldwide Increasing Incidence of Thyroid Cancer: Update on Epidemiology and Risk Factors , 2013, Journal of cancer epidemiology.

[14]  Purushottam W. Laud,et al.  Nonparametric survival analysis using Bayesian Additive Regression Trees (BART) , 2016, Statistics in medicine.

[15]  Kari Lock Morgan,et al.  Balancing Covariates via Propensity Score Weighting , 2014, 1609.07494.

[16]  Giovanni Parmigiani,et al.  Accounting for uncertainty in confounder and effect modifier selection when estimating average causal effects in generalized linear models , 2015, Biometrics.

[17]  Mohsen Naghavi,et al.  Trends and Patterns of Disparities in Cancer Mortality Among US Counties, 1980-2014 , 2017, JAMA.

[18]  Jürgen Schmidhuber,et al.  Deep learning in neural networks: An overview , 2014, Neural Networks.

[19]  M L Finkel,et al.  Shale gas development and cancer incidence in southwest Pennsylvania. , 2016, Public health.

[20]  Elizabeth L. Ogburn,et al.  Association Between Unconventional Natural Gas Development in the Marcellus Shale and Asthma Exacerbations. , 2016, JAMA internal medicine.

[21]  Daniel Westreich,et al.  Propensity score estimation: neural networks, support vector machines, decision trees (CART), and meta-classifiers as alternatives to logistic regression. , 2010, Journal of clinical epidemiology.

[22]  Dylan S. Small,et al.  Ensemble of trees approaches to risk adjustment for evaluating a hospital’s performance , 2015, Health care management science.

[23]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .

[24]  Kristin E. Porter,et al.  Diagnosing and responding to violations in the positivity assumption , 2012, Statistical methods in medical research.

[25]  Fan Li,et al.  Addressing Extreme Propensity Scores via the Overlap Weights , 2018, American journal of epidemiology.

[26]  D. Rubin Randomization Analysis of Experimental Data: The Fisher Randomization Test Comment , 1980 .

[27]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[28]  R. Lalonde Evaluating the Econometric Evaluations of Training Programs with Experimental Data , 1984 .

[29]  Kelly K Ferguson,et al.  Environmental phthalate exposure and preterm birth. , 2014, JAMA pediatrics.

[30]  A. Linero Bayesian Regression Trees for High-Dimensional Prediction and Variable Selection , 2018 .

[31]  Roee Gutman,et al.  Estimation of causal effects of binary treatments in unconfounded studies , 2015, Statistics in medicine.

[32]  W. G. Cochran,et al.  Controlling Bias in Observational Studies: A Review. , 1974 .

[33]  Hao Wang,et al.  Multinomial probit Bayesian additive regression trees , 2016, Stat.

[34]  John Howard,et al.  Minimum Latency & Types or Categories of Cancer , 2014 .

[35]  Gary King,et al.  The Dangers of Extreme Counterfactuals , 2006, Political Analysis.

[36]  Luc Devroye,et al.  Consistency of Random Forests and Other Averaging Classifiers , 2008, J. Mach. Learn. Res..

[37]  Edward J. Bedrick,et al.  Childhood hematologic cancer and residential proximity to oil and gas development , 2017, PloS one.

[38]  E. Marco,et al.  Predicting chromatin organization using histone marks , 2015, Genome Biology.

[39]  R Fisher,et al.  Design of Experiments , 1936 .

[40]  Kim-Anh Do,et al.  Bayesian ensemble methods for survival prediction in gene expression data , 2011, Bioinform..

[41]  B. Conti,et al.  Benzene, an experimental multipotential carcinogen: results of the long-term bioassays performed at the Bologna Institute of Oncology. , 1989, Environmental health perspectives.

[42]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[43]  Corwin M Zigler,et al.  Replication Data for: Posterior Predictive Treatment Assignment for Estimating Causal Effects with Limited Overlap , 2017, 1710.08749.

[44]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[45]  Chung-Ho Lin,et al.  Endocrine-Disrupting Chemicals and Oil and Natural Gas Operations: Potential Environmental Contamination and Recommendations to Assess Complex Environmental Mixtures , 2015, Environmental health perspectives.

[46]  W P Watson,et al.  Possible mechanisms of carcinogenesis after exposure to benzene. , 1999, IARC scientific publications.

[47]  Yan Wang,et al.  Air Pollution and Mortality in the Medicare Population , 2017, The New England journal of medicine.

[48]  Alexander D'Amour,et al.  Overlap in observational studies with high-dimensional covariates , 2017, Journal of Econometrics.

[49]  Guo-Cheng Yuan,et al.  Prediction of Polycomb target genes in mouse embryonic stem cells. , 2010, Genomics.

[50]  Gary King,et al.  Matching as Nonparametric Preprocessing for Reducing Model Dependence in Parametric Causal Inference , 2007, Political Analysis.

[51]  G. Tutz,et al.  An introduction to recursive partitioning: rationale, application, and characteristics of classification and regression trees, bagging, and random forests. , 2009, Psychological methods.

[52]  P. Austin An Introduction to Propensity Score Methods for Reducing the Effects of Confounding in Observational Studies , 2011, Multivariate behavioral research.

[53]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.