Causal inference with observational studies trimmed by the estimated propensity scores

Causal inference with observational studies often relies on the assumptions of unconfoundedness and overlap of covariate distributions in different treatment groups. The overlap assumption is violated when some units have propensity scores close to zero or one, and therefore both theoretical and practical researchers suggest dropping units with extreme estimated propensity scores. We advance the literature in three directions. First, we clarify a conceptual issue of sample trimming by defining causal parameters based on a target population without extreme propensity score. Second, we propose a procedure of smooth weighting, which approximates the existing sample trimming but has better asymptotic properties. The new weighting estimator is asymptotically linear and the bootstrap can be used to construct confidence intervals. Third, we extend the theory to the average treatment effect on the treated, suggesting trimming samples with estimated propensity scores close to one.

[1]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .

[2]  Donald B. Rubin,et al.  Affinely Invariant Matching Methods with Ellipsoidal Distributions , 1992 .

[3]  J. Shao,et al.  The jackknife and bootstrap , 1996 .

[4]  Petra E. Todd,et al.  Matching As An Econometric Evaluation Estimator: Evidence from Evaluating a Job Training Programme , 1997 .

[5]  J. Hahn On the Role of the Propensity Score in Efficient Semiparametric Estimation of Average Treatment Effects , 1998 .

[6]  G. Imbens,et al.  Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score , 2000 .

[7]  Jeffrey A. Smith,et al.  Does Matching Overcome Lalonde's Critique of Nonexperimental Estimators? , 2000 .

[8]  G. Imbens,et al.  Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score , 2002 .

[9]  J. Vincent,et al.  Anemia and blood transfusion in critically ill patients. , 2002, JAMA.

[10]  M. Grzybowski,et al.  Mortality benefit of immediate revascularization of acute ST-segment elevation myocardial infarction in patients with contraindications to thrombolytic therapy: a propensity analysis. , 2003, JAMA.

[11]  J. Lunceford,et al.  Stratification and weighting via the propensity score in estimation of causal treatment effects: a comparative study , 2004, Statistics in medicine.

[12]  J. Robins,et al.  Doubly Robust Estimation in Missing Data and Causal Inference Models , 2005, Biometrics.

[13]  J. Robins,et al.  Results of multivariable logistic regression, propensity matching, propensity adjustment, and propensity-based weighting under conditions of nonuniform effect. , 2006, American journal of epidemiology.

[14]  Gary King,et al.  The Dangers of Extreme Counterfactuals , 2006, Political Analysis.

[15]  Joseph Kang,et al.  Demystifying Double Robustness: A Comparison of Alternative Strategies for Estimating a Population Mean from Incomplete Data , 2007, 0804.2958.

[16]  Joshua D. Angrist,et al.  Mostly Harmless Econometrics: An Empiricist's Companion , 2008 .

[17]  G. Imbens,et al.  Matching on the Estimated Propensity Score , 2009 .

[18]  Richard K. Crump,et al.  Dealing with limited overlap in estimation of average treatment effects , 2009 .

[19]  Elizabeth A Stuart,et al.  Improving propensity score weighting using machine learning , 2010, Statistics in medicine.

[20]  Shakeeb Khan,et al.  Irregular Identification, Support Conditions, and Inverse Weight Estimation , 2010 .

[21]  J. Kmenta Mostly Harmless Econometrics: An Empiricist's Companion , 2010 .

[22]  Dylan S. Small,et al.  Defining the Study Population for an Observational Study to Ensure Sufficient Overlap: A Tree Approach , 2011 .

[23]  Anemia and blood transfusion in the critically ill , 2011 .

[24]  Dylan S. Small,et al.  Calibrating Sensitivity Analyses to Observed Covariates in Observational Studies , 2013, Biometrics.

[25]  Fan Li,et al.  Do debit cards increase household spending? Evidence from a semiparametric causal analysis of a survey , 2014, 1409.2441.

[26]  Dylan S. Small,et al.  Discrete Optimization for Interpretable Study Populations and Randomization Inference in an Observational Study of Severe Sepsis Mortality , 2014, 1411.4873.

[27]  Y. Matsuyama,et al.  Brief Report: Doubly Robust Estimation of Standardized Risk Difference and Ratio in the Exposed Population , 2015, Epidemiology.

[28]  Guido W. Imbens,et al.  Matching Methods in Practice: Three Examples , 2014, The Journal of Human Resources.

[29]  D. Rubin,et al.  Causal Inference for Statistics, Social, and Biomedical Sciences: Sensitivity Analysis and Bounds , 2015 .

[30]  Kari Lock Morgan,et al.  Balancing Covariates via Propensity Score Weighting , 2014, 1609.07494.