The Value of Knowing the Propensity Score for Estimating Average Treatment Effects

In a treatment effect model with unconfoundedness, treatment assignments are not only independent of potential outcomes given the covariates, but also given the propensity score alone. Despite this powerful dimension reduction property, adjusting for the propensity score is known to lead to an estimator of the average treatment effect with lower asymptotic efficiency than one based on adjusting for all covariates. Moreover, knowledge of the propensity score does not change the efficiency bound for estimating average treatment effects, and many empirical strategies are more efficient when an estimate of the propensity score is used instead of its true value. Here, we resolve this "propensity score paradox" by demonstrating the value of knowledge of the propensity score. We show that by exploiting such knowledge properly, it is possible to construct an efficient treatment effect estimator that is not affected by the "curse of dimensionality", which yields desirable second order asymptotic properties and finite sample performance. The method combines knowledge of the propensity score with a nonparametric adjustment for covariates, building on ideas from the literature on double robust estimation. It is straightforward to implement, and performs well in simulations. We also show that confidence intervals based on our estimator and a simple variance estimate have remarkably robust coverage properties with respect to the implementation details of the nonparametric adjustment step.

[1]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .

[2]  W. Newey,et al.  Large sample estimation and hypothesis testing , 1986 .

[3]  Jianqing Fan Local Linear Regression Smoothers and Their Minimax Efficiencies , 1993 .

[4]  M. Wand,et al.  Multivariate Locally Weighted Least Squares Regression , 1994 .

[5]  J. Robins,et al.  Estimation of Regression Coefficients When Some Regressors are not Always Observed , 1994 .

[6]  W. Newey,et al.  The asymptotic variance of semiparametric estimators , 1994 .

[7]  Jianqing Fan,et al.  Local polynomial modelling and its applications , 1994 .

[8]  J. Robins,et al.  Semiparametric Efficiency in Multivariate Regression Models with Missing Data , 1995 .

[9]  Oliver Linton,et al.  SECOND ORDER APPROXIMATION IN THE PARTIALLY LINEAR REGRESSION MODEL , 1995 .

[10]  Elias Masry,et al.  MULTIVARIATE LOCAL POLYNOMIAL REGRESSION FOR TIME SERIES:UNIFORM STRONG CONSISTENCY AND RATES , 1996 .

[11]  W. Newey,et al.  Convergence rates and asymptotic normality for series estimators , 1997 .

[12]  J. Robins,et al.  Toward a curse of dimensionality appropriate (CODA) asymptotic theory for semi-parametric models. , 1997, Statistics in medicine.

[13]  J. Hahn On the Role of the Propensity Score in Efficient Semiparametric Estimation of Average Treatment Effects , 1998 .

[14]  A. V. D. Vaart Asymptotic Statistics: Delta Method , 1998 .

[15]  J. Robins,et al.  Adjusting for Nonignorable Drop-Out Using Semiparametric Nonresponse Models , 1999 .

[16]  G. Imbens,et al.  Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score , 2000 .

[17]  Xiaohong Chen,et al.  Estimation of Semiparametric Models When the Criterion Function is Not Smooth , 2002 .

[18]  Jinyong Hahn,et al.  When to Control for Covariates? Panel Asymptotics for Estimates of Treatment Effects , 2004, Review of Economics and Statistics.

[19]  Kosuke Imai,et al.  Causal Inference With General Treatment Regimes , 2004 .

[20]  G. Imbens,et al.  Mean-Squared-Error Calculations for Average Treatment Effects , 2005 .

[21]  Hidehiko Ichimura,et al.  Characterization of the asymptotic distribution of semiparametric M-estimators , 2006 .

[22]  J. L. Ojeda,et al.  Hölder continuity properties of the local polynomial estimator , 2008 .

[23]  Xiaohong Chen,et al.  Semiparametric efficiency in GMM models with auxiliary data , 2007, 0705.0069.

[24]  G. Imbens,et al.  Matching on the Estimated Propensity Score , 2009 .

[25]  Matias D. Cattaneo,et al.  Efficient semiparametric estimation of multi-valued treatment effects under ignorability , 2010 .

[26]  Shakeeb Khan,et al.  Irregular Identification, Support Conditions, and Inverse Weight Estimation , 2010 .

[27]  J. Kmenta Mostly Harmless Econometrics: An Empiricist's Companion , 2010 .

[28]  Geert Ridder,et al.  Asymptotic Variance of Semiparametric Estimators With Generated Regressors , 2013 .

[29]  C. Rothe,et al.  Semiparametric Estimation and Inference Using Doubly Robust Moment Conditions , 2013, SSRN Electronic Journal.

[30]  Enno Mammen,et al.  SEMIPARAMETRIC ESTIMATION WITH GENERATED COVARIATES , 2015, Econometric Theory.

[31]  Johan Segers,et al.  On the weak convergence of the empirical conditional copula under a simplifying assumption , 2015, J. Multivar. Anal..