Paper information - Machine learning in policy evaluation: new tools for causal inference

Machine learning in policy evaluation: new tools for causal inference

Machine learning (ML) methods have received a lot of attention in recent years, but they are primarily designed for prediction. Empirical researchers conducting policy evaluations are, on the other hand, preoccupied with causal problems, trying to answer counterfactual questions: what would have happened in the absence of a policy? Because these counterfactuals can never be directly observed (the "fundamental problem of causal inference"), prediction tools from the ML literature cannot be readily used for causal inference. In the last decade, major innovations have incorporated supervised ML tools into estimators for causal parameters such as the average treatment effect (ATE). This holds the promise of attenuating model misspecification issues and increasing transparency in model selection. One particularly mature strand of the literature includes approaches that incorporate supervised ML in the estimation of the ATE of a binary treatment, under the unconfoundedness and positivity assumptions (also known as the exchangeability and overlap assumptions). This article reviews popular supervised machine learning algorithms, including the Super Learner. It then introduces and illustrates three specific uses of machine learning for treatment effect estimation: (1) creating balance between treated and control groups, (2) estimating so-called nuisance models (e.g. the propensity score, or conditional expectations of the outcome) in semi-parametric estimators that target causal parameters (e.g. targeted maximum likelihood estimation or the double ML estimator), and (3) selecting variables in situations with a high number of covariates.
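To make the role of nuisance models concrete, the following is a minimal sketch, not the authors' implementation, of an augmented inverse-probability-weighted (doubly robust) ATE estimate for a binary treatment. It assumes unconfoundedness and positivity hold given a single binary covariate, and it uses simple within-cell means as stand-ins for the propensity score and outcome regressions that an ML learner would normally supply. The function names (`aipw_ate`, `cell_mean_propensity`, `cell_mean_outcome`) and the toy data are hypothetical and chosen only for illustration.

```python
def _mean(values):
    values = list(values)
    return sum(values) / len(values)


def cell_mean_propensity(x, a):
    """Estimate e(x) = P(A=1 | X=x) as the treated share in each covariate cell."""
    table = {v: _mean(ai for xi, ai in zip(x, a) if xi == v) for v in set(x)}
    return lambda v: table[v]


def cell_mean_outcome(x, a, y, arm):
    """Estimate mu_arm(x) = E[Y | A=arm, X=x] as the cell mean of the outcome.

    Raises ZeroDivisionError if a cell has no units in the given arm,
    which corresponds to a violation of positivity in the sample.
    """
    table = {
        v: _mean(yi for xi, ai, yi in zip(x, a, y) if xi == v and ai == arm)
        for v in set(x)
    }
    return lambda v: table[v]


def aipw_ate(x, a, y, fit_propensity, fit_outcome):
    """Average the doubly robust score over all units to estimate the ATE."""
    e = fit_propensity(x, a)
    mu1 = fit_outcome(x, a, y, 1)
    mu0 = fit_outcome(x, a, y, 0)
    scores = []
    for xi, ai, yi in zip(x, a, y):
        m1, m0, ei = mu1(xi), mu0(xi), e(xi)
        # Outcome-model contrast plus inverse-probability-weighted residuals.
        score = (m1 - m0
                 + ai * (yi - m1) / ei
                 - (1 - ai) * (yi - m0) / (1 - ei))
        scores.append(score)
    return _mean(scores)


# Hypothetical toy data: X raises both the chance of treatment and the outcome,
# so the unadjusted comparison is confounded. The effect built into y is 1.
x = [0, 0, 0, 0, 1, 1, 1, 1]
a = [1, 0, 0, 0, 1, 1, 1, 0]
y = [1, 0, 1, -1, 2, 3, 4, 2]

naive = (_mean(yi for ai, yi in zip(a, y) if ai == 1)
         - _mean(yi for ai, yi in zip(a, y) if ai == 0))
ate = aipw_ate(x, a, y, cell_mean_propensity, cell_mean_outcome)

print("naive difference in means:", naive)
print("AIPW estimate:", ate)
```

On this toy data the unadjusted difference in means is about twice the effect built into the outcomes, while the adjusted estimate recovers it. In practice the two cell-mean functions would be replaced by flexible learners, typically fitted with sample splitting, which this sketch omits.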

Noemi Kreif | Karla DiazOrdaz
