论文信息 - Debiased machine learning of conditional average treatment effects and other causal functions

Debiased machine learning of conditional average treatment effects and other causal functions

This paper provides estimation and inference methods for the best linear predictor (approximation) of a structural function, such as conditional average structural and treatment effects, and structural derivatives, based on modern machine learning (ML) tools. We represent this structural function as a conditional expectation of an unbiased signal that depends on a nuisance parameter, which we estimate by modern machine learning techniques. We first adjust the signal to make it insensitive (Neyman-orthogonal) with respect to the first-stage regularization bias. We then project the signal onto a set of basis functions, growing with sample size, which gives us the best linear predictor of the structural function. We derive a complete set of results for estimation and simultaneous inference on all parameters of the best linear predictor, conducting inference by Gaussian bootstrap. When the structural function is smooth and the basis is sufficiently rich, our estimation and inference result automatically targets this function. When basis functions are group indicators, the best linear predictor reduces to group average treatment/structural effect, and our inference automatically targets these parameters. We demonstrate our method by estimating uniform confidence bands for the average price elasticity of gasoline demand conditional on income.

Victor Chernozhukov | Vira Semenova

[1] Susan Athey,et al. Recursive partitioning for heterogeneous causal effects , 2015, Proceedings of the National Academy of Sciences.

[2] D. Rubin,et al. The central role of the propensity score in observational studies for causal effects , 1983 .

[3] G. Imbens,et al. Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score , 2000 .

[4] Edward H Kennedy,et al. Non‐parametric methods for doubly robust estimation of continuous treatment effects , 2015, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[5] Johannes Schmidt-Hieber,et al. Nonparametric regression using deep neural networks with ReLU activation function , 2017, The Annals of Statistics.

[6] M. Rudelson. Random Vectors in the Isotropic Position , 1996, math/9608208.

[7] Justin Grimmer,et al. Estimating Heterogeneous Treatment Effects and the Effects of Heterogeneous Treatments with Ensemble Methods , 2017, Political Analysis.

[8] Robert P. Lieli,et al. Estimation of Conditional Average Treatment Effects With High-Dimensional Data , 2019, Journal of Business & Economic Statistics.

[9] Robert P. Lieli,et al. Estimating Conditional Average Treatment Effects , 2014 .

[10] Prem S. Puri,et al. On Optimal Asymptotic Tests of Composite Statistical Hypotheses , 1967 .

[11] Michael Lechner,et al. Nonparametric estimation of causal heterogeneity under high-dimensional confounding , 2019, 1908.08779.