Bayesian Causal Forests & the 2022 ACIC Data Challenge: Scalability and Sensitivity

Abstract: We demonstrate how the Bayesian Causal Forests (BCF) model of Hahn et al. can be used to estimate conditional average treatment effects for the longitudinal dataset in the 2022 American Causal Inference Conference (ACIC) Data Challenge. Unfortunately, existing implementations of BCF do not scale to the size of the challenge data. Therefore, we developed flexBCF, a more scalable and flexible implementation of BCF, and used it in our challenge submission. We investigate the sensitivity of our results to the choice of propensity score estimation method and to the use of sparsity-inducing regression tree priors. While our overall point predictions were not especially sensitive to these modeling choices, we did observe that running BCF with flexibly estimated propensity scores often yielded better-calibrated uncertainty intervals.

Sameer K. Deshpande | Hyunseung Kang | Ajinkya Kokandakar

[1] Robert E. McCulloch, et al. Nonparametric Machine Learning and Efficient Computation with Bayesian Additive Regression Trees: The BART R Package, 2021, J. Stat. Softw.

[2] A. Linero. Bayesian Regression Trees for High-Dimensional Prediction and Variable Selection, 2018.

[3] P. Richard Hahn, et al. Bayesian Regression Tree Models for Causal Inference: Regularization, Confounding, and Heterogeneous Effects, 2017, arXiv:1706.09523.

[4] K. Imai, et al. Covariate balancing propensity score, 2014.

[5] Trevor Hastie, et al. Regularization Paths for Cox's Proportional Hazards Model via Coordinate Descent, 2011, Journal of Statistical Software.

[6] Jennifer L. Hill, et al. Bayesian Nonparametric Modeling for Causal Inference, 2011.

[7] H. Chipman, et al. BART: Bayesian Additive Regression Trees, 2008, arXiv:0806.3286.

[8] T. Speed, et al. On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9, 1990.

[9] D. Rubin. [On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9.] Comment: Neyman (1923) and Causal Inference in Experiments and Observational Studies, 1990.

[10] P. Rosenbaum. The Consequences of Adjustment for a Concomitant Variable that Has Been Affected by the Treatment, 1984.

[11] D. Basu. Randomization Analysis of Experimental Data: The Fisher Randomization Test, 1980.

[12] D. Rubin. Estimating causal effects of treatments in randomized and nonrandomized studies, 1974.

[13] M. P. Dumont. Comment …, 1970.

[14] Sameer K. Deshpande. A new BART prior for flexible modeling with categorical predictors, 2022.