What Makes Forest-Based Heterogeneous Treatment Effect Estimators Work?

Estimation of heterogeneous treatment effects (HTE) is of prime importance in many disciplines, ranging from personalized medicine to economics among many others. Random forests have been shown to be a flexible and powerful approach to HTE estimation in both randomized trials and observational studies. In particular “causal forests”, introduced by Athey, Tibshirani, and Wager (2019), along with the R implementation in package grf were rapidly adopted. A related approach, called “model-based forests”, that is geared towards randomized trials and simultaneously captures effects of both prognostic and predictive variables, was introduced by Seibold, Zeileis, and Hothorn (2018) along with a modular implementation in the R package model4you. Here, we present a unifying view that goes beyond the theoretical motivations and investigates which computational elements make causal forests so successful and how these can be blended with the strengths of model-based forests. To do so, we show that both methods can be understood in terms of the same parameters and model assumptions for an additive model under L2 loss. This theoretical insight allows us to implement several flavors of “model-based causal forests” and dissect their different elements in silico. The original causal forests and model-based forests are compared with the new blended versions in a benchmark study exploring both randomized trials and observational settings. In the randomized setting, both approaches performed akin. If confounding was present in the data generating process, we found local centering of the treatment indicator with the corresponding propensities to be the main driver for good performance. Local centering of the outcome was less important, and might be replaced or enhanced by simultaneous split selection with respect to both prognostic and predictive effects. This lays the foundation for future research combining random forests for HTE estimation with other types of models. We demonstrate the practical aspects of such a model-agnostic approach to HTE estimation analyzing the effect of cesarean section on postpartum blood loss in comparison to vaginal delivery. Clearly, randomization is hardly possible in this setup, and we present a tailored model-based forest for skewed and interval-censored data to infer possible predictive variables and their impact on the treatment effect.

[1]  Achim Zeileis,et al.  Predictive Distribution Modeling Using Transformation Forests , 2021, J. Comput. Graph. Stat..

[2]  T. Hothorn,et al.  Model-based random forests for ordinal regression , 2020, The international journal of biostatistics.

[3]  Estimating heterogeneous treatment effects with right-censored data via causal survival forests , 2020, ArXiv.

[4]  T. Hothorn,et al.  The impact of prepartum factor XIII activity on postpartum blood loss , 2019, Zeitschrift für Geburtshilfe und Neonatologie.

[5]  Doubly robust treatment effect estimation with missing attributes , 2019, The Annals of Applied Statistics.

[6]  Achim Zeileis,et al.  model4you: An R Package for Personalised Treatment Effect Estimation , 2019 .

[7]  S. Athey,et al.  Estimating Treatment Effects with Causal Forests: An Application , 2019, Observational Studies.

[8]  Korepanova Natalia,et al.  Survival Forests under Test: Impact of the Proportional Hazards Assumption on Prognostic and Predictive Forests for ALS Survival , 2019 .

[9]  Sören R. Künzel,et al.  Metalearners for estimating heterogeneous treatment effects using machine learning , 2017, Proceedings of the National Academy of Sciences.

[10]  S. Athey,et al.  Generalized random forests , 2016, The Annals of Statistics.

[11]  Hemant Ishwaran,et al.  Random Survival Forests , 2008, Wiley StatsRef: Statistics Reference Online.

[12]  T. Hothorn,et al.  Individual treatment effect prediction for amyotrophic lateral sclerosis patients , 2018, Statistical methods in medical research.

[13]  T Hothorn,et al.  Detecting treatment-subgroup interactions in clustered data with generalized linear mixed-effects model trees , 2017, Behavior Research Methods.

[14]  C. Haslinger,et al.  Validation of a quantitative system for real-time measurement of postpartum blood loss , 2018, Archives of Gynecology and Obstetrics.

[15]  Trevor Hastie,et al.  Some methods for heterogeneous treatment effect estimation in high dimensions , 2017, Statistics in medicine.

[16]  Hemant Ishwaran,et al.  Estimating Individual Treatment Effect in Observational Data Using Random Forest Methods , 2017, Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America.

[17]  Stefan Wager,et al.  Estimation and Inference of Heterogeneous Treatment Effects using Random Forests , 2015, Journal of the American Statistical Association.

[18]  Xinkun Nie,et al.  Quasi-oracle estimation of heterogeneous treatment effects , 2017, Biometrika.

[19]  P. Müller,et al.  Subgroup inference for multiple treatments and multiple endpoints in an Alzheimer’s disease treatment trial , 2017 .

[20]  J. Robins,et al.  Double/Debiased Machine Learning for Treatment and Structural Parameters , 2017 .

[21]  Fei Tang,et al.  Random forest missing data algorithms , 2017, Stat. Anal. Data Min..

[22]  Achim Zeileis,et al.  Model-Based Recursive Partitioning for Subgroup Analyses , 2016, The international journal of biostatistics.

[23]  Susan Athey,et al.  Recursive partitioning for heterogeneous causal effects , 2015, Proceedings of the National Academy of Sciences.

[24]  Torsten Hothorn,et al.  Most Likely Transformations , 2015, 1508.06749.

[25]  Achim Zeileis,et al.  partykit : A Toolkit for Recursive Partytioning , 2015 .

[26]  Achim Zeileis,et al.  Partykit: a modular toolkit for recursive partytioning in R , 2015, J. Mach. Learn. Res..

[27]  D. Rubin,et al.  Causal Inference for Statistics, Social, and Biomedical Sciences: Sensitivity Analysis and Bounds , 2015 .

[28]  James D. Malley,et al.  Synthetic learning machines , 2014, BioData Mining.

[29]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[30]  Jason H. Moore,et al.  Risk estimation using probability machines , 2014, BioData Mining.

[31]  Xiao-Hua Zhou,et al.  Generalized propensity score for estimating the average treatment effect of multiple treatments , 2012, Statistics in medicine.

[32]  J. M. Taylor,et al.  Subgroup identification from randomized clinical trial data , 2011, Statistics in medicine.

[33]  Jennifer L. Hill,et al.  Bayesian Nonparametric Modeling for Causal Inference , 2011 .

[34]  H. Chipman,et al.  BART: Bayesian Additive Regression Trees , 2008, 0806.3286.

[35]  K. Hornik,et al.  Model-Based Recursive Partitioning , 2008 .

[36]  T. Hothorn,et al.  Simultaneous Inference in General Parametric Models , 2008, Biometrical journal. Biometrische Zeitschrift.

[37]  Nicolai Meinshausen,et al.  Quantile Regression Forests , 2006, J. Mach. Learn. Res..

[38]  K. Hornik,et al.  Unbiased Recursive Partitioning: A Conditional Inference Framework , 2006 .

[39]  Yi Lin,et al.  Random Forests and Adaptive Nearest Neighbors , 2006 .

[40]  Elaine L. Zanutto,et al.  Using Propensity Score Subclassification for Multiple Treatment Doses to Evaluate a National Antidrug Media Campaign , 2005 .

[41]  Torsten Hothorn,et al.  Bagging survival trees , 2002, Statistics in medicine.

[42]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[43]  E. Kandel,et al.  Proceedings of the National Academy of Sciences of the United States of America. Annual subject and author indexes. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[44]  P. Robinson ROOT-N-CONSISTENT SEMIPARAMETRIC REGRESSION , 1988 .

[45]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .