Assessing effect heterogeneity of a randomized treatment using conditional inference trees

Treatment effect heterogeneity occurs when individual characteristics influence the effect of a treatment. We propose a novel approach that combines prognostic score matching and conditional inference trees to characterize effect heterogeneity of a randomized binary treatment. One key feature that distinguishes our method from alternative approaches is that it controls the Type I error rate, that is, the probability of identifying effect heterogeneity if none exists and retains the underlying subgroups. This feature makes our technique particularly appealing in the context of clinical trials, where there may be significant costs associated with erroneously declaring that effects differ across population subgroups. Treatment effect heterogeneity trees are able to identify heterogeneous subgroups, characterize the relevant subgroups and estimate the associated treatment effects. We demonstrate the efficacy of the proposed method using a comprehensive simulation study and illustrate our method using a nutrition trial dataset to evaluate effect heterogeneity within a patient population.

[1]  Elena Zheleva,et al.  Learning Triggers for Heterogeneous Treatment Effects , 2019, AAAI.

[2]  D. Green,et al.  Modeling Heterogeneous Treatment Effects in Survey Experiments with Bayesian Additive Regression Trees , 2012 .

[3]  Paul R. Rosenbaum,et al.  Comparison of Multivariate Matching Methods: Structures, Distances, and Algorithms , 1993 .

[4]  Richard L Kravitz,et al.  Evidence-based medicine, heterogeneity of treatment effects, and the trouble with averages. , 2004, The Milbank quarterly.

[5]  M. J. van der Laan,et al.  Super-Learning of an Optimal Dynamic Treatment Rule , 2016, The international journal of biostatistics.

[6]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .

[7]  Richard L Kravitz,et al.  Dealing with heterogeneity of treatment effects: is the literature up to the challenge? , 2009, Trials.

[8]  B. Goldstein,et al.  Using Machine Learning to Identify Heterogeneous Effects in Randomized Clinical Trials-Moving Beyond the Forest Plot and Into the Forest. , 2019, JAMA network open.

[9]  N. Longford Selection bias and treatment heterogeneity in clinical trials. , 1999, Statistics in medicine.

[10]  J. M. Taylor,et al.  Subgroup identification from randomized clinical trial data , 2011, Statistics in medicine.

[11]  I. Lipkovich,et al.  Tutorial in biostatistics: data‐driven subgroup identification and analysis in clinical trials , 2017, Statistics in medicine.

[12]  R. Jeffery,et al.  Questionnaire and laboratory measures of eating behavior. Associations with energy intake and BMI in a community sample of working adults , 2014, Appetite.

[13]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[14]  S. Basu,et al.  Preventing false discovery of heterogeneous treatment effect subgroups in randomized trials , 2018, Trials.

[15]  Jasjeet S. Sekhon,et al.  Multivariate and Propensity Score Matching Software with Automated Balance Optimization: The Matching Package for R , 2008 .

[16]  Susan Athey,et al.  Recursive partitioning for heterogeneous causal effects , 2015, Proceedings of the National Academy of Sciences.

[17]  Jennifer L. Hill,et al.  Bayesian Nonparametric Modeling for Causal Inference , 2011 .

[18]  J. Steeves,et al.  Toward Inclusive Trial Protocols in Heterogeneous Neurological Disorders , 2015, Neurorehabilitation and neural repair.

[19]  Sören R. Künzel,et al.  Metalearners for estimating heterogeneous treatment effects using machine learning , 2017, Proceedings of the National Academy of Sciences.

[20]  Corwin M Zigler,et al.  Approaches to treatment effect heterogeneity in the presence of confounding , 2019, Statistics in medicine.

[21]  P. Melillo,et al.  Identifying fallers among ophthalmic patients using classification tree methodology , 2017, PloS one.

[22]  Achim Zeileis,et al.  Partykit: a modular toolkit for recursive partytioning in R , 2015, J. Mach. Learn. Res..

[23]  D. Rubin Comment: Which Ifs Have Causal Answers , 1986 .

[24]  Lin Liu,et al.  Estimating heterogeneous treatment effect by balancing heterogeneity and fitness , 2018, BMC Bioinformatics.

[25]  V. Devanarayan,et al.  Patient subgroup identification for clinical drug development , 2017, Statistics in medicine.

[26]  G. Imbens,et al.  Bias-Corrected Matching Estimators for Average Treatment Effects , 2002 .

[27]  B. Hansen The prognostic analogue of the propensity score , 2008 .

[28]  V. Devanarayan,et al.  A PRIM approach to predictive-signature development for patient stratification , 2014, Statistics in medicine.

[29]  G. Imbens,et al.  Bias-Corrected Matching Estimators for Average Treatment Effects , 2011 .

[30]  Jonathan P. Kastellec The Statistical Analysis of Judicial Decisions and Legal Rules with Classification Trees , 2010 .

[31]  J. Morgan,et al.  Problems in the Analysis of Survey Data, and a Proposal , 1963 .

[32]  Trevor Hastie,et al.  Some methods for heterogeneous treatment effect estimation in high dimensions , 2017, Statistics in medicine.

[33]  K. Hornik,et al.  Unbiased Recursive Partitioning: A Conditional Inference Framework , 2006 .

[34]  Stefan Wager,et al.  Estimation and Inference of Heterogeneous Treatment Effects using Random Forests , 2015, Journal of the American Statistical Association.

[35]  J. Wasem,et al.  Regression Trees Identify Relevant Interactions: Can This Improve the Predictive Performance of Risk Adjustment? , 2017, Health economics.

[36]  Lei Liu,et al.  Random forests of interaction trees for estimating individualized treatment effects in randomized trials , 2017, Statistics in medicine.

[37]  M. J. van der Laan,et al.  Statistical Applications in Genetics and Molecular Biology Super Learner , 2010 .

[38]  Wei-Yin Loh,et al.  Subgroup identification for precision medicine: A comparative review of 13 methods , 2019, Wiley Interdiscip. Rev. Data Min. Knowl. Discov..

[39]  Lu Tian,et al.  Adaptive index models for marker-based risk stratification. , 2011, Biostatistics.

[40]  C. Holmes,et al.  Machine learning analysis plans for randomised controlled trials: detecting treatment effect heterogeneity with strict control of type I error , 2020, Trials.

[41]  R. Jeffery,et al.  Portion size effects on weight gain in a free living setting , 2014, Obesity.

[42]  Richard K. Crump,et al.  Nonparametric Tests for Treatment Effect Heterogeneity , 2006, The Review of Economics and Statistics.

[43]  R.C.J.A. van Vliet,et al.  Exploring the predictive power of interaction terms in a sophisticated risk equalization model using regression trees. , 2018 .

[44]  Elizabeth A Stuart,et al.  Prognostic score-based balance measures can be a useful diagnostic for propensity score methods in comparative effectiveness research. , 2013, Journal of clinical epidemiology.

[45]  D. Rubin Randomization Analysis of Experimental Data: The Fisher Randomization Test Comment , 1980 .

[46]  Weng Kee Wong,et al.  A model‐based multithreshold method for subgroup identification , 2019, Statistics in medicine.