Look before you leap: systematic evaluation of tree-based statistical methods in subgroup identification

ABSTRACT Subgroup analysis, as the key component of personalized medicine development, has attracted a lot of interest in recent years. While a number of exploratory subgroup searching approaches have been proposed, informative evaluation criteria and scenario-based systematic comparison of these methods are still underdeveloped topics. In this article, we propose two evaluation criteria in connection with traditional type I error and power concepts, and another criterion to directly assess recovery performance of the underlying treatment effect structure. Extensive simulation studies are carried out to investigate empirical performance of a variety of tree-based exploratory subgroup methods under the proposed criteria. A real data application is also included to illustrate the necessity and importance of method evaluation.

[1]  Xiaogang Su,et al.  Interaction Trees with Censored Survival Data , 2008, The international journal of biostatistics.

[2]  W. Loh,et al.  REGRESSION TREES WITH UNBIASED VARIABLE SELECTION AND INTERACTION DETECTION , 2002 .

[3]  B. Carlin,et al.  A Bayesian credible subgroups approach to identifying patient subgroups with positive treatment effects , 2016, Biometrics.

[4]  J. Berger,et al.  A Bayesian Approach to Subgroup Identification , 2014, Journal of biopharmaceutical statistics.

[5]  James M Robins,et al.  Structural Nested Cumulative Failure Time Models to Estimate the Effects of Interventions , 2012, Journal of the American Statistical Association.

[6]  Purushottam W. Laud,et al.  A Bayesian subgroup analysis with a zero‐enriched Polya Urn scheme , 2011, Statistics in medicine.

[7]  I. Lipkovich,et al.  Tutorial in biostatistics: data‐driven subgroup identification and analysis in clinical trials , 2017, Statistics in medicine.

[8]  Wesley O. Johnson,et al.  Interaction trees: exploring the differential effects of an intervention programme for breast cancer survivors , 2011 .

[9]  Haoda Fu,et al.  Identification of subgroups with differential treatment effects for longitudinal and multiresponse variables , 2016, Statistics in medicine.

[10]  J. M. Taylor,et al.  Subgroup identification from randomized clinical trial data , 2011, Statistics in medicine.

[11]  W. Loh,et al.  A regression tree approach to identifying subgroups with differential treatment effects , 2014, Statistics in medicine.

[12]  P. Müller,et al.  Subgroup finding via Bayesian additive regression trees , 2017, Statistics in medicine.

[13]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[14]  W. Loh,et al.  Improving the precision of classification trees , 2010, 1011.0608.

[15]  Lu Tian,et al.  A Simple Method for Detecting Interactions between a Treatment and a Large Number of Covariates , 2012, 1212.2995.

[16]  Xiaogang Su,et al.  Subgroup Analysis via Recursive Partitioning , 2009 .

[17]  Donglin Zeng,et al.  Estimating Individualized Treatment Rules Using Outcome Weighted Learning , 2012, Journal of the American Statistical Association.

[18]  Eric B. Laber,et al.  Doubly Robust Learning for Estimating Individualized Treatment with Censored Data. , 2015, Biometrika.

[19]  Richard Schulz,et al.  Moderators of interventions designed to enhance physical and psychological functioning among younger women with early-stage breast cancer. , 2007, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[20]  I. van Mechelen,et al.  Qualitative interaction trees: a tool to identify qualitative treatment–subgroup interactions , 2014, Statistics in medicine.