Interpretable Dynamic Treatment Regimes

ABSTRACT Precision medicine is currently a topic of great interest in clinical and intervention science.  A key component of precision medicine is that it is evidence-based, that is, data-driven, and consequently there has been tremendous interest in estimation of precision medicine strategies using observational or randomized study data. One way to formalize precision medicine is through a treatment regime, which is a sequence of decision rules, one per stage of clinical intervention, that map up-to-date patient information to a recommended treatment. An optimal treatment regime is defined as maximizing the mean of some cumulative clinical outcome if applied to a population of interest. It is well-known that even under simple generative models an optimal treatment regime can be a highly nonlinear function of patient information. Consequently, a focal point of recent methodological research has been the development of flexible models for estimating optimal treatment regimes. However, in many settings, estimation of an optimal treatment regime is an exploratory analysis intended to generate new hypotheses for subsequent research and not to directly dictate treatment to new patients. In such settings, an estimated treatment regime that is interpretable in a domain context may be of greater value than an unintelligible treatment regime built using “black-box” estimation methods. We propose an estimator of an optimal treatment regime composed of a sequence of decision rules, each expressible as a list of “if-then” statements that can be presented as either a paragraph or as a simple flowchart that is immediately interpretable to domain experts. The discreteness of these lists precludes smooth, that is, gradient-based, methods of estimation and leads to nonstandard asymptotics. Nevertheless, we provide a computationally efficient estimation algorithm, prove consistency of the proposed estimator, and derive rates of convergence. We illustrate the proposed methods using a series of simulation examples and application to data from a sequential clinical trial on bipolar disorder. Supplementary materials for this article are available online.

[1]  Michael R. Kosorok,et al.  Causal nearest neighbor rules for optimal treatment regimes , 2017, 1711.08451.

[2]  Rui Song,et al.  Using pilot data to size a two‐arm randomized trial to find a nearly optimal personalized treatment strategy , 2016, Statistics in medicine.

[3]  Menggang Yu,et al.  Regularized outcome weighted subgroup identification for differential treatment effects , 2015, Biometrics.

[4]  Eric B. Laber,et al.  Tree-based methods for individualized treatment regimes. , 2015, Biometrika.

[5]  Michael R Kosorok,et al.  Residual Weighted Learning for Estimating Individualized Treatment Rules , 2015, Journal of the American Statistical Association.

[6]  D. Longo,et al.  Precision medicine--personalized, problematic, and promising. , 2015, The New England journal of medicine.

[7]  Euan A Ashley,et al.  The precision medicine initiative: a new national effort. , 2015, JAMA.

[8]  Marie Davidian,et al.  Using decision lists to construct interpretable and parsimonious treatment regimes , 2015, Biometrics.

[9]  Donglin Zeng,et al.  New Statistical Learning Methods for Estimating Optimal Dynamic Treatment Regimes , 2015, Journal of the American Statistical Association.

[10]  Iven Van Mechelen,et al.  A Novel Method for Estimating Optimal Tree-Based Treatment Regimes in Randomized Clinical Trials , 2015 .

[11]  Eric B. Laber,et al.  Doubly Robust Learning for Estimating Individualized Treatment with Censored Data. , 2015, Biometrika.

[12]  Wenting Cheng,et al.  Reader reaction to “A robust method for estimating optimal treatment regimes” by Zhang et al. (2012) , 2015, Biometrics.

[13]  F. Collins,et al.  A new initiative on precision medicine. , 2015, The New England journal of medicine.

[14]  Eric B. Laber,et al.  Interactive model building for Q-learning. , 2014, Biometrika.

[15]  Cynthia Rudin,et al.  Falling Rule Lists , 2014, AISTATS.

[16]  Nema Dean,et al.  Q-Learning: Flexible Learning About Useful Utilities , 2013, Statistics in Biosciences.

[17]  Holly Janes,et al.  Combining biomarkers to optimize patient treatment recommendations , 2014, Biometrics.

[18]  H. Krumholz Big data and new knowledge in medicine: the thinking, training, and tools needed for a learning health system. , 2014, Health affairs.

[19]  Abdus S Wahed,et al.  Bayesian Nonparametric Estimation for Dynamic Treatment Regimes With Sequential Transition Times , 2014, Journal of the American Statistical Association.

[20]  Taraneh Abarin,et al.  On Method of Moments Estimation in Linear Mixed Effects Models with Measurement Error on Covariates and Response with Application to a Longitudinal Study of Gene-Environment Interaction , 2014 .

[21]  Marie Davidian,et al.  Robust estimation of optimal dynamic treatment regimes for sequential treatment decisions. , 2013, Biometrika.

[22]  Gábor Lugosi,et al.  Concentration Inequalities - A Nonasymptotic Theory of Independence , 2013, Concentration Inequalities.

[23]  Donglin Zeng,et al.  Estimating Individualized Treatment Rules Using Outcome Weighted Learning , 2012, Journal of the American Statistical Association.

[24]  S. Murphy,et al.  A "SMART" design for building individualized treatment sequences. , 2012, Annual review of clinical psychology.

[25]  Anastasios A. Tsiatis,et al.  Q- and A-learning Methods for Estimating Optimal Dynamic Treatment Regimes , 2012, Statistical science : a review journal of the Institute of Mathematical Statistics.

[26]  M. Kosorok,et al.  Reinforcement Learning Strategies for Clinical Trials in Nonsmall Cell Lung Cancer , 2011, Biometrics.

[27]  S. Murphy,et al.  PERFORMANCE GUARANTEES FOR INDIVIDUALIZED TREATMENT RULES. , 2011, Annals of statistics.

[28]  Inderjit S. Dhillon,et al.  Tackling Box-Constrained Optimization via a New Projected Quasi-Newton Approach , 2010, SIAM J. Sci. Comput..

[29]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[30]  Andreas Christmann,et al.  Support vector machines , 2008, Data Mining and Knowledge Discovery Handbook.

[31]  R. El-Mallakh Adjunctive antidepressant treatment for bipolar depression. , 2007, New England Journal of Medicine.

[32]  David J Miklowitz,et al.  Effectiveness of adjunctive antidepressant treatment for bipolar depression. , 2007, The New England journal of medicine.

[33]  Mario Marchand,et al.  Learning with Decision Lists of Data-Dependent Features , 2005, J. Mach. Learn. Res..

[34]  S. Murphy,et al.  An experimental design for the development of adaptive treatment strategies , 2005, Statistics in medicine.

[35]  D. Kupfer,et al.  Rationale, design, and methods of the systematic treatment enhancement program for bipolar disorder (STEP-BD) , 2003, Biological Psychiatry.

[36]  S. Murphy,et al.  Optimal dynamic treatment regimes , 2003 .

[37]  P. Massart,et al.  About the constants in Talagrand's concentration inequalities for empirical processes , 2000 .

[38]  Jon A. Wellner,et al.  Weak Convergence and Empirical Processes: With Applications to Statistics , 1996 .

[39]  D. Pollard,et al.  Cube Root Asymptotics , 1990 .

[40]  R. Rivest Learning Decision Lists , 1987, Machine Learning.

[41]  C. J. Stone,et al.  Optimal Global Rates of Convergence for Nonparametric Regression , 1982 .

[42]  Eric B. Laber,et al.  Sizing a phase II trial to find a nearly optimal personalized treatment strategy , 2014 .

[43]  Ingo Steinwart,et al.  Optimal regression rates for SVMs using Gaussian kernels , 2013 .

[44]  Min Zhang,et al.  Estimating optimal treatment regimes from a classification perspective , 2012, Stat.

[45]  C. Rudin,et al.  Building Interpretable Classifiers with Rules using Bayesian Analysis , 2012 .

[46]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[47]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[48]  James M. Robins,et al.  Optimal Structural Nested Models for Optimal Sequential Decisions , 2004 .

[49]  O. Bousquet A Bennett concentration inequality and its application to suprema of empirical processes , 2002 .

[50]  Ronald L. Rivest,et al.  Introduction to Algorithms , 1990 .

[51]  G. Wahba,et al.  Some results on Tchebycheffian spline functions , 1971 .