Personalized Dose Finding Using Outcome Weighted Learning

ABSTRACT In dose-finding clinical trials, it is becoming increasingly important to account for individual-level heterogeneity while searching for optimal doses to ensure an optimal individualized dose rule (IDR) maximizes the expected beneficial clinical outcome for each individual. In this article, we advocate a randomized trial design where candidate dose levels assigned to study subjects are randomly chosen from a continuous distribution within a safe range. To estimate the optimal IDR using such data, we propose an outcome weighted learning method based on a nonconvex loss function, which can be solved efficiently using a difference of convex functions algorithm. The consistency and convergence rate for the estimated IDR are derived, and its small-sample performance is evaluated via simulation studies. We demonstrate that the proposed method outperforms competing approaches. Finally, we illustrate this method using data from a cohort study for warfarin (an anti-thrombotic drug) dosing. Supplementary materials for this article are available online.

[1]  Michael R Kosorok,et al.  Residual Weighted Learning for Estimating Individualized Treatment Rules , 2015, Journal of the American Statistical Association.

[2]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[3]  S. Murphy,et al.  PERFORMANCE GUARANTEES FOR INDIVIDUALIZED TREATMENT RULES. , 2011, Annals of statistics.

[4]  D. Hunter,et al.  A Tutorial on MM Algorithms , 2004 .

[5]  David A Stephens,et al.  Simulating sequential multiple assignment randomized trials to generate optimal personalized warfarin dosing strategies , 2014, Clinical trials.

[6]  Donald B. Rubin,et al.  Bayesian Inference for Causal Effects: The Role of Randomization , 1978 .

[7]  M. Kramer,et al.  Estimating Response-Maximized Decision Rules With Applications to Breastfeeding , 2009 .

[8]  M. Kosorok,et al.  Reinforcement learning design for cancer clinical trials , 2009, Statistics in medicine.

[9]  R. Tibshirani,et al.  Regression shrinkage and selection via the lasso: a retrospective , 2011 .

[10]  Nema Dean,et al.  Q-Learning: Flexible Learning About Useful Utilities , 2013, Statistics in Biosciences.

[11]  Ingo Steinwart,et al.  Optimal regression rates for SVMs using Gaussian kernels , 2013 .

[12]  Le Thi Hoai An,et al.  Solving a Class of Linearly Constrained Indefinite Quadratic Problems by D.C. Algorithms , 1997, J. Glob. Optim..

[13]  G. Wahba,et al.  Some results on Tchebycheffian spline functions , 1971 .

[14]  Eric B. Laber,et al.  A Robust Method for Estimating Optimal Treatment Regimes , 2012, Biometrics.

[15]  Mark Crowther,et al.  Systematic overview of warfarin and its drug and food interactions. , 2005, Archives of internal medicine.

[16]  Kosuke Imai,et al.  Causal Inference With General Treatment Regimes , 2004 .

[17]  Harriet Black Nembhard,et al.  Statistical Methods for Dose-Finding Experiments , 2008, Technometrics.

[18]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[19]  Tong Zhang Statistical behavior and consistency of classification methods based on convex risk minimization , 2003 .

[20]  Peter Buhlmann,et al.  BOOSTING ALGORITHMS: REGULARIZATION, PREDICTION AND MODEL FITTING , 2007, 0804.2752.

[21]  Daniel J. Lizotte,et al.  Set‐valued dynamic treatment regimes for competing outcomes , 2012, Biometrics.

[22]  S. Murphy,et al.  Optimal dynamic treatment regimes , 2003 .

[23]  James M. Robins,et al.  Optimal Structural Nested Models for Optimal Sequential Decisions , 2004 .

[24]  Andreas Christmann,et al.  Support vector machines , 2008, Data Mining and Knowledge Discovery Handbook.

[25]  Michael P Wallace,et al.  Doubly‐robust dynamic treatment regimen estimation via weighted least squares , 2015, Biometrics.

[26]  Fan Wu,et al.  Predicting warfarin dosage from clinical data: A supervised learning approach , 2012, Artif. Intell. Medicine.

[27]  Donglin Zeng,et al.  Estimating Individualized Treatment Rules Using Outcome Weighted Learning , 2012, Journal of the American Statistical Association.

[28]  P F Thall,et al.  A strategy for dose-finding and safety monitoring based on efficacy and adverse outcomes in phase I/II clinical trials. , 1998, Biometrics.

[29]  Anastasios A. Tsiatis,et al.  Q- and A-learning Methods for Estimating Optimal Dynamic Treatment Regimes , 2012, Statistical science : a review journal of the Institute of Mathematical Statistics.

[30]  Á. Avezum,et al.  A validated prediction model for all forms of acute coronary syndrome: estimating the risk of 6-month postdischarge death in an international registry. , 2004, JAMA.

[31]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[32]  Debashis Ghosh,et al.  A Boosting Algorithm for Estimating Generalized Propensity Scores with Continuous Treatments , 2015, Journal of causal inference.

[33]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[34]  Min Zhang,et al.  Estimating optimal treatment regimes from a classification perspective , 2012, Stat.

[35]  Kurt Hornik,et al.  kernlab - An S4 Package for Kernel Methods in R , 2004 .

[36]  Phil Ansell,et al.  Regret‐Regression for Optimal Dynamic Treatment Regimes , 2010, Biometrics.

[37]  Bibhas Chakraborty,et al.  Q‐learning for estimating optimal dynamic treatment rules from observational data , 2012, The Canadian journal of statistics = Revue canadienne de statistique.

[38]  R. Altman,et al.  Estimation of the warfarin dose with clinical and pharmacogenetic data. , 2009, The New England journal of medicine.

[39]  D. Marlowe,et al.  Adapting judicial supervision to the risk level of drug offenders: discharge and 6-month outcomes from a prospective matching study. , 2007, Drug and alcohol dependence.

[40]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[41]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[42]  S. Chevret,et al.  Statistical Methods for Dose-Finding Experiments: Chevret/Statistical Methods for Dose-Finding Experiments , 2006 .

[43]  B. Chakraborty,et al.  Statistical methods for dynamic treatment regimes , 2013 .

[44]  Hajime Uno,et al.  Calibrating parametric subject-specific risk estimation. , 2010, Biometrika.