Boosting Algorithms for Estimating Optimal Individualized Treatment Rules

We present nonparametric algorithms for estimating optimal individualized treatment rules. The proposed algorithms are based on the XGBoost algorithm, which is known as one of the most powerful algorithms in the machine learning literature. Our main idea is to model the conditional mean of clinical outcome or the decision rule via additive regression trees, and use the boosting technique to estimate each single tree iteratively. Our approaches overcome the challenge of correct model specification, which is required in current parametric methods. The major contribution of our proposed algorithms is providing efficient and accurate estimation of the highly nonlinear and complex optimal individualized treatment rules that often arise in practice. Finally, we illustrate the superior performance of our algorithms by extensive simulation studies and conclude with an application to the real data from a diabetes Phase III trial.

[1]  S. Murphy,et al.  Optimal dynamic treatment regimes , 2003 .

[2]  M. Hanefeld,et al.  A long‐term comparison of pioglitazone and gliclazide in patients with Type 2 diabetes mellitus: a randomized, double‐blind, parallel‐group comparison trial , 2005, Diabetic medicine : a journal of the British Diabetic Association.

[3]  Lerato Mohapi,et al.  Timing of antiretroviral therapy for HIV-1 infection and tuberculosis. , 2011, The New England journal of medicine.

[4]  S. Murphy,et al.  PERFORMANCE GUARANTEES FOR INDIVIDUALIZED TREATMENT RULES. , 2011, Annals of statistics.

[5]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[6]  Scott Lundberg,et al.  A Unified Approach to Interpreting Model Predictions , 2017, NIPS.

[7]  D. Ruppert The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .

[8]  Yufeng Liu,et al.  Multicategory Outcome Weighted Margin-based Learning for Estimating Individualized Treatment Rules. , 2020, Statistica Sinica.

[9]  Nuno Vasconcelos,et al.  Multiclass Boosting: Theory and Algorithms , 2011, NIPS.

[10]  Yufeng Liu,et al.  Robust Truncated Hinge Loss Support Vector Machines , 2007 .

[11]  Haoda Fu,et al.  Estimating optimal treatment regimes via subgroup identification in randomized control trials and observational studies , 2016, Statistics in medicine.

[12]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[13]  J. Friedman Special Invited Paper-Additive logistic regression: A statistical view of boosting , 2000 .

[14]  Michael R Kosorok,et al.  Residual Weighted Learning for Estimating Individualized Treatment Rules , 2015, Journal of the American Statistical Association.

[15]  I. Boutron,et al.  Reporting of analyses from randomized controlled trials with multiple arms: a systematic review , 2013, BMC Medicine.

[16]  Yufeng Liu,et al.  Estimating individualized treatment rules for ordinal treatments , 2017, Biometrics.

[17]  Donglin Zeng,et al.  Estimating Individualized Treatment Rules Using Outcome Weighted Learning , 2012, Journal of the American Statistical Association.

[18]  Yufeng Liu,et al.  Multicategory angle-based large-margin classification. , 2014, Biometrika.

[19]  Michael R. Kosorok,et al.  Robust Hybrid Learning for Estimating Personalized Dynamic Treatment Regimens , 2016, 1611.02314.

[20]  Yufeng Liu,et al.  Multi-Armed Angle-Based Direct Learning for Estimating Optimal Individualized Treatment Rules With Various Outcomes , 2020, Journal of the American Statistical Association.

[21]  S. Murphy,et al.  An experimental design for the development of adaptive treatment strategies , 2005, Statistics in medicine.