Combining biomarkers to optimize patient treatment recommendations

Markers that predict treatment effect have the potential to improve patient outcomes. For example, the OncotypeDX® RecurrenceScore® has some ability to predict the benefit of adjuvant chemotherapy over and above hormone therapy for the treatment of estrogen-receptor-positive breast cancer, facilitating the provision of chemotherapy to women most likely to benefit from it. Given that the score was originally developed for predicting outcome given hormone therapy alone, it is of interest to develop alternative combinations of the genes comprising the score that are optimized for treatment selection. However, most methodology for combining markers is useful when predicting outcome under a single treatment. We propose a method for combining markers for treatment selection which requires modeling the treatment effect as a function of markers. Multiple models of treatment effect are fit iteratively by upweighting or "boosting" subjects potentially misclassified according to treatment benefit at the previous stage. The boosting approach is compared to existing methods in a simulation study based on the change in expected outcome under marker-based treatment. The approach improves upon methods in some settings and has comparable performance in others. Our simulation study also provides insights as to the relative merits of the existing methods. Application of the boosting approach to the breast cancer data, using scaled versions of the original markers, produces marker combinations that may have improved performance for treatment selection.

[1]  S. Murphy,et al.  Variable Selection for Qualitative Interactions. , 2011, Statistical methodology.

[2]  Holly Janes,et al.  Measuring the Performance of Markers for Guiding Treatment Decisions , 2011, Annals of Internal Medicine.

[3]  L. Tian,et al.  Analysis of randomized comparative clinical trial data for personalized treatment selections. , 2011, Biostatistics.

[4]  S. Murphy,et al.  Optimal dynamic treatment regimes , 2003 .

[5]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[6]  Charles Kooperberg,et al.  Combining biomarkers to detect disease with application to prostate cancer. , 2003, Biostatistics.

[7]  J. Sekhon,et al.  Genetic Optimization Using Derivatives , 1998, Political Analysis.

[8]  J. Friedman Special Invited Paper-Additive logistic regression: A statistical view of boosting , 2000 .

[9]  D. Hayes,et al.  Uses and abuses of tumor markers in the diagnosis, monitoring, and treatment of primary and metastatic breast cancer. , 2006, The oncologist.

[10]  Robert B Livingston,et al.  Prognostic and predictive value of the 21-gene recurrence score assay in postmenopausal women with node-positive, oestrogen-receptor-positive breast cancer on chemotherapy: a retrospective analysis of a randomised trial. , 2010, The Lancet. Oncology.

[11]  Margaret Sullivan Pepe,et al.  Evaluating Markers for Selecting a Patient's Treatment , 2004, Biometrics.

[12]  M. Cronin,et al.  Gene expression and benefit of chemotherapy in women with node-negative, estrogen receptor-positive breast cancer. , 2006, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[13]  Tong Zhang Statistical behavior and consistency of classification methods based on convex risk minimization , 2003 .

[14]  Andrew J Vickers,et al.  Method for evaluating prediction models that apply the results of randomized trials to individual patients , 2007, Trials.

[15]  Ping Liu,et al.  A comparative analysis of methods for probability estimation tree , 2011 .

[16]  Jason Brinkley,et al.  A Generalized Estimator of the Attributable Benefit of an Optimal Treatment Regime , 2010, Biometrics.

[17]  Pedro M. Domingos,et al.  Tree Induction for Probability-Based Ranking , 2003, Machine Learning.

[18]  Eric B. Laber,et al.  A Robust Method for Estimating Optimal Treatment Regimes , 2012, Biometrics.

[19]  Holly Janes,et al.  An Approach to Evaluating and Comparing Biomarkers for Patient Treatment Selection , 2014, The international journal of biostatistics.

[20]  M. Cronin,et al.  A multigene assay to predict recurrence of tamoxifen-treated, node-negative breast cancer. , 2004, The New England journal of medicine.

[21]  M. Kosorok,et al.  Reinforcement learning design for cancer clinical trials , 2009, Statistics in medicine.

[22]  Ji Zhu,et al.  Variable Selection for Optimal Decision Making , 2007, AIME.

[23]  Min Zhang,et al.  Estimating optimal treatment regimes from a classification perspective , 2012, Stat.

[24]  Lihui Zhao,et al.  Estimating Subject-Specific Treatment Differences for Risk-Benefit Assessment with Competing Risk Event-Time Data , 2011 .

[25]  T. Cai,et al.  Combining Predictors for Classification Using the Area under the Receiver Operating Characteristic Curve , 2006, Biometrics.

[26]  S. Martino,et al.  Adjuvant chemotherapy and timing of tamoxifen in postmenopausal patients with endocrine-responsive, node-positive breast cancer: a phase 3, open-label, randomised controlled trial , 2009, The Lancet.

[27]  Lacey Gunter,et al.  Variable Selection for Qualitative Interactions in Personalized Medicine While Controlling the Family-Wise Error Rate , 2011, Journal of biopharmaceutical statistics.

[28]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[29]  S. Murphy,et al.  PERFORMANCE GUARANTEES FOR INDIVIDUALIZED TREATMENT RULES. , 2011, Annals of statistics.

[30]  Holly Janes,et al.  A Framework for Evaluating Markers Used to Select Patient Treatment , 2014, Medical decision making : an international journal of the Society for Medical Decision Making.

[31]  G. Clark Prognostic and predictive factors for breast cancer , 1995, Breast cancer.

[32]  Kurt Hornik,et al.  Misc Functions of the Department of Statistics (e1071), TU Wien , 2014 .

[33]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[34]  Jasjeet S. Sekhon,et al.  Genetic Optimization Using Derivatives , 2011, Political Analysis.

[35]  J. M. Taylor,et al.  Subgroup identification from randomized clinical trial data , 2011, Statistics in medicine.

[36]  Mark Culp,et al.  ada: An R Package for Stochastic Boosting , 2006 .

[37]  Wenbin Lu,et al.  Variable selection for optimal treatment decision , 2013, Statistical methods in medical research.

[38]  Y. Li,et al.  AUC-based biomarker ensemble with an application on gene scores predicting low bone mineral density , 2011, Bioinform..

[39]  Donglin Zeng,et al.  Estimating Individualized Treatment Rules Using Outcome Weighted Learning , 2012, Journal of the American Statistical Association.

[40]  Yoav Freund,et al.  An Adaptive Version of the Boost by Majority Algorithm , 1999, COLT '99.