Bayesian Variable Selection with Joint Modeling of Categorical and Survival Outcomes: An Application to Individualizing Chemotherapy Treatment in Advanced Colorectal Cancer

Colorectal cancer is the second leading cause of cancer-related deaths in the United States, with more than 130,000 new cases diagnosed each year. Clinical studies have shown that genetic alterations lead to different responses to the same treatment, despite the morphologic similarities of tumors. A molecular test prior to treatment could help determine an optimal treatment for a patient with regard to both toxicity and efficacy. This article introduces a statistical method for predicting and comparing multiple endpoints given the treatment options and molecular profile of an individual. We consider a latent variable-based multivariate regression model with a structured variance-covariance matrix. The latent variables account for the correlation among multiple endpoints and accommodate the fact that some clinical endpoints are categorical variables while others are censored. The hierarchical structure, built on a mixture of normal distributions, admits a natural variable selection rule. Inference is conducted by sampling from the posterior distribution using Markov chain Monte Carlo methods. We examine the finite-sample properties of the proposed method in simulation studies. The application to the advanced colorectal cancer study reveals associations between multiple endpoints and particular biomarkers, demonstrating the potential of individualizing treatment based on genetic profiles.
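
As a rough illustration of how a mixture-of-normals prior can yield a variable selection rule, the sketch below computes the conditional probability that a regression coefficient belongs to the wide "slab" component rather than the narrow "spike" component, and keeps covariates whose probability exceeds a threshold. This is a generic spike-and-slab construction, not necessarily the exact prior used in the article; the function names, the prior inclusion probability, the two standard deviations, and the 0.5 threshold are all assumptions chosen for illustration.

```python
import math


def normal_pdf(x, sd):
    """Density of a zero-mean normal distribution with standard deviation sd."""
    return math.exp(-0.5 * (x / sd) ** 2) / (sd * math.sqrt(2.0 * math.pi))


def inclusion_probability(beta, prior_incl=0.5, spike_sd=0.1, slab_sd=1.0):
    """P(coefficient comes from the slab | beta) under the assumed two-component prior.

    Assumed prior (illustrative only):
        beta ~ prior_incl * N(0, slab_sd^2) + (1 - prior_incl) * N(0, spike_sd^2)
    """
    slab = prior_incl * normal_pdf(beta, slab_sd)
    spike = (1.0 - prior_incl) * normal_pdf(beta, spike_sd)
    return slab / (slab + spike)


def select(betas, threshold=0.5, **prior_kwargs):
    """Return indices of coefficients whose inclusion probability exceeds threshold."""
    return [
        j
        for j, b in enumerate(betas)
        if inclusion_probability(b, **prior_kwargs) > threshold
    ]


if __name__ == "__main__":
    # Hypothetical coefficient values, e.g. a single MCMC draw.
    draw = [0.0, 1.0, -0.8, 0.05]
    print(select(draw))
```

Inside a full sampler this conditional probability would typically be used to draw an inclusion indicator at each iteration, and the selection rule would be applied to the indicators' posterior averages rather than to a single draw.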
