Joint Models for Time-to-Event Data and Longitudinal Biomarkers of High Dimension

Joint models for longitudinal biomarkers and time-to-event data are widely used in longitudinal studies. Many joint modeling approaches have been proposed to handle different types of longitudinal biomarkers and survival outcomes. However, most existing joint modeling methods cannot deal with a large number of longitudinal biomarkers simultaneously, such as the longitudinally collected gene expression profiles. In this article, we propose a new joint modeling method under the Bayesian framework, which is able to analyze longitudinal biomarkers of high dimension. Specifically, we assume that only a few unobserved latent variables are related to the survival outcome and the latent variables are inferred using a factor analysis model, which greatly reduces the dimensionality of the biomarkers and also accounts for the high correlations among the biomarkers. Through extensive simulation studies, we show that our proposed method has improved prediction accuracy over other joint modeling methods. We illustrate the usefulness of our method on a dataset of idiopathic pulmonary fibrosis patients in which we are interested in predicting the patients' time-to-death using their gene expression profiles.

[1]  David B. Dunson,et al.  Bayesian Data Analysis , 2010 .

[2]  Bradley P. Carlin,et al.  Combining Dynamic Predictions From Joint Models for Longitudinal and Time-to-Event Data Using Bayesian Model Averaging , 2013, 1303.2797.

[3]  Cécile Proust-Lima,et al.  Joint latent class models for longitudinal and time-to-event data: A review , 2014, Statistical methods in medical research.

[4]  Joyce S Lee,et al.  A Multidimensional Index and Staging System for Idiopathic Pulmonary Fibrosis , 2012, Annals of Internal Medicine.

[5]  Dimitris Rizopoulos,et al.  Fast fitting of joint models for longitudinal and event time data using a pseudo-adaptive Gaussian quadrature rule , 2012, Comput. Stat. Data Anal..

[6]  David B Dunson,et al.  Default Prior Distributions and Efficient Posterior Computation in Bayesian Factor Analysis , 2009, Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America.

[7]  George C Tseng,et al.  Peripheral Blood Mononuclear Cell Gene Expression Profiles Predict Poor Outcome in Idiopathic Pulmonary Fibrosis , 2013, Science Translational Medicine.

[8]  J. Ibrahim,et al.  A Flexible B‐Spline Model for Multiple Longitudinal Biomarkers and Survival , 2005, Biometrics.

[9]  Dimitris Rizopoulos,et al.  A Bayesian semiparametric multivariate joint model for multiple longitudinal outcomes and a time‐to‐event , 2011, Statistics in medicine.

[10]  Wanzhu Tu,et al.  Simultaneous variable selection for joint models of longitudinal and survival outcomes , 2015, Biometrics.

[11]  William H. Press,et al.  Numerical Recipes 3rd Edition: The Art of Scientific Computing , 2007 .

[12]  J R Anderson,et al.  Analysis of survival by tumor response. , 1983, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[13]  John Geweke,et al.  Evaluating the accuracy of sampling-based approaches to the calculation of posterior moments , 1991 .

[14]  Yongseok Park,et al.  Real‐Time Individual Predictions of Prostate Cancer Recurrence Using Joint Models , 2013, Biometrics.

[15]  H. Putter,et al.  Dynamic Prediction in Clinical Survival Analysis , 2011 .

[16]  M. West,et al.  High-Dimensional Sparse Factor Modeling: Applications in Gene Expression Genomics , 2008, Journal of the American Statistical Association.

[17]  D. Thomas,et al.  Simultaneously modelling censored survival data and repeatedly measured covariates: a Gibbs sampling approach. , 1996, Statistics in medicine.

[18]  B. Kowalski,et al.  Partial least-squares regression: a tutorial , 1986 .

[19]  Elena A. Erosheva,et al.  Dealing with rotational invariance in Bayesian conrmatory factor analysis , 2013 .