Functional survival forests for multivariate longitudinal outcomes: Dynamic prediction of Alzheimer’s disease progression

The random survival forest (RSF) is a non-parametric alternative to the Cox proportional hazards model in modeling time-to-event data. In this article, we developed a modeling framework to incorporate multivariate longitudinal data in the model building process to enhance the predictive performance of RSF. To extract the essential features of the multivariate longitudinal outcomes, two methods were adopted and compared: multivariate functional principal component analysis and multivariate fast covariance estimation for sparse functional data. These resulting features, which capture the trajectories of the multiple longitudinal outcomes, are then included as time-independent predictors in the subsequent RSF model. This non-parametric modeling framework, denoted as functional survival forests, is better at capturing the various trends in both the longitudinal outcomes and the survival model which may be difficult to model using only parametric approaches. These advantages are demonstrated through simulations and applications to the Alzheimer's Disease Neuroimaging Initiative.

[1]  Cécile Proust-Lima,et al.  Quantifying and comparing dynamic predictive accuracy of joint models for longitudinal marker and time‐to‐event in presence of censoring and competing risks , 2015, Biometrics.

[2]  Thomas A Gerds,et al.  The c‐index is not proper for the evaluation of t‐year predicted risks , 2019, Biostatistics.

[3]  J. Morris,et al.  Recent publications from the Alzheimer's Disease Neuroimaging Initiative: Reviewing progress toward improved AD clinical trials , 2017, Alzheimer's & Dementia.

[4]  M. LeBlanc,et al.  Survival Trees by Goodness of Split , 1993 .

[5]  B. Marx,et al.  Multivariate calibration with temperature interaction using two-dimensional penalized signal regression , 2003 .

[6]  Mark R. Segal,et al.  Regression Trees for Censored Data , 1988 .

[7]  Kan Li,et al.  Prediction of Conversion to Alzheimer's Disease with Longitudinal Measures and Time-To-Event Data. , 2017, Journal of Alzheimer's disease : JAD.

[8]  S. Greven,et al.  Multivariate Functional Principal Component Analysis for Data Observed on Different (Dimensional) Domains , 2015, 1509.02029.

[9]  H. Müller,et al.  Functional Data Analysis for Sparse Longitudinal Data , 2005 .

[10]  Kan Li,et al.  Dynamic prediction of Alzheimer's disease progression using features of multiple longitudinal outcomes and time‐to‐event data , 2019, Statistics in medicine.

[11]  Paul H. C. Eilers,et al.  Flexible smoothing with B-splines and penalties , 1996 .

[12]  Rosemary J. Jackson,et al.  Amyloid Beta and Tau Cooperate to Cause Reversible Behavioral and Transcriptional Deficits in a Model of Alzheimer’s Disease , 2019, Cell reports.

[13]  K M Leung,et al.  Censoring issues in survival analysis. , 1997, Annual review of public health.

[14]  Uri Shaham,et al.  DeepSurv: personalized treatment recommender system using a Cox proportional hazards deep neural network , 2016, BMC Medical Research Methodology.

[15]  H. Arrighi,et al.  Rate of Conversion from Prodromal Alzheimer's Disease to Alzheimer's Dementia: A Systematic Review of the Literature , 2013, Dementia and Geriatric Cognitive Disorders Extra.

[16]  Martin Schumacher,et al.  Dynamic prediction: A challenge for biostatisticians, but greatly needed by patients, physicians and the public , 2019, Biometrical journal. Biometrische Zeitschrift.

[17]  Hemant Ishwaran,et al.  Random Survival Forests , 2008, Wiley StatsRef: Statistics Reference Online.

[18]  Sheng Luo,et al.  Dynamic predictions in Bayesian functional joint models for longitudinal and time-to-event data: An application to Alzheimer’s disease , 2019, Statistical methods in medical research.

[19]  Akinori Nakamura,et al.  Brain fluorodeoxyglucose (FDG) PET in dementia , 2016, Ageing Research Reviews.

[20]  Nick C Fox,et al.  Hippocampal atrophy rates in Alzheimer disease , 2009, Neurology.

[21]  Torsten Hothorn,et al.  On the Exact Distribution of Maximally Selected Rank Statistics , 2002, Comput. Stat. Data Anal..

[22]  Jon Arni Steingrimsson,et al.  Personalized Risk Prediction in Clinical Oncology Research: Applications and Practical Issues Using Survival Trees and Random Forests , 2018, Journal of biopharmaceutical statistics.

[23]  R. Petersen,et al.  Mild cognitive impairment , 2006, The Lancet.

[24]  Jeffrey S Simonoff,et al.  Survival trees for interval‐censored survival data , 2017, Statistics in medicine.

[25]  Liang Li,et al.  Quantifying and estimating the predictive accuracy for censored time-to-event data with competing risks. , 2018, Statistics in medicine.