Evaluating Random Forests for Survival Analysis Using Prediction Error Curves

Prediction error curves are increasingly used to assess and compare predictions in survival analysis. This article surveys the R package pec, which provides a set of functions for efficient computation of prediction error curves. The software implements inverse probability of censoring weights to deal with right-censored data and several variants of cross-validation to deal with the apparent error problem. In principle, all kinds of prediction models can be assessed, and the package readily supports most traditional regression modeling strategies, such as Cox regression or additive hazard regression, as well as state-of-the-art machine learning methods such as random forests, a nonparametric method that provides a promising alternative to traditional strategies in low- and high-dimensional settings. We show how the functionality of pec can be extended to as yet unsupported prediction models. As an example, we implement support for random forest prediction models based on the R packages randomSurvivalForest and party. Using data from the Copenhagen Stroke Study, we use pec to compare random forests to a Cox regression model derived from stepwise variable selection. Reproducible results at the user level are given for publicly available data from the German Breast Cancer Study Group.
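
To make the weighting idea concrete, the following is a minimal sketch of one point on a prediction error curve: the Brier score at a single time horizon, corrected for right censoring by inverse probability of censoring weights. It is written in plain Python for illustration and is not the pec implementation or its API. The function names (censoring_survival, ipcw_brier) are hypothetical, and the censoring distribution is assumed to be independent of the covariates, so it is estimated with a marginal Kaplan-Meier estimator.

```python
def censoring_survival(times, events):
    """Kaplan-Meier estimate of the censoring survival function G(u) = P(C > u).

    times  : observed times (event or censoring, whichever came first)
    events : 1 if the event was observed, 0 if the observation was censored
    Assumption: censoring does not depend on covariates (marginal model).
    """
    censor_times = sorted({t for t, d in zip(times, events) if d == 0})
    steps = []
    for u in censor_times:
        # Events tied with a censoring at u are treated as occurring first,
        # so they are no longer at risk of being censored at u.
        at_risk = sum(1 for t, d in zip(times, events)
                      if t > u or (t == u and d == 0))
        censored = sum(1 for t, d in zip(times, events) if t == u and d == 0)
        steps.append((u, 1.0 - censored / at_risk))

    def G(u, left_limit=False):
        # left_limit=True returns G(u-), the value just before time u.
        prob = 1.0
        for step_time, factor in steps:
            if step_time < u or (step_time == u and not left_limit):
                prob *= factor
        return prob

    return G


def ipcw_brier(times, events, surv_pred, horizon):
    """IPCW Brier score at one time horizon.

    surv_pred[i] is the predicted probability that subject i is still
    event-free at `horizon`.
    """
    G = censoring_survival(times, events)
    total = 0.0
    for t, d, s in zip(times, events, surv_pred):
        if t <= horizon and d == 1:
            # Event before the horizon: true status is 0, weight 1 / G(t-).
            total += (0.0 - s) ** 2 / G(t, left_limit=True)
        elif t > horizon:
            # Still event-free at the horizon: true status is 1, weight 1 / G(horizon).
            total += (1.0 - s) ** 2 / G(horizon)
        # Subjects censored before the horizon get weight 0.
    return total / len(times)


# Toy data (made up for illustration): the second subject is censored at time 2.
toy_times = [1, 2, 3, 4]
toy_events = [1, 0, 1, 1]
toy_pred = [0.5, 0.5, 0.5, 0.5]
print(ipcw_brier(toy_times, toy_events, toy_pred, horizon=2.5))
```

Evaluating ipcw_brier over a grid of horizons gives a prediction error curve. Computed on the same data used to fit the model, this is the apparent error, which is why cross-validation or bootstrap variants are needed for an honest estimate.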
