Discrimination measures for discrete time-to-event predictions

Discrete time-to-event models have become a popular tool for the statistical analysis of longitudinal data. These models are useful when either time is intrinsically discrete or when continuous time-to-event outcomes are collected at pre-specified follow-up times, yielding interval-censored data. While there exists a variety of methods for discrete-time model building and estimation, measures for the evaluation of discrete time-to-event predictions are scarce. To address this issue, a set of measures that quantify the discriminatory power of prediction rules for discrete event times is proposed. More specifically, sensitivity rates, specificity rates, AUC, and also a time-independent summary index (“concordance index”) for discrete time-to-event outcomes are developed. Using inverse-probability-of-censoring weighting, it is shown how to consistently estimate the proposed measures from a set of censored data. To illustrate the proposed methodology, the duration of unemployment of US citizens is analyzed, and it is demonstrated how discrimination measures can be used for model comparison.

[1]  R. Prentice,et al.  Regression analysis of grouped survival data with application to breast cancer data. , 1978, Biometrics.

[2]  Hans A. Kestler,et al.  On the validity of time-dependent AUC estimators , 2015, Briefings Bioinform..

[3]  G. Tutz,et al.  Semi- and Nonparametric Modeling of Ordinal Data , 2003 .

[4]  Gary S Collins,et al.  Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD): Explanation and Elaboration , 2015, Annals of Internal Medicine.

[5]  A. Tsiatis Semiparametric Theory and Missing Data , 2006 .

[6]  E. Steyerberg Clinical Prediction Models , 2008, Statistics for Biology and Health.

[7]  M. Pencina,et al.  On the C‐statistics for evaluating overall adequacy of risk prediction procedures with censored survival data , 2011, Statistics in medicine.

[8]  Rajeshwari Sundaram,et al.  A survival analysis approach to modeling human fecundity. , 2012, Biostatistics.

[9]  Gerhard Tutz,et al.  A survival tree method for the analysis of discrete event times in clinical and epidemiological studies , 2016, Statistics in medicine.

[10]  Matthias Schmid,et al.  A Robust Alternative to the Schemper–Henderson Estimator of Prediction Error , 2011, Biometrics.

[11]  P. J. Huber The behavior of maximum likelihood estimates under nonstandard conditions , 1967 .

[12]  Gerard J. van den Berg,et al.  Duration models: specification, identification and multiple durations , 2000 .

[13]  E Graf,et al.  Assessment and comparison of prognostic classification schemes for survival data. , 1999, Statistics in medicine.

[14]  Tianxi Cai,et al.  Evaluating Prediction Rules for t-Year Survivors With Censored Regression Models , 2007 .

[15]  Brian P. McCall,et al.  Unemployment Insurance Rules, Joblessness, and Part-Time Work , 1996 .

[16]  M. Pepe The Statistical Evaluation of Medical Tests for Classification and Prediction , 2003 .

[17]  Raymond J. Carroll,et al.  Measurement error in nonlinear models: a modern perspective , 2006 .

[18]  T. Lumley,et al.  Time‐Dependent ROC Curves for Censored Survival Data and a Diagnostic Marker , 2000, Biometrics.

[19]  J. Singer,et al.  Investigating onset, cessation, relapse, and recovery: why you should, and how you can, use discrete-time survival analysis to examine event occurrence. , 1993, Journal of consulting and clinical psychology.

[20]  L. Fahrmeir,et al.  Correction: Consistency and Asymptotic Normality of the Maximum Likelihood Estimator in Generalized Linear Models , 1985 .

[21]  B. Efron Logistic Regression, Survival Analysis, and the Kaplan-Meier Curve , 1988 .

[22]  A. Cameron,et al.  Microeconometrics: Methods and Applications , 2005 .

[23]  Gerhard Tutz,et al.  Modeling Discrete Time-To-Event Data , 2016 .

[24]  Ying Huang,et al.  Evaluating the ROC performance of markers for future events , 2008, Lifetime data analysis.

[25]  Daniel B. Mark,et al.  TUTORIAL IN BIOSTATISTICS MULTIVARIABLE PROGNOSTIC MODELS: ISSUES IN DEVELOPING MODELS, EVALUATING ASSUMPTIONS AND ADEQUACY, AND MEASURING AND REDUCING ERRORS , 1996 .

[26]  John O'Quigley,et al.  Explained randomness in proportional hazards models , 2005, Statistics in medicine.

[27]  P. Heagerty,et al.  Survival Model Predictive Accuracy and ROC Curves , 2005, Biometrics.

[28]  Elia Biganzoli,et al.  A time‐dependent discrimination index for survival data , 2005, Statistics in medicine.

[29]  N. Obuchowski,et al.  Assessing the Performance of Prediction Models: A Framework for Traditional and Novel Measures , 2010, Epidemiology.

[30]  J. Ibrahim,et al.  Handbook of survival analysis , 2014 .

[31]  Bernd Bischl,et al.  The residual‐based predictiveness curve: A visual tool to assess the performance of prediction models , 2016, Biometrics.

[32]  Xian Zhou,et al.  Discrete‐time survival models with long‐term survivors , 2008, Statistics in medicine.

[33]  J Stare,et al.  Explained variation in survival analysis. , 1996, Statistics in medicine.

[34]  Niels Keiding,et al.  Explained Variation and Predictive Accuracy in General Parametric Statistical Models: The Role of Model Misspecification , 2004, Lifetime data analysis.

[35]  M. Pencina,et al.  Overall C as a measure of discrimination in survival analysis: model specific population value and confidence interval estimation , 2004, Statistics in medicine.

[36]  Matthias Schmid,et al.  A comparison of estimators to evaluate the discriminatory power of time‐to‐event models , 2012, Statistics in medicine.

[37]  John T. Kent,et al.  Measures of dependence for censored survival data , 1988 .

[38]  Thomas A Gerds,et al.  Estimating a time‐dependent concordance index for survival prediction models with covariate dependent censoring , 2013, Statistics in medicine.

[39]  M. Schumacher,et al.  Consistent Estimation of the Expected Brier Score in General Survival Models with Right‐Censored Event Times , 2006, Biometrical journal. Biometrische Zeitschrift.

[40]  Qi Long,et al.  Addressing issues associated with evaluating prediction models for survival endpoints based on the concordance statistic , 2016, Biometrics.

[41]  M. Gonen,et al.  Concordance probability and discriminatory power in proportional hazards regression , 2005 .