A simple method to estimate the time-dependent receiver operating characteristic curve and the area under the curve with right censored data

The time-dependent receiver operating characteristic curve is often used to study the diagnostic accuracy of a single continuous biomarker, measured at baseline, on the onset of a disease condition when the disease onset may occur at different times during the follow-up and hence may be right censored. Due to right censoring, the true disease onset status prior to the pre-specified time horizon may be unknown for some patients, which causes difficulty in calculating the time-dependent sensitivity and specificity. We propose to estimate the time-dependent sensitivity and specificity by weighting the censored data by the conditional probability of disease onset prior to the time horizon given the biomarker, the observed time to event, and the censoring indicator, with the weights calculated nonparametrically through a kernel regression on time to event. With this nonparametric weighting adjustment, we derive a novel, closed-form formula to calculate the area under the time-dependent receiver operating characteristic curve. We demonstrate through numerical study and theoretical arguments that the proposed method is insensitive to misspecification of the kernel bandwidth, produces unbiased and efficient estimators of time-dependent sensitivity and specificity, the area under the curve, and other estimands from the receiver operating characteristic curve, and outperforms several other published methods currently implemented in R packages.

[1]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[2]  D. Dabrowska Non-parametric regression with censored survival time data , 1987 .

[3]  D. Harrington,et al.  Counting Processes and Survival Analysis , 1991 .

[4]  Robert Tibshirani,et al.  An Introduction to the Bootstrap , 1994 .

[5]  P. Grambsch,et al.  Primary biliary cirrhosis: Prediction of short‐term survival based on repeated patient visits , 1994, Hepatology.

[6]  T. Lumley,et al.  Time‐Dependent ROC Curves for Censored Survival Data and a Diagnostic Marker , 2000, Biometrics.

[7]  Xiao-Hua Zhou,et al.  Statistical Methods in Diagnostic Medicine , 2002 .

[8]  M. Pepe The Statistical Evaluation of Medical Tests for Classification and Prediction , 2003 .

[9]  Guoqing Diao,et al.  Estimation of time‐dependent area under the ROC curve for long‐term risk prediction , 2006, Statistics in medicine.

[10]  Tianxi Cai,et al.  Evaluating Prediction Rules for t-Year Survivors With Censored Regression Models , 2007 .

[11]  Xiao Song,et al.  A semiparametric approach for the covariate specific ROC curve with survival outcome , 2008 .

[12]  Holly Janes,et al.  Pivotal Evaluation of the Accuracy of a Biomarker Used for Classification or Prediction: Standards for Study Design , 2008, Journal of the National Cancer Institute.

[13]  Chin-Tsang Chiang,et al.  Estimation methods for time‐dependent AUC models with survival data , 2009 .

[14]  Chin-Tsang Chiang,et al.  Optimal Composite Markers for Time‐Dependent Receiver Operating Characteristic Curves with Censored Survival Data , 2010 .

[15]  Peter Dalgaard,et al.  R Development Core Team (2010): R: A language and environment for statistical computing , 2010 .

[16]  P. Heagerty,et al.  Time‐Dependent Predictive Accuracy in the Presence of Competing Risks , 2010, Biometrics.

[17]  Jialiang Li,et al.  Time‐dependent ROC analysis under diverse censoring patterns , 2011, Statistics in medicine.

[18]  Paul Blanche,et al.  Time-dependent AUC with right-censored data:a survey. In: Risk Assessment and Evaluation of Predictions , 2012, 1210.6805.

[19]  Jean-François Dartigues,et al.  Review and comparison of ROC curve estimators for a time-dependent outcome with marker-dependent censoring. , 2013, Biometrical journal. Biometrische Zeitschrift.

[20]  Ziding Feng,et al.  Improving the Quality of Biomarker Discovery Research: The Right Samples and Enough of Them , 2015, Cancer Epidemiology, Biomarkers & Prevention.

[21]  Jean-François Dartigues,et al.  Receiver operating characteristic curve estimation for time to event with semicompeting risks and interval censoring , 2016, Statistical methods in medical research.

[22]  A. Dreher Modeling Survival Data Extending The Cox Model , 2016 .

[23]  Yi Wu Survey Study , 2018, Achieving Supply Chain Agility.

[24]  Leonhard Held,et al.  Validation of discrete time‐to‐event prediction models in the presence of competing risks , 2019, Biometrical journal. Biometrische Zeitschrift.

[25]  Kan Li,et al.  Bayesian functional joint models for multivariate longitudinal and time-to-event data , 2019, Comput. Stat. Data Anal..

[26]  Liang Li,et al.  Landmark linear transformation model for dynamic prediction with application to a longitudinal cohort study of chronic disease , 2018, Journal of the Royal Statistical Society. Series C, Applied statistics.

[27]  Thomas O Jemielita,et al.  Are tumor size changes predictive of survival for checkpoint blockade based immunotherapy in metastatic melanoma? , 2019, Journal of Immunotherapy for Cancer.

[28]  Chen Liang,et al.  Multigene panel predicting survival of patients with colon cancer , 2019, Journal of Cancer.