Time-to-Event Prediction with Neural Networks and Cox Regression

New methods for time-to-event prediction are proposed by extending the Cox proportional hazards model with neural networks. Building on methodology from nested case-control studies, we propose a loss function that scales well to large data sets, and enables fitting of both proportional and non-proportional extensions of the Cox model. Through simulation studies, the proposed loss function is verified to be a good approximation for the Cox partial log-likelihood. The proposed methodology is compared to existing methodologies on real-world data sets, and is found to be highly competitive, typically yielding the best performance in terms of Brier score and binomial log-likelihood. A python package for the proposed methods is available at this https URL.

[1]  Junzhou Huang,et al.  Deep convolutional neural network for survival analysis with pathological images , 2016, 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[2]  Frank Hutter,et al.  Decoupled Weight Decay Regularization , 2017, ICLR.

[3]  D.,et al.  Regression Models and Life-Tables , 2022 .

[4]  D. Sargent,et al.  Comparison of artificial neural networks with other statistical approaches , 2001, Cancer.

[5]  Leslie N. Smith,et al.  Cyclical Learning Rates for Training Neural Networks , 2015, 2017 IEEE Winter Conference on Applications of Computer Vision (WACV).

[6]  P. Heagerty,et al.  Survival Model Predictive Accuracy and ROC Curves , 2005, Biometrics.

[7]  Junzhou Huang,et al.  WSISA: Making Survival Prediction from Whole Slide Histopathological Images , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Hemant Ishwaran,et al.  Random Survival Forests , 2008, Wiley StatsRef: Statistics Reference Online.

[9]  F. Harrell,et al.  Evaluating the yield of medical tests. , 1982, JAMA.

[10]  Elia Biganzoli,et al.  A time‐dependent discrimination index for survival data , 2005, Statistics in medicine.

[11]  Dirk Van den Poel,et al.  Customer attrition analysis for financial services using proportional hazard models , 2004, Eur. J. Oper. Res..

[12]  P. Grambsch,et al.  A Package for Survival Analysis in S , 1994 .

[13]  E Graf,et al.  Assessment and comparison of prognostic classification schemes for survival data. , 1999, Statistics in medicine.

[14]  J. Klein,et al.  Survival Analysis: Techniques for Censored and Truncated Data , 1997 .

[15]  Jorge Nocedal,et al.  On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima , 2016, ICLR.

[16]  Gian Antonio Susto,et al.  Machine Learning for Predictive Maintenance: A Multiple Classifier Approach , 2015, IEEE Transactions on Industrial Informatics.

[17]  P. Lapuerta,et al.  Comparison of the performance of neural network methods and Cox regression for censored survival data , 2000 .

[18]  Bryan Langholz,et al.  Asymptotic Theory for Nested Case-Control Sampling in the Cox Regression Model , 1992 .

[19]  Luca Antiga,et al.  Automatic differentiation in PyTorch , 2017 .

[20]  Bart Baesens,et al.  Time to default in credit scoring using survival analysis: a benchmark study , 2015, J. Oper. Res. Soc..

[21]  A. Vigano,et al.  Survival prediction in terminal cancer patients: a systematic review of the medical literature , 2000, Palliative medicine.

[22]  Yoshua Bengio,et al.  Deep Learning for Patient-Specific Kidney Graft Survival Analysis , 2017, ArXiv.

[23]  Elad Hoffer,et al.  Train longer, generalize better: closing the generalization gap in large batch training of neural networks , 2017, NIPS.

[24]  D Faraggi,et al.  A neural network model for survival data. , 1995, Statistics in medicine.

[25]  Joshua E. Lewis,et al.  Predicting clinical outcomes from large scale cancer genomic profiles with deep survival models , 2017, Scientific Reports.

[26]  Cheng Guo,et al.  Entity Embeddings of Categorical Variables , 2016, ArXiv.

[27]  Bryan Langholz,et al.  Risk set sampling in epidemiologic cohort studies , 1996 .

[28]  Stephane Fotso,et al.  Deep Neural Networks for Survival Analysis Based on a Multi-Task Framework , 2018, ArXiv.

[29]  Uri Shaham,et al.  DeepSurv: personalized treatment recommender system using a Cox proportional hazards deep neural network , 2016, BMC Medical Research Methodology.

[30]  Thomas A Gerds,et al.  Estimating a time‐dependent concordance index for survival prediction models with covariate dependent censoring , 2013, Statistics in medicine.

[31]  Changhee Lee,et al.  DeepHit: A Deep Learning Approach to Survival Analysis With Competing Risks , 2018, AAAI.