Cox-nnet v2.0: improved neural-network-based survival prediction extended to large-scale EMR data

Cox-nnet is a neural-network based prognosis prediction method, originally applied to genomics data. Here we propose the version 2 of Cox-nnet, with significant improvement on efficiency and interpretability, making it suitable to predict prognosis based on large-scale electronic medical records (EMR) datasets. We also add permutation-based feature importance scores and the direction of feature coefficients. Applying on an EMR dataset of OPTN kidney transplantation, Cox-nnet v2.0 reduces the training time of Cox-nnet up to 32 folds (n=10,000) and achieves better prediction accuracy than Cox-PH (p<0.05). Availability and implementation: Cox-nnet v2.0 is freely available to the public at this https URL

[1]  M. Pencina,et al.  On the C‐statistics for evaluating overall adequacy of risk prediction procedures with censored survival data , 2011, Statistics in medicine.

[2]  Sepp Hochreiter,et al.  Self-Normalizing Neural Networks , 2017, NIPS.

[3]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[4]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[5]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[6]  James O. Berger,et al.  Borrowing Strength: Theory Powering Applications – A Festschrift for Lawrence D. Brown , 2010 .

[7]  Yang Feng,et al.  High-dimensional variable selection for Cox's proportional hazards model , 2010, 1002.3315.

[8]  Hemant Ishwaran,et al.  Random Survival Forests , 2008, Wiley StatsRef: Statistics Reference Online.

[9]  John P. A. Ioannidis,et al.  Opportunities and challenges in developing risk prediction models with electronic health records data: a systematic review , 2017, J. Am. Medical Informatics Assoc..

[10]  Xun Zhu,et al.  Cox-nnet: An artificial neural network method for prognosis prediction of high-throughput omics data , 2018, PLoS Comput. Biol..

[11]  D.,et al.  Regression Models and Life-Tables , 2022 .

[12]  Harry Hemingway,et al.  Machine learning models in electronic health records can outperform conventional survival models for predicting patient mortality in coronary artery disease , 2018, bioRxiv.

[13]  Cynthia Rudin,et al.  All Models are Wrong, but Many are Useful: Learning a Variable's Importance by Studying an Entire Class of Prediction Models Simultaneously , 2019, J. Mach. Learn. Res..

[14]  M. Westerhoff,et al.  Two-stage biologically interpretable neural-network models for liver cancer prognosis prediction using histopathology and transcriptomic data , 2020 .

[15]  Elham Mahmoudi,et al.  Use of electronic medical records in development and validation of risk prediction models of hospital readmission: systematic review , 2020, BMJ.