SurvTRACE: transformers for survival analysis with competing events

In medicine, survival analysis studies the time until events of interest, such as mortality. One major challenge is handling multiple competing events (e.g., multiple disease diagnoses). In this work, we propose SurvTRACE, a transformer-based model that makes no assumption about the underlying survival distribution and is capable of handling competing events. We account for the implicit confounders that arise in the observational setting in multi-event scenarios; these cause selection bias because the predicted survival probability is influenced by irrelevant factors. To make full use of the survival data when training transformers from scratch, we design multiple auxiliary tasks for multi-task learning. The model thus learns a strong shared representation from all these tasks, which in turn supports better survival analysis. We further demonstrate how to inspect covariate relevance and importance through the interpretable attention mechanisms of SurvTRACE, which shows great potential for enhancing clinical trial design and new treatment development. Experiments on METABRIC, SUPPORT, and SEER data with 470k patients validate the all-around superiority of our method. Software is available at https://github.com/RyanWangZf/SurvTRACE.
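
As a minimal sketch of what handling competing events without a parametric survival distribution can look like, assume a model emits, for each discrete time interval, one cause-specific hazard per event type. This is an illustrative assumption rather than the SurvTRACE implementation; the function name `competing_risk_curves` and the toy hazard values are hypothetical. The sketch turns such hazards into an overall survival curve and one cumulative incidence curve per event.

```python
from typing import List, Tuple


def competing_risk_curves(
    hazards: List[List[float]],
) -> Tuple[List[float], List[List[float]]]:
    """Convert discrete-time cause-specific hazards into curves.

    hazards[t][k] is the (assumed) probability that event k occurs in
    interval t, given that no event occurred before interval t.
    Returns (survival, cif), where survival[t] is the probability of
    remaining event-free through interval t and cif[t][k] is the
    cumulative incidence of event k through interval t.
    """
    n_events = len(hazards[0])
    alive = 1.0                      # probability of being event-free so far
    running = [0.0] * n_events       # cumulative incidence per event
    survival: List[float] = []
    cif: List[List[float]] = []
    for h_t in hazards:
        if len(h_t) != n_events:
            raise ValueError("every interval needs one hazard per event")
        if min(h_t) < 0.0 or sum(h_t) > 1.0 + 1e-12:
            raise ValueError("hazards must be non-negative and sum to at most 1")
        # Event k happens in this interval only if the subject was still
        # event-free at its start.
        running = [c + alive * h for c, h in zip(running, h_t)]
        # Remaining event-free requires escaping every competing event.
        alive *= 1.0 - sum(h_t)
        survival.append(alive)
        cif.append(list(running))
    return survival, cif


# Toy example: three time intervals, two competing events (hypothetical values).
hazards = [[0.10, 0.05], [0.20, 0.10], [0.10, 0.10]]
survival, cif = competing_risk_curves(hazards)
```

At every interval the survival probability and the cumulative incidences sum to one, which is a convenient sanity check when several events compete for the same subject.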
