Modelling Survival by Machine Learning Methods in Liver Transplantation: Application to the UNOS Dataset

The aim of this study is to develop and validate a machine learning (ML) model for predicting survival after liver transplantation based on pre-transplant donor and recipient characteristics. For this purpose, we consider a database from the United Network for Organ Sharing (UNOS), containing 29 variables and 39,095 donor-recipient pairs, describing liver transplantations performed in the United States of America from November 2004 until June 2015. The dataset contains more than a \(74\%\) of censoring, being a challenging and difficult problem. Several methods including proportional-hazards regression models and ML methods such as Gradient Boosting were applied, using 10 donor characteristics, 15 recipient characteristics and 4 shared variables associated with the donor-recipient pair. In order to measure the performance of the seven state-of-the-art methodologies, three different evaluation metrics are used, being the concordance index (ipcw) the most suitable for this problem. The results achieved show that, for each measure, a different technique obtains the highest value, performing almost the same, but, if we focus on ipcw, Gradient Boosting outperforms the rest of the methods.

[1]  Uri Shaham,et al.  DeepSurv: personalized treatment recommender system using a Cox proportional hazards deep neural network , 2016, BMC Medical Research Methodology.

[2]  S. Paton,et al.  Survival analysis of liver transplant patients in Canada 1997-2002. , 2006, Transplantation proceedings.

[3]  Paul D. Allison,et al.  Survival analysis using sas®: a practical guide , 1995 .

[4]  Zachary A. Szpiech,et al.  Statistical Applications in Genetics and Molecular Biology Comparing Spatial Maps of Human Population-Genetic Variation Using Procrustes Analysis , 2011 .

[5]  M. Pencina,et al.  On the C‐statistics for evaluating overall adequacy of risk prediction procedures with censored survival data , 2011, Statistics in medicine.

[6]  Jamileh Abolghasemi,et al.  Survival analysis of liver cirrhosis patients after transplantation using accelerated failure time models , 2018, Biomedical Research and Therapy.

[7]  Sylvie Chevret,et al.  Summary measure of discrimination in survival models based on cumulative/dynamic time-dependent ROC curves , 2016, Statistical methods in medical research.

[8]  David R. Cox,et al.  Regression models and life tables (with discussion , 1972 .

[9]  Bing Zhang,et al.  A Unified Mixed Effects Model for Gene Set Analysis of Time Course Microarray Experiments , 2009, Statistical applications in genetics and molecular biology.

[10]  Rupert G. Miller,et al.  Survival Analysis , 2022, The SAGE Encyclopedia of Research Design.

[11]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[12]  Ping Wang,et al.  Machine Learning for Survival Analysis , 2019, ACM Comput. Surv..

[13]  S. Sathiya Keerthi,et al.  Efficient algorithms for ranking with SVMs , 2010, Information Retrieval.

[14]  Nicolás Jarufe,et al.  Accuracy of the BAR score in the prediction of survival after liver transplantation. , 2019, Annals of hepatology.

[15]  Daniel B. Mark,et al.  TUTORIAL IN BIOSTATISTICS MULTIVARIABLE PROGNOSTIC MODELS: ISSUES IN DEVELOPING MODELS, EVALUATING ASSUMPTIONS AND ADEQUACY, AND MEASURING AND REDUCING ERRORS , 1996 .

[16]  C. Wang,et al.  Statistical Applications in Genetics and Molecular Biology Buckley-James Boosting for Survival Analysis with High-Dimensional Biomarker Data , 2011 .

[17]  Nassir Navab,et al.  Fast Training of Support Vector Machines for Survival Analysis , 2015, ECML/PKDD.

[18]  Trevor Hastie,et al.  Regularization Paths for Cox's Proportional Hazards Model via Coordinate Descent. , 2011, Journal of statistical software.

[19]  Hamid Sheikhzadeh,et al.  Relevance Vector Machine for Survival Analysis , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[20]  Winfried Stute,et al.  Consistent estimation under random censorship when covariables are present , 1993 .