Mortality risk prediction in burn injury: Comparison of logistic regression with machine learning approaches.

INTRODUCTION Predicting mortality from burn injury has traditionally employed logistic regression models. Alternative machine learning methods have been introduced in some areas of clinical prediction as the necessary software and computational facilities have become accessible. Here we compare logistic regression and machine learning predictions of mortality from burn. METHODS An established logistic mortality model was compared to machine learning methods (artificial neural network, support vector machine, random forests and naïve Bayes) using a population-based (England & Wales) case-cohort registry. Predictive evaluation used: area under the receiver operating characteristic curve; sensitivity; specificity; positive predictive value and Youden's index. RESULTS All methods had comparable discriminatory abilities, similar sensitivities, specificities and positive predictive values. Although some machine learning methods performed marginally better than logistic regression the differences were seldom statistically significant and clinically insubstantial. Random forests were marginally better for high positive predictive value and reasonable sensitivity. Neural networks yielded slightly better prediction overall. Logistic regression gives an optimal mix of performance and interpretability. DISCUSSION The established logistic regression model of burn mortality performs well against more complex alternatives. Clinical prediction with a small set of strong, stable, independent predictors is unlikely to gain much from machine learning outside specialist research contexts.

[1]  I. Buchan,et al.  A model of British in-hospital mortality among burns patients. , 2014, Burns : journal of the International Society for Burn Injuries.

[2]  Yoshua Bengio,et al.  Inference for the Generalization Error , 1999, Machine Learning.

[3]  R F Edlich,et al.  The abbreviated burn severity index. , 1982, Annals of emergency medicine.

[4]  Tianxi Cai,et al.  The Performance of Risk Prediction Models , 2008, Biometrical journal. Biometrische Zeitschrift.

[5]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[6]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[7]  H. K. Estahbanati,et al.  Role of artificial neural networks in prediction of survival of burn patients-a new approach. , 2002, Burns : journal of the International Society for Burn Injuries.

[8]  D. Bacquer,et al.  Development and validation of a model for prediction of mortality in patients with acute burn injury , 2009, The British journal of surgery.

[9]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[10]  Hitinder S. Gurm,et al.  A Random Forest Based Risk Model for Reliable and Accurate Prediction of Receipt of Transfusion in Patients Undergoing Percutaneous Coronary Intervention , 2014, PloS one.

[11]  Samer A M Nashef,et al.  EuroSCORE II. , 2012, European journal of cardio-thoracic surgery : official journal of the European Association for Cardio-thoracic Surgery.

[12]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[13]  D. Hosmer,et al.  Applied Logistic Regression , 1991 .

[14]  E. Steyerberg Clinical Prediction Models , 2008, Statistics for Biology and Health.

[15]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[16]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[17]  Ameet Talwalkar,et al.  Foundations of Machine Learning , 2012, Adaptive computation and machine learning.

[18]  Muin J. Khoury,et al.  Application of support vector machine modeling for prediction of common diseases: the case of diabetes and pre-diabetes , 2010, BMC Medical Informatics Decis. Mak..

[19]  Kurt Hornik,et al.  Misc Functions of the Department of Statistics (e1071), TU Wien , 2014 .

[20]  William N. Venables,et al.  Modern Applied Statistics with S , 2010 .

[21]  A. Hussain,et al.  Predicting survival in thermal injury: a systematic review of methodology of composite prediction models. , 2013, Burns : journal of the International Society for Burn Injuries.