Generalized Additive Neural Networks for mortality prediction using automated and Genetic Algorithms

The prediction of mortality has shown to be a challenge for hospital management. To help in this task, metrics were developed to predict the evolution of the disease severity. One of the most commonly used metric in Intensive Care Units (ICUs) is the SAPS II, based on Generalized Linear Models (GLMs). However, the use of the more flexible Generalized Additive Models (GAMs) provide better results when the association between the outcome and the continuous covariates is nonlinear. Neural networks have also been used for prediction namely those based in the Multi Layer Perceptron (MLP) architecture, as, in theory, they are universal approximators to any continuous function. Some studies have shown that their performances are equivalent to GLMs and, naturally, inspired by GAMs, Generalized Additive Neural Networks (GANNs) were proposed. Because the construction of a GANN is based in a subjective decision making process through the analysis of the residuals plots, studies to automate this process emerged originating new methodologies (AutoGANN). However, these are not free from problems when the number of variables is large. Some improvements were then introduced for model selection, such as, a multistep algorithm that allows more than one modification at the same time in GANNs's architecture. Methods described above have correspondence to evolutionary programming as the search of a better result is performed by small modifications, closely resembling the mutation operator. AutoGANN method and Genetic Algorithm were used in order to find optimal models for predicting mortality at an ICU. These models, as well as a MLP model, were compared regarding their predictive and discriminative abilities.

[1]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[2]  C. Geraldes,et al.  Generalized Linear Models, Generalized Additive Models and Neural Networks: Comparative Study in Medical Applications , 2013 .

[3]  D. A. de Waal,et al.  AUTOMATION OF GENERALIZED ADDITIVE NEURAL NETWORKS FOR PREDICTIVE DATA MINING , 2011 .

[4]  David A. de Waal,et al.  Automation of Generalized Additive Neural Networks for Predictive Data Mining , 2011, Appl. Artif. Intell..

[5]  Ewout W Steyerberg,et al.  Extensions of net reclassification improvement calculations to measure usefulness of new biomarkers , 2011, Statistics in medicine.

[6]  D. D. Waal,et al.  Spam Detection using Generalized Additive Neural Networks , 2010 .

[7]  Takumi Ichimura,et al.  Neural Networks Applied to Medical Data for Prediction of Patient Outcome , 2008 .

[8]  Ziding Feng,et al.  Evaluating the Predictiveness of a Continuous Marker , 2007, Biometrics.

[9]  David A. de Waal,et al.  Generalized Additive Models from a Neural Network Perspective , 2007, ICDM Workshops.

[10]  J. Zimmerman,et al.  Acute Physiology and Chronic Health Evaluation (APACHE) IV: Hospital mortality assessment for today’s critically ill patients* , 2006, Critical care medicine.

[11]  D. Toit,et al.  Automated construction of generalized additive neural networks for predictive data mining , 2006 .

[12]  J. le Gall,et al.  SAPS 3—From evaluation of the patient to evaluation of the intensive care unit. Part 1: Objectives, methods and cohort description , 2005, Intensive Care Medicine.

[13]  T. Higgins,et al.  SAPS 3--From evaluation of the patient to evaluation of the intensive care unit. Part 1: Objectives, methods and cohort description. , 2005 .

[14]  Paulo J. G. Lisboa,et al.  A review of evidence of health benefit from artificial neural networks in medical intervention , 2002, Neural Networks.

[15]  V Marigliano,et al.  Neural network analysis in predicting 2-year survival in elderly people: a new statistical-mathematical approach. , 2001, Archives of gerontology and geriatrics.

[16]  William J. E. Potts,et al.  Generalized additive neural networks , 1999, KDD '99.

[17]  Steven Walczak,et al.  Heuristic principles for the design of artificial neural networks , 1999, Inf. Softw. Technol..

[18]  Andrew P. Bradley,et al.  The use of the area under the ROC curve in the evaluation of machine learning algorithms , 1997, Pattern Recognit..

[19]  R. Tibshirani,et al.  Generalized additive models for medical research , 1995, Statistical methods in medical research.

[20]  S. Hyakin,et al.  Neural Networks: A Comprehensive Foundation , 1994 .

[21]  Warren S. Sarle,et al.  Neural Networks and Statistical Models , 1994 .

[22]  S. Lemeshow,et al.  A new Simplified Acute Physiology Score (SAPS II) based on a European/North American multicenter study , 1993 .

[23]  S. Lemeshow,et al.  Mortality Probability Models (MPM II) based on an international cohort of intensive care unit patients. , 1993, JAMA.

[24]  S. Lemeshow,et al.  A new Simplified Acute Physiology Score (SAPS II) based on a European/North American multicenter study. , 1993, JAMA.

[25]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[26]  W. Knaus,et al.  The APACHE III prognostic system. Risk prediction of hospital mortality for critically ill hospitalized adults. , 1991, Chest.

[27]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[28]  David Haussler,et al.  What Size Net Gives Valid Generalization? , 1989, Neural Computation.

[29]  S Lemeshow,et al.  Refining intensive care unit outcome prediction by using changing probabilities of mortality. , 1988, Critical care medicine.

[30]  H Pastides,et al.  A method for predicting survival and mortality of ICU patients using objectively derived weights , 1985, Critical care medicine.

[31]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[32]  R. Faure,et al.  Introduction to operations research , 1968 .