Setting up standards: A methodological proposal for pediatric Triage machine learning model construction based on clinical outcomes

Abstract Triage is a critical process in hospital emergency departments (ED). Specifically, we consider how to achieve fast and accurate patient Triage in the ED of a pediatric hospital. The goal of this paper is to establish methodological best practices for the application of machine learning (ML) to Triage in pediatric ED, providing a comprehensive comparison of the performance of ML techniques over a large dataset. Our work is among the first attempts in this direction. Following very recent works in the literature, we use the clinical outcome of a case as its label for supervised ML model training, instead of the more uncertain labels provided by experts. The experimental dataset contains the records along 3 years of operation of the hospital ED. It consists of 189,718 patients visits to the hospital. The clinical outcome of 9271 cases (4.98%) wa hospital admission, therefore our dataset is highly class imbalanced. Our reported performance comparison results focus on four ML models: Deep Learning (DL), Random Forest (RF), Naive Bayes (NB) and Support Vector Machines (SVM). Data preprocessing includes class imbalance correction, and case re-labeling. We use different well known metrics to evaluate performance of ML models in three different experimental settings: (a) classification of each case into the standard five Triage urgency levels, (b) discrimination of high versus low case severity according to its clinical outcome, and (c) comparison of the number of patients assigned to each standard Triage urgency level against the Triage rule based expert system currently in use at the hospital. RF achieved greater AUC, accuracy, PPV and specificity than the other models in the dychotomic classification experiments. On the implementation side, our study shows that ML predictive models trained according to clinical outcomes, provide better Triage performance than the current rule based expert system in operation at the hospital.

[1]  D. Amre,et al.  Evaluation of the Paediatric Canadian Triage and Acuity Scale in a pediatric ED. , 2005, The American journal of emergency medicine.

[2]  U. Pandee,et al.  Validation of different pediatric triage systems in the emergency department. , 2017, World journal of emergency medicine.

[3]  Mohd Alauddin Mohd Ali,et al.  Comparison of adaptive neuro-fuzzy inference system and artificial neutral networks model to categorize patients in the emergency department , 2013, SpringerPlus.

[4]  Marvin Karson,et al.  Handbook of Methods of Applied Statistics. Volume I: Techniques of Computation Descriptive Methods, and Statistical Inference. Volume II: Planning of Surveys and Experiments. I. M. Chakravarti, R. G. Laha, and J. Roy, New York, John Wiley; 1967, $9.00. , 1968 .

[5]  Adam Heller,et al.  Efficient p ‐ InP ( Rh ‐ H alloy ) and p ‐ InP ( Re ‐ H alloy ) Hydrogen Evolving Photocathodes , 1982 .

[6]  Manuel Graña,et al.  Predictive models for hospital readmission risk: A systematic review of methods , 2018, Comput. Methods Programs Biomed..

[7]  Yuval Shahar,et al.  Classification of patients by severity grades during triage in the emergency department using data mining methods. , 2012, Journal of evaluation in clinical practice.

[8]  L. P. Tham,et al.  The Singapore Paediatric Triage Scale Validation Study. , 2018, Singapore medical journal.

[9]  Vimla L Patel,et al.  Calibrating urgency: triage decision-making in a pediatric emergency department , 2008, Advances in health sciences education : theory and practice.

[10]  H. Haas Outils de triage aux urgences pédiatriques , 2005 .

[11]  P. Maningas,et al.  The use of the Soterion Rapid Triage System in children presenting to the Emergency Department. , 2006, The Journal of emergency medicine.

[12]  I. V. D. Wulp,et al.  Adjusting weighted kappa for severity of mistriage decreases reported reliability of emergency department triage systems: a comparative study. , 2009 .

[13]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[14]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[15]  Manuel Graña,et al.  Machine Learning Readmission Risk Modeling: A Pediatric Case Study , 2019, BioMed research international.

[16]  Matthew D. Zeiler ADADELTA: An Adaptive Learning Rate Method , 2012, ArXiv.

[17]  Ewout W Steyerberg,et al.  Multiple performance measures are needed to evaluate triage systems in the emergency department. , 2018, Journal of clinical epidemiology.

[18]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[19]  M. Twomey,et al.  Inter-rater and intrarater reliability of the South African Triage Scale in low-resource settings of Haiti and Afghanistan , 2018, Emergency Medicine Journal.

[20]  S. Manzano,et al.  Validity of the Canadian Paediatric Triage and Acuity Scale in a tertiary care hospital. , 2009, CJEM.

[21]  Stephen Pax Leonard,et al.  Consistency of retrospective triage decisions as a standardised instrument for audit. , 1999, Journal of accident & emergency medicine.

[22]  Guy Lapalme,et al.  A systematic analysis of performance measures for classification tasks , 2009, Inf. Process. Manag..

[23]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[24]  William Caicedo-Torres,et al.  A Machine Learning Model for Triage in Lean Pediatric Emergency Departments , 2016, IBERAMIA.

[25]  José Hernández-Orallo,et al.  An experimental comparison of performance measures for classification , 2009, Pattern Recognit. Lett..

[26]  B J Skipper,et al.  Triage: limitations in predicting need for emergent care and hospital admission. , 1996, Annals of emergency medicine.

[27]  Peyman Rezaei-Hachesu,et al.  Decision support system for triage management: A hybrid approach using rule-based reasoning and fuzzy logic , 2018, Int. J. Medical Informatics.

[28]  Scott Levin,et al.  Accuracy of emergency department triage using the Emergency Severity Index and independent predictors of under-triage and over-triage in Brazil: a retrospective cohort analysis , 2018, International Journal of Emergency Medicine.

[29]  Martin Gellerstedt,et al.  Manchester Triage in Sweden - interrater reliability and accuracy. , 2009, International emergency nursing.

[30]  Scott Levin,et al.  Machine‐Learning‐Based Electronic Triage More Accurately Differentiates Patients With Respect to Clinical Outcomes Compared With the Emergency Severity Index , 2017, Annals of emergency medicine.

[31]  Masatomi Ikusaka,et al.  Validation of the modified Japanese Triage and Acuity Scale-based triage system emphasizing the physiologic variables or mechanism of injuries , 2016, International Journal of Emergency Medicine.

[32]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[33]  Henriëtte A Moll,et al.  Challenges in the validation of triage systems at emergency departments. , 2010, Journal of clinical epidemiology.

[34]  Björn Eskofier,et al.  Generic performance measure for multiclass-classifiers , 2017, Pattern Recognit..

[35]  R. Goldman,et al.  The Canadian Triage and Acuity Scale for children: a prospective multicenter evaluation. , 2012, Annals of emergency medicine.

[36]  Shen-Tsu Wang,et al.  Construct an Optimal Triage Prediction Model: A Case Study of the Emergency Department of a Teaching Hospital in Taiwan , 2013, Journal of Medical Systems.

[37]  M. C. Magalhães-Barbosa,et al.  Reliability of triage systems for paediatric emergency care: a systematic review , 2017, Emergency Medicine Journal.

[38]  Gayane Yenokyan,et al.  An Electronic Emergency Triage System to Improve Patient Distribution by Critical Outcomes. , 2016, The Journal of emergency medicine.

[39]  Gül E. Okudan Kremer,et al.  Model comparison in Emergency Severity Index level prediction , 2013, Expert Syst. Appl..

[40]  L. Wallis,et al.  Reliability and validity of pediatric triage tools evaluated in Low resource settings: a systematic review , 2017, BMC Pediatrics.

[41]  E W Steyerberg,et al.  Validity of the Manchester Triage System in paediatric emergency care , 2006, Emergency Medicine Journal.

[42]  Douglas A. Talbert,et al.  A Machine Learning and Data Mining Framework to Enable Evolutionary Improvement in Trauma Triage , 2011, MLDM.

[43]  Bernhard Schölkopf,et al.  Learning with kernels , 2001 .