Generating contextual embeddings for emergency department chief complaints

Abstract Objective We learn contextual embeddings for emergency department (ED) chief complaints using Bidirectional Encoder Representations from Transformers (BERT), a state-of-the-art language model, to derive a compact and computationally useful representation for free-text chief complaints. Materials and methods Retrospective data on 2.1 million adult and pediatric ED visits was obtained from a large healthcare system covering the period of March 2013 to July 2019. A total of 355 497 (16.4%) visits from 65 737 (8.9%) patients were removed for absence of either a structured or unstructured chief complaint. To ensure adequate training set size, chief complaint labels that comprised less than 0.01%, or 1 in 10 000, of all visits were excluded. The cutoff threshold was incremented on a log scale to create seven datasets of decreasing sparsity. The classification task was to predict the provider-assigned label from the free-text chief complaint using BERT, with Long Short-Term Memory (LSTM) and Embeddings from Language Models (ELMo) as baselines. Performance was measured as the Top-k accuracy from k = 1:5 on a hold-out test set comprising 5% of the samples. The embedding for each free-text chief complaint was extracted as the final 768-dimensional layer of the BERT model and visualized using t-distributed stochastic neighbor embedding (t-SNE). Results The models achieved increasing performance with datasets of decreasing sparsity, with BERT outperforming both LSTM and ELMo. The BERT model yielded Top-1 accuracies of 0.65 and 0.69, Top-3 accuracies of 0.87 and 0.90, and Top-5 accuracies of 0.92 and 0.94 on datasets comprised of 434 and 188 labels, respectively. Visualization using t-SNE mapped the learned embeddings in a clinically meaningful way, with related concepts embedded close to each other and broader types of chief complaints clustered together. Discussion Despite the inherent noise in the chief complaint label space, the model was able to learn a rich representation of chief complaints and generate reasonable predictions of their labels. The learned embeddings accurately predict provider-assigned chief complaint labels and map semantically similar chief complaints to nearby points in vector space. Conclusion Such a model may be used to automatically map free-text chief complaints to structured fields and to assist the development of a standardized, data-driven ontology of chief complaints for healthcare institutions.

[1]  Quoc V. Le,et al.  Distributed Representations of Sentences and Documents , 2014, ICML.

[2]  Yacine Jernite,et al.  Predicting Chief Complaints at Triage Time in the Emergency Department , 2013 .

[3]  Jingqi Wang,et al.  Enhancing Clinical Concept Extraction with Contextual Embedding , 2019, J. Am. Medical Informatics Assoc..

[4]  Rajesh Ranganath,et al.  ClinicalBERT: Modeling Clinical Notes and Predicting Hospital Readmission , 2019, ArXiv.

[5]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[6]  Lina M. Sulieman,et al.  Classifying patient portal messages using Convolutional Neural Networks , 2017, J. Biomed. Informatics.

[7]  Mike Conway,et al.  Using chief complaints for syndromic surveillance: A review of chief complaint based classifiers in North America , 2013, J. Biomed. Informatics.

[8]  Woo Suk Hong,et al.  Predicting hospital admission at emergency department triage using machine learning , 2018, PloS one.

[9]  Luke S. Zettlemoyer,et al.  Deep Contextualized Word Representations , 2018, NAACL.

[10]  Peter J. Haug,et al.  Classifying free-text triage chief complaints into syndromic categories with natural language processing , 2005, Artif. Intell. Medicine.

[11]  Larry A. Nathanson,et al.  Consensus Development of a Modern Ontology of Emergency Department Presenting Problems – the HierArchical Presenting Problem ontologY (HaPPy) , 2017 .

[12]  Eric P. Xing,et al.  Proceedings of the 31st International Conference on International Conference on Machine Learning - Volume 32 , 2014 .

[13]  Ioannis Ch. Paschalidis,et al.  Clinical Concept Extraction with Contextual Word Embedding , 2018, NIPS 2018.

[14]  William Fleischman,et al.  Prediction of In-hospital Mortality in Emergency Department Patients With Sepsis: A Local Big Data-Driven, Machine Learning Approach. , 2016, Academic emergency medicine : official journal of the Society for Academic Emergency Medicine.

[15]  Nicholas W. Sterling,et al.  Prediction of emergency department patient disposition based on natural language processing of triage notes , 2019, Int. J. Medical Informatics.

[16]  Fei Wang,et al.  Readmission prediction via deep contextual embedding of clinical concepts , 2018, PloS one.

[17]  Slobodan Vucetic,et al.  Joint learning of representations of medical concepts and words from EHR data , 2017, 2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[18]  R. Somasundaram,et al.  Chief complaints in medical emergencies: do they relate to underlying disease and outcome? The Charité Emergency Medicine Study (CHARITEM) , 2013, European journal of emergency medicine : official journal of the European Society for Emergency Medicine.

[19]  Larry A. Nathanson,et al.  Improving documentation of presenting problems in the emergency department using a domain-specific ontology and machine learning-driven user interfaces , 2019, Int. J. Medical Informatics.

[20]  Christopher Beach,et al.  Chief complaint-based performance measures: a new focus for acute care quality measurement. , 2015, Annals of emergency medicine.

[21]  P J Haug,et al.  A comprehensive set of coded chief complaints for the emergency department. , 2001, Academic emergency medicine : official journal of the Society for Academic Emergency Medicine.

[22]  Rong Jin,et al.  Distance Metric Learning: A Comprehensive Survey , 2006 .

[23]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[24]  A. Haimovich,et al.  Predicting 72-hour and 9-day return to the emergency department using machine learning , 2019, JAMIA Open.

[25]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[26]  David Sontag,et al.  Learning Low-Dimensional Representations of Medical Concepts , 2016, CRI.

[27]  Steven Horng,et al.  Creating an automated trigger for sepsis clinical decision support at emergency department triage using machine learning , 2017, PloS one.

[28]  Stephanie W. Haas,et al.  Toward vocabulary control for chief complaint. , 2008, Academic emergency medicine : official journal of the Society for Academic Emergency Medicine.

[29]  Andrew L. Beam,et al.  Learning Contextual Hierarchical Structure of Medical Concepts with Poincairé Embeddings to Clarify Phenotypes , 2018, PSB.

[30]  Slobodan Vucetic,et al.  EHR phenotyping via jointly embedding medical concepts and words into a unified vector space , 2018, BMC Medical Informatics and Decision Making.

[31]  Geoffrey E. Hinton,et al.  Visualizing non-metric similarities in multiple maps , 2011, Machine Learning.

[32]  Z. Obermeyer,et al.  Making recording and analysis of chief complaint a priority for global emergency care research in low-income countries. , 2013, Academic emergency medicine : official journal of the Society for Academic Emergency Medicine.

[33]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[34]  J. Hollander,et al.  Patient returns to the emergency department: the time-to-return curve. , 2014, Academic emergency medicine : official journal of the Society for Academic Emergency Medicine.

[35]  Ilya Valmianski,et al.  Evaluating robustness of language models for chief complaint extraction from patient-generated text , 2019, ArXiv.

[36]  Wendy W Chapman,et al.  Classification of emergency department chief complaints into 7 syndromes: a retrospective analysis of 527,228 patients. , 2005, Annals of emergency medicine.