Building a Natural Language Processing Tool to Identify Patients With High Clinical Suspicion for Kawasaki Disease from Emergency Department Notes.

OBJECTIVE Delayed diagnosis of Kawasaki disease (KD) may lead to serious cardiac complications. We sought to create and test the performance of a natural language processing (NLP) tool, the KD-NLP, in the identification of emergency department (ED) patients for whom the diagnosis of KD should be considered. METHODS We developed an NLP tool that recognizes the KD diagnostic criteria based on standard clinical terms and medical word usage using 22 pediatric ED notes augmented by Unified Medical Language System vocabulary. With high suspicion for KD defined as fever and three or more KD clinical signs, KD-NLP was applied to 253 ED notes from children ultimately diagnosed with either KD or another febrile illness. We evaluated KD-NLP performance against ED notes manually reviewed by clinicians and compared the results to a simple keyword search. RESULTS KD-NLP identified high-suspicion patients with a sensitivity of 93.6% and specificity of 77.5% compared to notes manually reviewed by clinicians. The tool outperformed a simple keyword search (sensitivity = 41.0%; specificity = 76.3%). CONCLUSIONS KD-NLP showed comparable performance to clinician manual chart review for identification of pediatric ED patients with a high suspicion for KD. This tool could be incorporated into the ED electronic health record system to alert providers to consider the diagnosis of KD. KD-NLP could serve as a model for decision support for other conditions in the ED.

[1]  J. Newburger,et al.  The treatment of Kawasaki syndrome with intravenous gamma globulin. , 1986 .

[2]  M Takahashi,et al.  A single intravenous infusion of gamma globulin as compared with four infusions in the treatment of acute Kawasaki syndrome. , 1991, The New England journal of medicine.

[3]  D. Lindberg,et al.  The Unified Medical Language System , 1993, Methods of Information in Medicine.

[4]  N L Jain,et al.  Identification of suspected tuberculosis patients based on natural language processing of chest radiograph reports. , 1996, Proceedings : a conference of the American Medical Informatics Association. AMIA Fall Symposium.

[5]  Alan R. Aronson,et al.  Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program , 2001, AMIA.

[6]  Wendy W. Chapman,et al.  A Simple Algorithm for Identifying Negated Findings and Diseases in Discharge Summaries , 2001, J. Biomed. Informatics.

[7]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[8]  Wendy W. Chapman,et al.  Fever detection from free-text clinical records for biosurveillance , 2004, Journal of Biomedical Informatics.

[9]  Walter R Wilson,et al.  Diagnosis, Treatment, and Long-Term Management of Kawasaki Disease: A Statement for Health Professionals From the Committee on Rheumatic Fever, Endocarditis and Kawasaki Disease, Council on Cardiovascular Disease in the Young, American Heart Association , 2004, Pediatrics.

[10]  M. Glodé,et al.  Delayed Diagnosis of Kawasaki Syndrome: An Analysis of the Problem , 2005, Pediatrics.

[11]  L. Palinkas,et al.  Delayed Diagnosis by Physicians Contributes to the Development of Coronary Artery Aneurysms in Children With Kawasaki Syndrome , 2007, The Pediatric infectious disease journal.

[12]  S. Colan,et al.  Delayed Diagnosis of Kawasaki Disease: What Are the Risk Factors? , 2007, Pediatrics.

[13]  Roger J Lewis,et al.  Pediatric Preparedness of US Emergency Departments: A 2003 Survey , 2007, Pediatrics.

[14]  H. Senzaki Long-term outcome of Kawasaki disease. , 2008, Circulation.

[15]  John F. Hurdle,et al.  Extracting Information from Textual Documents in the Electronic Health Record: A Review of Recent Research , 2008, Yearbook of Medical Informatics.

[16]  Andrew M Kahn,et al.  When children with Kawasaki disease grow up: Myocardial and vascular complications in adulthood. , 2009, Journal of the American College of Cardiology.

[17]  Andrew J. Capraro,et al.  Utility of Lumbar Puncture for First Simple Febrile Seizure Among Children 6 to 18 Months of Age , 2009, Pediatrics.

[18]  Andrew J. Capraro,et al.  Yield of Lumbar Puncture Among Children Who Present With Their First Complex Febrile Seizure , 2010, Pediatrics.

[19]  Özlem Uzuner,et al.  Extracting medication information from clinical text , 2010, J. Am. Medical Informatics Assoc..

[20]  J. Whitin,et al.  A diagnostic algorithm combining clinical and molecular data distinguishes Kawasaki disease from other febrile illnesses , 2011, BMC medicine.

[21]  Shuying Shen,et al.  2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text , 2011, J. Am. Medical Informatics Assoc..

[22]  Christopher G. Chute,et al.  Mapping clinical phenotype data elements to standardized metadata repositories and controlled terminologies: the eMERGE Network experience , 2011, J. Am. Medical Informatics Assoc..

[23]  C. Chute,et al.  Electronic Medical Records for Genetic Research: Results of the eMERGE Consortium , 2011, Science Translational Medicine.

[24]  Steven H. Brown,et al.  Automated identification of postoperative complications within an electronic medical record using natural language processing. , 2011, JAMA.

[25]  J. Burns,et al.  Prevalence of Kawasaki Disease in Young Adults With Suspected Myocardial Ischemia , 2011, Circulation.

[26]  Jihoon Kim,et al.  iDASH: integrating data for analysis, anonymization, and sharing , 2012, J. Am. Medical Informatics Assoc..

[27]  Sampo Pyysalo,et al.  brat: a Web-based Tool for NLP-Assisted Text Annotation , 2012, EACL.

[28]  J. Kanegaye,et al.  Lymph-node-first presentation of Kawasaki disease compared with bacterial cervical adenitis and typical Kawasaki disease. , 2013, The Journal of pediatrics.

[29]  H. Cohen,et al.  Point-of-care differentiation of Kawasaki disease from other febrile illnesses. , 2015, Jornal de Pediatria.

[30]  I. Solti,et al.  Developing and evaluating an automated appendicitis risk stratification algorithm for pediatric patients in the emergency department , 2013, Journal of the American Medical Informatics Association : JAMIA.

[31]  Hongfang Liu,et al.  Research and applications: Patient-level temporal aggregation for text-based asthma status ascertainment , 2014, J. Am. Medical Informatics Assoc..

[32]  K. Bretonnel Cohen,et al.  Assessing the similarity of surface linguistic features related to epilepsy across pediatric hospitals , 2014, J. Am. Medical Informatics Assoc..

[33]  Son Doan,et al.  Natural Language Processing in Biomedicine: A Unified System Architecture Overview , 2014, Methods in molecular biology.

[34]  J. Burns,et al.  Acute myocardial ischemia in adults secondary to missed Kawasaki disease in childhood. , 2014, The American journal of cardiology.

[35]  Judith W. Dexheimer,et al.  Natural Language Processing: Applications in Pediatric Research , 2016 .