Developing and evaluating an automated appendicitis risk stratification algorithm for pediatric patients in the emergency department

Objective: To evaluate a proposed automated method, based on natural language processing (NLP) and machine learning, that risk-stratifies patients with abdominal pain by analyzing the content of the electronic health record (EHR).

Methods: We analyzed the EHRs of a random sample of 2100 pediatric emergency department (ED) patients with abdominal pain, including all with a final diagnosis of appendicitis. We developed an automated system that extracts relevant elements from ED physician notes and lab values and assigns a risk category for acute appendicitis (high, equivocal, or low) based on the Pediatric Appendicitis Score. We evaluated the system's recall, specificity, and precision against a manually created gold standard (chart reviews by ED physicians).

Results: The system achieved an average F-measure of 0.867 (0.869 recall and 0.863 precision) for risk classification, which was comparable to the performance of physician experts. Recall/precision were 0.897/0.952 in the low-risk category, 0.855/0.886 in the high-risk category, and 0.854/0.766 in the equivocal-risk category. The information that the system required as input to achieve a high F-measure was available within the first 4 h of the ED visit.

Conclusions: Automated appendicitis risk categorization based on EHR content, including information from clinical notes, performs comparably to physician chart reviewers, as measured by their inter-annotator agreement. It represents a promising new approach to computerized decision support that promotes the application of evidence-based medicine at the point of care.
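To show how the per-category figures in the Results relate to the reported average F-measure, here is a minimal sketch. It assumes the F-measure is the balanced harmonic mean of recall and precision (F1) and that the "average" is an unweighted mean over the three risk categories. Both are assumptions, since the abstract does not state the averaging scheme, and the function and variable names are illustrative only.

```python
# Recall/precision per risk category, copied from the Results above.
reported = {
    "low": (0.897, 0.952),
    "high": (0.855, 0.886),
    "equivocal": (0.854, 0.766),
}


def f_measure(recall: float, precision: float) -> float:
    """Balanced F-measure (F1): harmonic mean of recall and precision."""
    if recall + precision == 0:
        return 0.0
    return 2 * recall * precision / (recall + precision)


# F1 for each category.
per_class_f = {name: f_measure(r, p) for name, (r, p) in reported.items()}

# Assumption: unweighted (macro) average across the three categories.
macro_f = sum(per_class_f.values()) / len(per_class_f)

for name, value in per_class_f.items():
    print(f"{name:>9}: F = {value:.3f}")
print(f"macro-average F = {macro_f:.3f}")  # about 0.867
```

Under these assumptions the unweighted mean of the three per-category F-measures rounds to 0.867, which agrees with the reported average. The equivocal-risk category contributes the lowest F-measure, driven by its lower precision.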
