Structuring Free-text Microbiology Culture Reports For Secondary Use

Microbiology lab culture reports are a frequently used diagnostic tool for clinical providers. However, their incorporation into clinical surveillance applications and evidence-based medicine can be severely hindered by the free-text nature of these reports. In this work, we (1) created a microbiology culture template to structure free-text microbiology reports, (2) generated an annotated microbiology report corpus, and (3) built a microbiology information extraction system. Specifically, we combined rule-based, hybrid, and statistical techniques to extract microbiology entities and fill templates for structuring data. System performances were favorable, with entity f1-score 0.889 and relation f1-score 0.795. We plan to incorporate these extractions as features for our ongoing ventilator-associated pneumonia surveillance project, though this tool can be used as an upstream process in other applications. Our newly created corpus includes 1442 unique gram stain and culture microbiology reports generated from a cohort of 715 patients at the University of Washington Medical Facilities.

[1]  Cosmin Adrian Bejan,et al.  On-time clinical phenotype prediction based on narrative reports , 2013, AMIA.

[2]  Alan R. Aronson,et al.  Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program , 2001, AMIA.

[3]  E. Baron,et al.  Executive summary: a guide to utilization of the microbiology laboratory for diagnosis of infectious diseases: 2013 recommendations by the Infectious Diseases Society of America (IDSA) and the American Society for Microbiology (ASM)(a). , 2013, Clinical Infectious Diseases.

[4]  Sampo Pyysalo,et al.  brat: a Web-based Tool for NLP-Assisted Text Annotation , 2012, EACL.

[5]  Suzanne Lutwick,et al.  Validation of a multicenter computer-based surveillance system for hospital-acquired bloodstream infections in neonatal intensive care departments. , 2004, American journal of infection control.

[6]  Peter L. Elkin,et al.  Detection of Blood Culture Bacterial Contamination using Natural Language Processing , 2009, AMIA.

[7]  Jacques Bille,et al.  Comparison of Automated Strategies for Surveillance of Nosocomial Bacteremia , 2007, Infection Control & Hospital Epidemiology.

[8]  E. Baron,et al.  A Guide to Utilization of the Microbiology Laboratory for Diagnosis of Infectious Diseases: 2013 Recommendations by the Infectious Diseases Society of America (IDSA) and the American Society for Microbiology (ASM)a , 2013, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[9]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[10]  Matthew H. Samore,et al.  Identification of methicillin-resistant Staphylococcus aureus within the Nation’s Veterans Affairs Medical Centers using natural language processing , 2012, BMC Medical Informatics and Decision Making.