Prescription information is an important component of electronic health records (EHRs). This information contains detailed medication instructions that are crucial for patients' well-being and is often detailed in the narrative portions of EHRs. As a result, narratives of EHRs need to be processed with natural language processing (NLP) methods that can extract medication and prescription information from free text. However, automatic methods for medication and prescription extraction from narratives face two major challenges: (1) dictionaries can fall short even when identifying well-defined and syntactically consistent categories of medication entities, (2) some categories of medication entities are sparse, and at the same time lexically (and syntactically) diverse. In this paper, we describe FABLE, a system for automatically extracting prescription information from discharge summaries. FABLE utilizes unannotated data to enhance annotated training data: it performs semi-supervised extraction of medication information using pseudo-labels with Conditional Random Fields (CRFs) to improve its understanding of incomplete, sparse, and diverse medication entities. When evaluated against the official benchmark set from the 2009 i2b2 Shared Task and Workshop on Medication Extraction, FABLE achieves a horizontal phrase-level F1-measure of 0.878, giving state-of-the-art performance and significantly improving on nearly all entity categories.
[1]
Goran Nenadic,et al.
Medication information extraction with linguistic pattern matching and semantic rules
,
2010,
J. Am. Medical Informatics Assoc..
[2]
Alan R. Aronson,et al.
An overview of MetaMap: historical perspective and recent advances
,
2010,
J. Am. Medical Informatics Assoc..
[3]
Özlem Uzuner,et al.
Extracting medication information from clinical text
,
2010,
J. Am. Medical Informatics Assoc..
[4]
Özlem Uzuner,et al.
Prescription extraction using CRFs and word embeddings
,
2017,
J. Biomed. Informatics.
[5]
Peter Szolovits,et al.
MIMIC-III, a freely accessible critical care database
,
2016,
Scientific Data.
[6]
Shuying Shen,et al.
2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text
,
2011,
J. Am. Medical Informatics Assoc..
[7]
Carol Friedman,et al.
A broad-coverage natural language processing system
,
2000,
AMIA.
[8]
Joel D. Martin,et al.
Machine-learned solutions for three stages of clinical information extraction: the state of the art at i2b2 2010
,
2011,
J. Am. Medical Informatics Assoc..