ConText: An algorithm for determining negation, experiencer, and temporal status from clinical reports

In this paper we describe an algorithm called ConText for determining whether clinical conditions mentioned in clinical reports are negated, hypothetical, historical, or experienced by someone other than the patient. The algorithm infers the status of a condition with regard to these properties from simple lexical clues occurring in the context of the condition. The discussion and evaluation of the algorithm presented in this paper address the questions of whether a simple surface-based approach which has been shown to work well for negation can be successfully transferred to other contextual properties of clinical conditions, and to what extent this approach is portable among different clinical report types. In our study we find that ConText obtains reasonable to good performance for negated, historical, and hypothetical conditions across all report types that contain such conditions. Conditions experienced by someone other than the patient are very rarely found in our report set. A comprehensive solution to the problem of determining whether a clinical condition is historical or recent requires knowledge above and beyond the surface clues picked up by ConText.

[1]  Danielle L. Mowery,et al.  Temporal Annotation of Clinical Text , 2008, BioNLP.

[2]  Wendy W. Chapman,et al.  ConText: An Algorithm for Identifying Contextual Features from Clinical Text , 2007, BioNLP@ACL.

[3]  Wendy W. Chapman,et al.  Identifying Respiratory Findings in Emergency Department Reports for Biosurveillance using MetaMap , 2004, MedInfo.

[4]  Yang Huang,et al.  A novel hybrid approach to automated negation detection in clinical radiology reports. , 2007, Journal of the American Medical Informatics Association : JAMIA.

[5]  Özlem Uzuner,et al.  Machine learning and rule-based approaches to assertion classification. , 2009, Journal of the American Medical Informatics Association : JAMIA.

[6]  Carol Friedman,et al.  Research Paper: A General Natural-language Text Processor for Clinical Radiology , 1994, J. Am. Medical Informatics Assoc..

[7]  Peter J. Haug,et al.  Natural language processing to extract medical problems from electronic clinical documents: Performance evaluation , 2006, J. Biomed. Informatics.

[8]  Ricky K. Taira,et al.  A statistical natural language processor for medical reports , 1999, AMIA.

[9]  Wendy W. Chapman,et al.  A Simple Algorithm for Identifying Negated Findings and Diseases in Discharge Summaries , 2001, J. Biomed. Informatics.

[10]  Prakash M. Nadkarni,et al.  Research Paper: Use of General-purpose Negation Detection to Augment Concept Indexing of Medical Documents: A Quantitative Study Using the UMLS , 2001, J. Am. Medical Informatics Assoc..

[11]  Dean F Sittig,et al.  Application of Information Technology j MediClass : A System for Detecting and Classifying Encounter-based Clinical Events in Any Electronic Medical , 2005 .

[12]  Martin Romacker,et al.  MedSynDikate - a natural language system for the extraction of medical information from findings reports , 2002, Int. J. Medical Informatics.

[13]  Wendy W. Chapman,et al.  Evaluation of negation phrases in narrative clinical reports , 2001, AMIA.

[14]  R G Newcombe,et al.  Improved confidence intervals for the difference between binomial proportions based on paired data. , 1998, Statistics in medicine.

[15]  Olivier Bodenreider,et al.  From indexing the biomedical literature to coding clinical text: experience with MTI and machine learning approaches , 2007, BioNLP@ACL.

[16]  Gregory F Cooper,et al.  Research Paper: Creating a Text Classifier to Detect Radiology Reports Describing Mediastinal Findings Associated with Inhalational Anthrax and Other Disorders , 2003, J. Am. Medical Informatics Assoc..

[17]  C A Sneiderman,et al.  Finding the findings: identification of findings in medical literature using restricted natural language processing. , 1996, Proceedings : a conference of the American Medical Informatics Association. AMIA Fall Symposium.

[18]  Shuying Shen,et al.  Optimizing A Syndromic Surveillance Text Classifier for Influenza-like Illness: Does Document Source Matter? , 2008, AMIA.

[19]  Wendy W. Chapman,et al.  Evaluating the Effectiveness of Four Contextual Features in Classifying Annotated Clinical Conditions in Emergency Department Reports , 2006, AMIA.

[20]  A. Berger FUNDAMENTALS OF BIOSTATISTICS , 1969 .

[21]  Peter J. Haug,et al.  MPLUS: a probabilistic medical language understanding system , 2002, ACL Workshop on Natural Language Processing in the Biomedical Domain.

[22]  W. Bruce Croft,et al.  Research Paper: Ad Hoc Classification of Radiology Reports , 1999, J. Am. Medical Informatics Assoc..

[23]  Bernard R. Rosner,et al.  Fundamentals of Biostatistics. , 1992 .

[24]  S. Trent Rosenbloom,et al.  NLP-based Identification of Pneumonia Cases from Free-Text Radiological Reports , 2008, AMIA.

[25]  S. Trent Rosenbloom,et al.  Biosurveillance Evaluation of SNOMED CT's Terminology (BEST Trial): Coverage of Chief Complaints , 2008, MIE.

[26]  George Hripcsak,et al.  A temporal constraint structure for extracting temporal information from clinical narrative , 2006, J. Biomed. Informatics.

[27]  G Hripcsak,et al.  Natural language processing and its future in medicine. , 1999, Academic medicine : journal of the Association of American Medical Colleges.

[28]  Alan R. Aronson,et al.  Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program , 2001, AMIA.

[29]  Jules J Berman,et al.  Implementation and evaluation of a negation tagger in a pipeline-based system for information extract from pathology reports. , 2004, Studies in health technology and informatics.

[30]  Wendy W. Chapman,et al.  Inductive creation of an annotation schema for manually indexing clinical conditions from emergency department reports , 2006, J. Biomed. Informatics.

[31]  Padmini Srinivasan,et al.  The Language of Bioscience: Facts, Speculations, and Statements In Between , 2004, HLT-NAACL 2004.

[32]  Nigel Collier,et al.  Zone analysis in biology articles as a basis for information extraction , 2006, Int. J. Medical Informatics.

[33]  Katharina Kaiser,et al.  Syntactical Negation Detection in Clinical Practice Guidelines , 2008, MIE.

[34]  Peter L. Elkin,et al.  A controlled trial of automated classification of negation from clinical notes , 2005, BMC Medical Informatics Decis. Mak..

[35]  Jules J. Berman,et al.  Implementation and Evaluation of a Negation Tagger in a Pipeline-based System for Information Extraction from Pathology Reports , 2004, MedInfo.

[36]  Ben Medlock,et al.  Exploring hedge identification in biomedical literature , 2008, J. Biomed. Informatics.