Evaluation of negation phrases in narrative clinical reports

OBJECTIVE Automatically identifying findings or diseases described in clinical textual reports requires determining whether clinical observations are present or absent. We evaluate the use of negation phrases and the frequency of negation in free-text clinical reports. METHODS A simple negation algorithm was applied to ten types of clinical reports (n=42,160) dictated during July 2000. We counted how often each of 66 negation phrases was used to mark a clinical observation as absent. Physicians read a random sample of 400 sentences, and precision was calculated for the negation phrases. We measured what proportion of clinical observations were marked as absent. RESULTS The negation algorithm was triggered by sixty negation phrases with just seven of the phrases accounting for 90% of the negations. The negation phrases received an overall precision of 97%, with "not" earning the lowest precision of 63%. Between 39% and 83% of all clinical observations were identified as absent by the negation algorithm, depending on the type of report analyzed. The most frequently used clinical observations were negated the majority of the time. CONCLUSION Because clinical observations in textual patient records are frequently negated, identifying accurate negation phrases is important to any system processing these reports.

[1]  John K. Vries,et al.  The medical archival system: An information retrieval system based on distributed parallel processing , 1991, Inf. Process. Manag..

[2]  Bruce G. Buchanan,et al.  Using computer modeling to help identify patient subgroups in clinical data repositories , 1998, AMIA.

[3]  Peter J. Haug,et al.  Symtext: a natural language understanding system for encoding free text medical data , 1998 .

[4]  George Hripcsak,et al.  A Health Information Network for Managing Innercity Tuberculosis: Bridging Clinical Care, Public Health, and Home Care, , 1999, Comput. Biomed. Res..

[5]  Bruce G. Buchanan,et al.  Identifying patient subgroups with simple Bayes' , 1999, AMIA.

[6]  George Hripcsak,et al.  Automating a severity score guideline for community-acquired pneumonia employing medical language processing of discharge summaries , 1999, AMIA.

[7]  Peter J. Haug,et al.  Using medical language processing to support real-time evaluation of pneumonia guidelines , 2000, AMIA.

[8]  Peter J. Haug,et al.  Research Paper: Automatic Detection of Acute Bacterial Pneumonia from Chest X-ray Reports , 2000, J. Am. Medical Informatics Assoc..

[9]  Cynthia Brandt,et al.  Research Paper: UMLS Concept Indexing for Production Databases: A Feasibility Study , 2001, J. Am. Medical Informatics Assoc..

[10]  Wendy W. Chapman,et al.  A Simple Algorithm for Identifying Negated Findings and Diseases in Discharge Summaries , 2001, J. Biomed. Informatics.

[11]  Peter L. Elkin,et al.  UMLS Concept Indexing for Production Databases: A Feasibility Study , 2001, J. Am. Medical Informatics Assoc..