A corpus of clinical narratives annotated with temporal information

Clinical reports often include descriptions of events in the patient's medical history, as well as explicit or implicit temporal information about these events. We are working towards applying deep Natural Language Processing tools towards understanding such narratives. This requires both the extraction and classification of the relevant events, and the placing of those events in time, or at least in relation to one another. Although several corpora of news data exist that have been annotated using the TimeML schema, similar corpora of clinical reports are not readily available. In this paper we report on the design of a small corpus and the annotation schema we developed, based on data from the fourth i2b2/VA challenge. These data include, among others, annotations for medical problems, tests, and treatments in clinical reports from several healthcare institutions. We have selected a subset of clinical reports and added annotations similar to those used in the TempEval tasks for the annotation of events, time expressions and temporal relations for the news domain. The annotations have been made freely available to the research community.

[1]  Tommaso Caselli,et al.  SemEval-2010 Task 13: TempEval-2 , 2010, *SEMEVAL.

[2]  James Pustejovsky,et al.  The TempEval challenge: identifying temporal relations in text , 2009, Lang. Resour. Evaluation.

[3]  James Pustejovsky,et al.  Annotating temporal and event quantification , 2010 .

[4]  John F. Hurdle,et al.  Extracting Information from Textual Documents in the Electronic Health Record: A Review of Recent Research , 2008, Yearbook of Medical Informatics.

[5]  Danielle L. Mowery,et al.  Temporal Annotation of Clinical Text , 2008, BioNLP.

[6]  Wayne H. Ward,et al.  Towards Temporal Relation Discovery from the Clinical Narrative , 2009, AMIA.

[7]  James Pustejovsky,et al.  TimeML: Robust Specification of Event and Temporal Expressions in Text , 2003, New Directions in Question Answering.

[8]  S. A. Jalaee,et al.  Abstract , 1999, Veterinary Record.

[9]  Shuying Shen,et al.  2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text , 2011, J. Am. Medical Informatics Assoc..

[10]  George Hripcsak,et al.  A temporal constraint structure for extracting temporal information from clinical narrative , 2006, J. Biomed. Informatics.

[11]  Nate Blaylock,et al.  Building Timelines from Narrative Clinical Records: Initial Results Based-on Deep Natural Language Understanding , 2011, BioNLP@ACL.

[12]  Christopher G Chute,et al.  CNTRO: A Semantic Web Ontology for Temporal Relation Inferencing in Clinical Narratives. , 2010, AMIA ... Annual Symposium proceedings. AMIA Symposium.