Towards practical temporal relation extraction from clinical notes: An analysis of direct temporal relations

Following the conventions developed in general domain, most of the current work on clinical temporal relation identification aims to identify a comprehensive set of temporal relations from source documents. This includes both explicit relations that is described in the documents and implicit relations that are identifiable only through inference. Although such an approach may provide a complete view of temporal information provided in a document, some temporal relations may not be practically essential, depending on the clinical application at hand. In addition, the performances of current systems that identify both explicit and implicit relations are still low and how to enhance the performances to be enough for practical use is not clear yet. In this paper, we propose focus on a subset of temporal relations, in order to provide insights into how to develop practically useful temporal information extraction methods for clinical text. We focus on “direct” temporal relations, which are intra-sentential temporal relations between a time expression and an event mention with limited syntactic distance. A corpus of 120 discharge summaries is constructed, leveraging an existing corpus, the 2012 i2b2 corpus. We show that the direct temporal relations constitute a major category of temporal relations. In addition, we show that the performance of the state-of-the art temporal relation extraction system, which is developed for both implicit and explicit relations, on direct temporal relations is still low. This indicates the need for development of methods tailored to direct temporal relations.

[1]  Danielle L. Mowery,et al.  Task 2 : ShARe/CLEF eHealth Evaluation Lab 2014 , 2013 .

[2]  Chen Lin,et al.  Multilayered temporal modeling for the clinical domain , 2016, J. Am. Medical Informatics Assoc..

[3]  George Hripcsak,et al.  Handling Implicit and Uncertain Temporal Information in Medical Text , 2006, AMIA.

[4]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[5]  James Pustejovsky,et al.  SemEval-2017 Task 12: Clinical TempEval , 2017, *SEMEVAL.

[6]  Yaoyun Zhang,et al.  UTHealth at SemEval-2016 Task 12: an End-to-End System for Temporal Information Extraction from Clinical Notes , 2016, *SEMEVAL.

[7]  George Hripcsak,et al.  Research Paper: The Evaluation of a Temporal Reasoning System in Processing Clinical Discharge Summaries , 2008, J. Am. Medical Informatics Assoc..

[8]  Angel X. Chang,et al.  SUTime: A library for recognizing and normalizing time expressions , 2012, LREC.

[9]  James Pustejovsky,et al.  SemEval-2015 Task 6: Clinical TempEval , 2015, *SEMEVAL.

[10]  Yaoyun Zhang,et al.  UTH-CCB: The Participation of the SemEval 2015 Challenge – Task 14 , 2015, *SEMEVAL.

[11]  Michael Gertz,et al.  HeidelTime: High Quality Rule-Based Extraction and Normalization of Temporal Expressions , 2010, *SEMEVAL.

[12]  Anna Rumshisky,et al.  Evaluating temporal relations in clinical text: 2012 i2b2 Challenge , 2013, J. Am. Medical Informatics Assoc..

[13]  Yaoyun Zhang,et al.  Leveraging syntactic and semantic graph kernels to extract pharmacokinetic drug drug interactions from biomedical literature , 2016, BMC Systems Biology.

[14]  Hans-Peter Kriegel,et al.  Extraction of semantic biomedical relations from text using conditional random fields , 2008, BMC Bioinformatics.

[15]  George Hripcsak,et al.  System Architecture for Temporal Information Extraction, Representationand Reasoning in Clinical Narrative Reports , 2005, AMIA.

[16]  Hua Xu,et al.  A hybrid system for temporal information extraction from clinical text , 2013, J. Am. Medical Informatics Assoc..

[17]  Anna Rumshisky,et al.  Annotating temporal information in clinical narratives , 2013, J. Biomed. Informatics.