A new algorithmic approach for the extraction of temporal associations from clinical narratives with an application to medical product safety surveillance reports

The sheer volume of textual information that needs to be reviewed and analyzed in many clinical settings requires the automated retrieval of key clinical and temporal information. The existing natural language processing systems are often challenged by the low quality of clinical texts and do not demonstrate the required performance. In this study, we focus on medical product safety report narratives and investigate the association of the clinical events with appropriate time information. We developed a novel algorithm for tagging and extracting temporal information from the narratives, and associating it with related events. The proposed algorithm minimizes the performance dependency on text quality by relying only on shallow syntactic information and primitive properties of the extracted event and time entities. We demonstrated the effectiveness of the proposed algorithm by evaluating its tagging and time assignment capabilities on 140 randomly selected reports from the US Vaccine Adverse Event Reporting System (VAERS) and the FDA (Food and Drug Administration) Adverse Event Reporting System (FAERS). We compared the performance of our tagger with the SUTime and HeidelTime taggers, and our algorithm's event-time associations with the Temporal Awareness and Reasoning Systems for Question Interpretation (TARSQI). We further evaluated the ability of our algorithm to correctly identify the time information for the events in the 2012 Informatics for Integrating Biology and the Bedside (i2b2) Challenge corpus. For the time tagging task, our algorithm performed better than the SUTime and the HeidelTime taggers (F-measure in VAERS and FAERS: Our algorithm: 0.86 and 0.88, SUTime: 0.77 and 0.74, and HeidelTime 0.75 and 0.42, respectively). In the event-time association task, our algorithm assigned an inappropriate timestamp for 25% of the events, while the TARSQI toolkit demonstrated a considerably lower performance, assigning inappropriate timestamps in 61.5% of the same events. Our algorithm also supported the correct calculation of 69% of the event relations to the section time in the i2b2 testing set.

[1]  Eric Fosler-Lussier,et al.  Temporal Classification of Medical Events , 2012, BioNLP@HLT-NAACL.

[2]  Marc Boyer,et al.  Use of data mining at the Food and Drug Administration , 2016, J. Am. Medical Informatics Assoc..

[3]  Natalia Grabar,et al.  Tuning HeidelTime for identifying time expressions in clinical texts in English and French , 2014, Louhi@EACL.

[4]  Yung-Chun Chang,et al.  TEMPTING system: A hybrid method of rule and machine learning for temporal relation extraction in patient discharge summaries , 2013, J. Biomed. Informatics.

[5]  Natalia Grabar,et al.  Eventual situations for timeline extraction from clinical reports , 2013, J. Am. Medical Informatics Assoc..

[6]  T Botsis,et al.  Can Natural Language Processing Improve the Efficiency of Vaccine Adverse Event Report Review? , 2015, Methods of Information in Medicine.

[7]  Anna Rumshisky,et al.  Evaluating temporal relations in clinical text: 2012 i2b2 Challenge , 2013, J. Am. Medical Informatics Assoc..

[8]  Özlem Uzuner,et al.  Chronology of your health events: Approaches to extracting temporal relations from medical narratives , 2013, J. Biomed. Informatics.

[9]  Anna Rumshisky,et al.  Temporal reasoning over clinical text: the state of the art , 2013, J. Am. Medical Informatics Assoc..

[10]  Anna Rumshisky,et al.  Annotating temporal information in clinical narratives , 2013, J. Biomed. Informatics.

[11]  Michael Gertz,et al.  HeidelTime: High Quality Rule-Based Extraction and Normalization of Temporal Expressions , 2010, *SEMEVAL.

[12]  Daniel Sonntag,et al.  Deep Parsing at the CLEF2014 IE Task , 2014, CLEF.

[13]  Chen Lin,et al.  Multilayered temporal modeling for the clinical domain , 2016, J. Am. Medical Informatics Assoc..

[14]  Vincent Ng,et al.  Temporal Relation Identification and Classification in Clinical Notes , 2013, BCB.

[15]  Angel X. Chang,et al.  SUTime: A library for recognizing and normalizing time expressions , 2012, LREC.

[16]  Andrew McCallum,et al.  Information extraction from research papers using conditional random fields , 2006, Inf. Process. Manag..

[17]  George Hripcsak,et al.  A temporal constraint structure for extracting temporal information from clinical narrative , 2006, J. Biomed. Informatics.

[18]  Cui Tao,et al.  Comprehensive temporal information detection from clinical text: medical events, time, and TLINK identification , 2013, J. Am. Medical Informatics Assoc..

[19]  Chen Lin,et al.  Temporal Annotation in the Clinical Domain , 2014, TACL.

[20]  Michael Gertz,et al.  Multilingual and cross-domain temporal tagging , 2012, Language Resources and Evaluation.

[21]  Nate Blaylock,et al.  Building Timelines from Narrative Clinical Records: Initial Results Based-on Deep Natural Language Understanding , 2011, BioNLP@ACL.

[22]  Jean-Raymond Abrial,et al.  On B , 1998, B.

[23]  James Pustejovsky,et al.  TimeML: Robust Specification of Event and Temporal Expressions in Text , 2003, New Directions in Question Answering.

[24]  James Pustejovsky,et al.  Increasing Informativeness in Temporal Annotation , 2011, Linguistic Annotation Workshop.

[25]  James Pustejovsky,et al.  SemEval-2015 Task 6: Clinical TempEval , 2015, *SEMEVAL.

[26]  Leon Derczynski,et al.  USFD2: Annotating Temporal Expresions and TLINKs for TempEval-2 , 2010, *SEMEVAL.

[27]  Robert T. Chen,et al.  Understanding vaccine safety information from the Vaccine Adverse Event Reporting System. , 2004, The Pediatric infectious disease journal.

[29]  Robert Ball,et al.  Vaccine adverse event text mining system for extracting features from vaccine safety reports , 2012, J. Am. Medical Informatics Assoc..

[30]  Hua Xu,et al.  A hybrid system for temporal information extraction from clinical text , 2013, J. Am. Medical Informatics Assoc..

[31]  James Pustejovsky,et al.  ISO-TimeML: An International Standard for Semantic Annotation , 2010, LREC.

[32]  正博 頭金 FDA adverse event reporting system (FAERS) , 2015 .

[33]  Graciela Gonzalez-Hernandez,et al.  Towards generating a patient's timeline: Extracting temporal relationships from clinical notes , 2013, J. Biomed. Informatics.

[34]  Joshua C. Denny,et al.  Detecting temporal expressions in medical narratives , 2013, Int. J. Medical Informatics.

[35]  Christopher G Chute,et al.  CNTRO: A Semantic Web Ontology for Temporal Relation Inferencing in Clinical Narratives. , 2010, AMIA ... Annual Symposium proceedings. AMIA Symposium.