Extracting temporal constraints from clinical research eligibility criteria using conditional random fields.

Temporal constraints are present in 38% of clinical research eligibility criteria and are crucial for screening patients. However, eligibility criteria are often written as free text, which is not amenable for computer processing. In this paper, we present an ontology-based approach to extracting temporal information from clinical research eligibility criteria. We generated temporal labels using a frame-based temporal ontology. We manually annotated 150 free-text eligibility criteria using the temporal labels and trained a parser using Conditional Random Fields (CRFs) to automatically extract temporal expressions from eligibility criteria. An evaluation of an additional 60 randomly selected eligibility criteria using manual review achieved an overall precision of 83%, a recall of 79%, and an F-score of 80%. We illustrate the application of temporal extraction with the use cases of question answering and free-text criteria querying.

[1]  James Pustejovsky,et al.  Automating Temporal Annotation with TARSQI , 2005, ACL.

[2]  James F. Allen Maintaining knowledge about temporal intervals , 1983, CACM.

[3]  S W Tu,et al.  PROTEGE-II: computer support for development of intelligent systems from libraries of components. , 1995, Medinfo. MEDINFO.

[4]  Nikolai Vazov A System For Extraction Of Temporal Expressions From French Texts Based On Syntactic And Semantic Constraints , 2001, ACL 2001.

[5]  George Hripcsak,et al.  Research Paper: The Evaluation of a Temporal Reasoning System in Processing Clinical Discharge Summaries , 2008, J. Am. Medical Informatics Assoc..

[6]  A. McCray The UMLS Semantic Network. , 1989 .

[7]  Fernando Pereira,et al.  Shallow Parsing with Conditional Random Fields , 2003, NAACL.

[8]  Burr Settles,et al.  Biomedical Named Entity Recognition using Conditional Random Fields and Rich Feature Sets , 2004, NLPBA/BioNLP.

[9]  Jr. G. Forney,et al.  The viterbi algorithm , 1973 .

[10]  George Hripcsak,et al.  A temporal constraint structure for extracting temporal information from clinical narrative , 2006, J. Biomed. Informatics.

[11]  Mor Peleg,et al.  A practical method for transforming free-text eligibility criteria into computable criteria , 2011, J. Biomed. Informatics.

[12]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[13]  Inderjeet Mani,et al.  Recent developments in temporal information extraction , 2003, RANLP.

[14]  Natalya F. Noy,et al.  Knowledge-Acquisition Interfaces for Domain Experts: An Empirical Evaluation of Protégé-2000 , 2000 .

[15]  Xiaoying Wu,et al.  EliXR: an approach to eligibility criteria extraction and representation , 2011, J. Am. Medical Informatics Assoc..

[16]  M A Musen,et al.  Knowledge acquisition for temporal abstraction. , 1996, Proceedings : a conference of the American Medical Informatics Association. AMIA Fall Symposium.

[17]  Chunhua Weng,et al.  Temporal knowledge representation for scheduling tasks in clinical trial protocols , 2002, AMIA.

[18]  Chunhua Weng,et al.  Corpus-based Approach to Creating a Semantic Lexicon for Clinical Research Eligibility Criteria from UMLS , 2010, Summit on translational bioinformatics.

[19]  Christopher G Chute,et al.  CNTRO: A Semantic Web Ontology for Temporal Relation Inferencing in Clinical Narratives. , 2010, AMIA ... Annual Symposium proceedings. AMIA Symposium.