Assessing clinical trial eligibility with logic expression queries

This paper introduces a system that processes clinical trials using a combination of natural language processing and database techniques. We process web-based clinical trial recruitment pages to extract semantic information reflecting eligibility criteria for potential participants. From this information we then formulate a query that can match criteria against medical data in patient records. The resulting system reflects a tight coupling of web-based information extraction, natural language processing, medical informatic approaches to clinical knowledge representation, and large-scale database technologies. We present an evaluation of the system and future directions for further system development.

[1]  Stanley M. Huff,et al.  Detailed Clinical Models for Sharable, Executable Guidelines , 2004, MedInfo.

[2]  James Pustejovsky,et al.  Automatic Extraction of Acronym-meaning Pairs from MEDLINE Databases , 2001, MedInfo.

[3]  Javed Mostafa,et al.  Detecting Gene Relations from MEDLINE Abstracts , 2000, Pacific Symposium on Biocomputing.

[4]  C. Lindberg The Unified Medical Language System (UMLS) of the National Library of Medicine. , 1990, Journal.

[5]  Marti A. Hearst,et al.  A Simple Algorithm for Identifying Abbreviation Definitions in Biomedical Text , 2002, Pacific Symposium on Biocomputing.

[6]  Clint Tustison Logical Form Identification for Medical Clinical Trials Logical Form Identification for Medical Clinical Trials , 2004 .

[7]  Gerold Schneider,et al.  A Linguistic Comparison of Constituency, Dependency and Link Grammar , 1998 .

[8]  T A Pryor,et al.  The arden syntax for medical logic modules , 1990, International journal of clinical monitoring and computing.

[9]  David W. Embley,et al.  Generating Medical Logic Modules for Clinical Trial Eligibility Criteria , 2003, AMIA.

[10]  Lawrence Hunter,et al.  Extracting Molecular Binding Relationships from Biomedical Text , 2000, ANLP.

[11]  Peter Szolovits,et al.  Adding a Medical Lexicon to an English Parser , 2003, AMIA.

[12]  Allen Newell,et al.  SOAR: An Architecture for General Intelligence , 1987, Artif. Intell..

[13]  William Taysom,et al.  An NLP System for Extracting and Representing Knowledge from Abbreviated Text , 2001 .

[14]  Clint A.Tustison Automatically extracting predicate-argument structures from natural language texts , 2004 .

[15]  P J Haug,et al.  Designing a controlled medical vocabulary server: the VOSER project. , 1994, Computers and biomedical research, an international journal.

[16]  A. McCray Better Access to Information about Clinical Trials , 2000, Annals of Internal Medicine.

[17]  Matteo Magnani,et al.  A Unified Approach to Structured, Semistructured and Unstructured Data , 2004 .

[18]  A. Newell Unified Theories of Cognition , 1990 .

[19]  Razvan C. Bunescu,et al.  Integrating Co-occurrence Statistics with Information Extraction for Robust Retrieval of Protein Interactions from Medline , 2006, BioNLP@NAACL-HLT.

[21]  Kent A. Spackman,et al.  Compositional concept representation using SNOMED: towards further convergence of clinical terminologies , 1998, AMIA.

[22]  Daniel Dominic Sleator,et al.  Parsing English with a Link Grammar , 1995, IWPT.

[23]  James Pustejovsky,et al.  Robust Relational Parsing Over Biomedical Literature: Extracting Inhibit Relations , 2001, Pacific Symposium on Biocomputing.

[24]  S M Huff,et al.  Linking a Medical Vocabulary to a Clinical Data Model using Abstract Syntax Notation 1 , 1998, Methods of Information in Medicine.