A novel semantic representation for eligibility criteria in clinical trials

Eligibility Criteria (EC) comprise an important part of a clinical study, being determinant of its cost, duration and overall success. Their formal, computer-processable description can significantly improve clinical trial design and conduction by enabling their intelligent processing, replicability and linkability with other data. For EC representation purposes, related standards were investigated, along with published literature. Moreover, a considerable number of clinicaltrials.gov studies was analyzed in collaboration with clinical experts for the determination and classification of parameters of clinical research importance. The outcome of this process was the EC Representation; a CDISC-compliant schema for organizing criteria along with a patient-centric model for their formal expression, properly linked with international classifications and codifications. Its evaluation against 200 randomly selected EC indicated that it can adequately serve its purpose, while it can be also combined with existing tools and components developed for both EC specification and especially application to Electronic Health Records.

[1]  Frank van Harmelen,et al.  Enhancing reuse of structured eligibility criteria and supporting their relaxation , 2015, J. Biomed. Informatics.

[2]  M. Dickson,et al.  The cost of new drug discovery and development. , 2004, Discovery medicine.

[3]  Donald D. Chamberlin,et al.  SEQUEL: A structured English query language , 1974, SIGFIDET '74.

[4]  R. Kush,et al.  Global clinical data interchange standards are here! , 2007, Drug discovery today.

[5]  A. Kiss,et al.  Eligibility criteria of randomized controlled trials published in high-impact general medical journals: a systematic sampling review. , 2007, JAMA.

[6]  Christopher G Chute,et al.  Analyzing the heterogeneity and complexity of Electronic Health Record oriented phenotyping algorithms. , 2011, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[7]  Michael N. Cantor,et al.  Analysis of eligibility criteria representation in industry-standard clinical trial protocols , 2013, J. Biomed. Informatics.

[8]  Yarden Katz,et al.  Pellet: A practical OWL-DL reasoner , 2007, J. Web Semant..

[9]  Chunhua Weng,et al.  Dynamic categorization of clinical research eligibility criteria by hierarchical clustering , 2011, J. Biomed. Informatics.

[10]  E. Brown,et al.  The Medical Dictionary for Regulatory Activities (MedDRA) , 1999, Drug safety.

[11]  Adaline C. Hayden,et al.  Classification of Diseases , 1896, The Dental register.

[12]  Theodora A. Varvarigou,et al.  Building a Repository for Inferring the Meaning of Abbreviations Used in Clinical Studies , 2017, J. Comput..

[13]  Mark A. Musen,et al.  The Open Biomedical Annotator , 2009, Summit on translational bioinformatics.

[14]  P. Rothwell,et al.  External validity of randomised controlled trials: “To whom do the results of this trial apply?” , 2005, The Lancet.

[15]  T Beale,et al.  openEHR Architecture Architecture Overview , 2006 .

[16]  Julie Evans,et al.  Model Formulation: The BRIDG Project: A Technical Report , 2008, J. Am. Medical Informatics Assoc..

[17]  Michael Darsow,et al.  ChEBI: a database and ontology for chemical entities of biological interest , 2007, Nucleic Acids Res..

[18]  I. Kola,et al.  Can the pharmaceutical industry reduce attrition rates? , 2004, Nature Reviews Drug Discovery.

[19]  Theodora A. Varvarigou,et al.  An Intelligent Ontology Alignment Tool Dealing with Complicated Mismatches , 2014, SWAT4LS.

[20]  Ivar Jacobson,et al.  The unified modeling language reference manual , 2010 .

[21]  Rebecca Kush,et al.  A Standard Computable Clinical Trial Protocol: The Role of the BRIDG Model , 2007 .

[22]  Theodora A. Varvarigou,et al.  An Advanced Query and Result Rewriting Mechanism for Information Retrieval Purposes from RDF Datasources , 2015, KESW.

[23]  Claudio Gutiérrez,et al.  The Expressive Power of SPARQL , 2008, SEMWEB.

[24]  Marco Eichelberg,et al.  A survey and analysis of Electronic Healthcare Record standards , 2005, CSUR.

[25]  Mor Peleg,et al.  A practical method for transforming free-text eligibility criteria into computable criteria , 2011, J. Biomed. Informatics.

[26]  Gerold Schneider,et al.  Attempto Controlled English Meets the Challenges of Knowledge Representation, Reasoning, Interoperability and User Interfaces , 2006, FLAIRS.

[27]  Charles N. Mead,et al.  The HL7 Reference Information Model Under Scrutiny , 2006, MIE.

[28]  G Hripcsak,et al.  Natural language processing and its future in medicine. , 1999, Academic medicine : journal of the Association of American Medical Colleges.

[29]  Chunhua Weng,et al.  Semi-Automatically Inducing Semantic Classes of Clinical Research Eligibility Criteria Using UMLS and Hierarchical Clustering. , 2010, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[30]  Oren Etzioni,et al.  The Tradeoffs Between Open and Traditional Relation Extraction , 2008, ACL.

[31]  Martin Gogolla,et al.  Object Constraint Language (OCL): A Definitive Guide , 2012, SFM.

[32]  Theodora A. Varvarigou,et al.  PAT: An Intelligent Authoring Tool for Facilitating Clinical Trial Design , 2014, MIE.

[33]  Boris Motik,et al.  OWL 2: The next step for OWL , 2008, J. Web Semant..

[34]  T A Pryor,et al.  The arden syntax for medical logic modules , 1990, International journal of clinical monitoring and computing.

[35]  Omolola Ogunyemi,et al.  GELLO: An Object-Oriented Query and Expression Language for Clinical Decision Support: AMIA 2003 Open Source Expo , 2003, AMIA.

[36]  Margaret W. Haber,et al.  Controlled Terminology for Clinical Research: A Collaboration between CDISC and NCI Enterprise Vocabulary Services , 2007 .

[37]  Diego Calvanese,et al.  Ontop: Answering SPARQL queries over relational databases , 2016, Semantic Web.

[38]  Kent A. Spackman,et al.  The SNOMED clinical terms development process: refinement and analysis of content , 2002, AMIA.

[39]  Theodora A. Varvarigou,et al.  Meaning Inference of Abbreviations Appearing in Clinical Studies , 2015, SLATE.

[40]  Atta Abbas,et al.  DIAGNOSTIC AND STATISTICAL MANUAL OF MENTAL DISORDERS, FIFTH EDITION , 2013 .

[41]  Rehan Zaidi JavaScript Object Notation (JSON) , 2017 .

[42]  C. McDonald,et al.  LOINC, a universal standard for identifying laboratory observations: a 5-year update. , 2003, Clinical chemistry.

[43]  Joel Lexchin,et al.  The cost of drug development: a systematic review. , 2011, Health policy.

[44]  Christian Bizer,et al.  D2R Server - Publishing Relational Databases on the Semantic Web , 2004 .

[45]  Xiaoying Wu,et al.  EliXR: an approach to eligibility criteria extraction and representation , 2011, J. Am. Medical Informatics Assoc..

[46]  C E Lipscomb,et al.  Medical Subject Headings (MeSH). , 2000, Bulletin of the Medical Library Association.

[47]  R. W. Hansen,et al.  The price of innovation: new estimates of drug development costs. , 2003, Journal of health economics.

[48]  Theodora A. Varvarigou,et al.  A novel query rewriting mechanism for semantically interlinking clinical research with electronic health records , 2012, WIMS '12.

[49]  Olivier Bodenreider,et al.  The Unified Medical Language System (UMLS): integrating biomedical terminology , 2004, Nucleic Acids Res..

[50]  Ivar Jacobson,et al.  Unified Modeling Language Reference Manual, The (2nd Edition) , 2004 .

[51]  Chunhua Weng,et al.  Extracting temporal constraints from clinical research eligibility criteria using conditional random fields. , 2011, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[52]  Nigel Shadbolt,et al.  Resource Description Framework (RDF) , 2009 .

[53]  Theodora Varvarigou,et al.  A Novel Framework for User-Friendly Ontology-Mediated Access to Relational Databases , 2015 .

[54]  S. Tu,et al.  Analysis of Eligibility Criteria Complexity in Clinical Trials , 2010, Summit on translational bioinformatics.

[55]  Chunhua Weng,et al.  Formal representation of eligibility criteria: A literature review , 2010, J. Biomed. Informatics.