Analyzing the heterogeneity and complexity of Electronic Health Record oriented phenotyping algorithms.

The need for formal representations of eligibility criteria for clinical trials - and for phenotyping more generally - has been recognized for some time. Indeed, the availability of a formal computable representation that adequately reflects the types of data and logic evidenced in trial designs is a prerequisite for the automatic identification of study-eligible patients from Electronic Health Records. As part of the wider process of representation development, this paper reports on an analysis of fourteen Electronic Health Record oriented phenotyping algorithms (developed as part of the eMERGE project) in terms of their constituent data elements, types of logic used and temporal characteristics. We discovered that the majority of eMERGE algorithms analyzed include complex, nested boolean logic and negation, with several dependent on cardinality constraints and complex temporal logic. Insights gained from the study will be used to augment the CDISC Protocol Representation Model.

[1]  Mor Peleg,et al.  A practical method for transforming free-text eligibility criteria into computable criteria , 2011, J. Biomed. Informatics.

[2]  Joshua C Denny,et al.  Modulators of normal electrocardiographic intervals identified in a large electronic medical record. , 2011, Heart rhythm.

[3]  Wendy A. Wolf,et al.  The eMERGE Network: A consortium of biorepositories linked to electronic medical records data for conducting genomic studies , 2011, BMC Medical Genomics.

[4]  Marek J. Sergot,et al.  Computer Support for Protocol-Based Treatment of Cancer , 1996, J. Log. Program..

[5]  S W Tu,et al.  AIDS2: a decision-support tool for decreasing physicians' uncertainty regarding patient eligibility for HIV treatment protocols. , 1993, Proceedings. Symposium on Computer Applications in Medical Care.

[6]  Jin Fan,et al.  Leveraging informatics for genetic studies: use of the electronic medical record to enable a genome-wide association study of peripheral arterial disease , 2010, J. Am. Medical Informatics Assoc..

[7]  Omolola Ogunyemi,et al.  GLIF3: a representation format for sharable computer-interpretable clinical practice guidelines , 2004, J. Biomed. Informatics.

[8]  John Powell,et al.  Paying clinicians to join clinical trials: a review of guidelines and interview study of trialists , 2009, Trials.

[9]  Karthik Gomadam,et al.  TrialX: Using semantic technologies to match patients to relevant clinical trials based on their Personal Health Records , 2010, J. Web Semant..

[10]  Chunhua Weng,et al.  Formal representation of eligibility criteria: A literature review , 2010, J. Biomed. Informatics.

[11]  L. Anthony,et al.  AntConc: design and development of a freeware corpus analysis toolkit for the technical writing classroom , 2005, IPCC 2005. Proceedings. International Professional Communication Conference, 2005..

[12]  Omolola Ogunyemi,et al.  Description and Status Update on GELLO: a Proposed Standardized Object-Oriented Expression Language for Clinical Decision Support , 2004, MedInfo.

[13]  Christopher G. Chute,et al.  A Genome-Wide Association Study of Red Blood Cell Traits Using the Electronic Medical Record , 2010, PloS one.

[14]  Peter Tarczy-Hornoch,et al.  Natural language processing of clinical trial announcements: exploratory-study of building an automated screening application. , 2008, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[15]  S. Tu,et al.  Analysis of Eligibility Criteria Complexity in Clinical Trials , 2010, Summit on translational bioinformatics.