A multi-technique approach to bridge electronic case report form design and data standard adoption

BACKGROUND AND OBJECTIVE The importance of data standards when integrating clinical research data has been recognized. The common data element (CDE) is a consensus-based data element for data harmonization and sharing between clinical researchers, it can support data standards adoption and mapping. However, the lack of a suitable methodology has become a barrier to data standard adoption. Our aim was to demonstrate an approach that allowed clinical researchers to design electronic case report forms (eCRFs) that complied with the data standard. METHODS We used a multi-technique approach, including information retrieval, natural language processing and an ontology-based knowledgebase to facilitate data standard adoption using the eCRF design. The approach took research questions as query texts with the aim of retrieving and associating relevant CDEs with the research questions. RESULTS The approach was implemented using a CDE-based eCRF builder, which was evaluated using CDE- related questions from CRFs used in the Parkinson Disease Biomarker Program, as well as CDE-unrelated questions from a technique support website. Our approach had a precision of 0.84, a recall of 0.80, a F-measure of 0.82 and an error of 0.31. Using the 303 testing CDE-related questions, our approach responded and provided suggested CDEs for 88.8% (269/303) of the study questions with a 90.3% accuracy (243/269). The reason for any missed and failed responses was also analyzed. CONCLUSION This study demonstrates an approach that helps to cross the barrier that inhibits data standard adoption in eCRF building and our evaluation reveals the approach has satisfactory performance. Our CDE-based form builder provides an alternative perspective regarding data standard compliant eCRF design.

[1]  H. Calkins,et al.  ACC/AHA/HRS 2006 key data elements and definitions for electrophysiological studies and procedures: a report of the American College of Cardiology/American Heart Association Task Force on Clinical Data Standards (ACC/AHA/HRS Writing Committee to Develop Data Standards on Electrophysiology). , 2006, Journal of the American College of Cardiology.

[2]  Sheng Yu,et al.  An Investigation of Semantic Links to Archetypes in an External Clinical Terminology through the Construction of Terminological "Shadows" , 2010 .

[3]  Cui Tao,et al.  Towards Semantic-Web Based Representation and Harmonization of Standard Meta-data Models for Clinical Studies , 2011, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[4]  C Ohmann,et al.  Future Developments of Medical Informatics from the Viewpoint of Networked Clinical Research , 2009, Methods of Information in Medicine.

[5]  Prakash M. Nadkarni,et al.  Data standards for clinical research data collection forms: current status and challenges , 2011, J. Am. Medical Informatics Assoc..

[6]  G. Chowdhury,et al.  Introduction to Modern Information Retrieval, 3rd Edition , 2010 .

[7]  Gilad J. Kuperman,et al.  Application of information technology: Developing data content specifications for the Nationwide Health Information Network Trial Implementations , 2010, J. Am. Medical Informatics Assoc..

[8]  Asuman Dogac,et al.  Providing Semantic Interoperability Between Clinical Care and Clinical Research Domains , 2013, IEEE Journal of Biomedical and Health Informatics.

[9]  Gilberto Fragoso,et al.  caCORE version 3: Implementation of a model driven, service-oriented architecture for semantic interoperability , 2008, J. Biomed. Informatics.

[10]  Robert Stevens,et al.  Putting OWL in Order: Patterns for Sequences in OWL , 2006, OWLED.

[11]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[12]  Bridget M. Kuehn Youth Suicide Screening , 2013 .

[13]  Winston A Hide,et al.  Big data: The future of biocuration , 2008, Nature.

[14]  Mor Peleg,et al.  The Ontology of Clinical Research (OCRe): An informatics foundation for the science of clinical research , 2014, J. Biomed. Informatics.

[15]  Ram Chilukuri,et al.  Common Data Element (CDE) Management and Deployment in Clinical Trials , 2003, AMIA.

[16]  Cui Tao,et al.  A semantic-web oriented representation of the clinical element model for secondary use of electronic health records data , 2013, J. Am. Medical Informatics Assoc..

[17]  Hua Min,et al.  Sharing behavioral data through a grid infrastructure using data standards , 2014, J. Am. Medical Informatics Assoc..

[18]  Melissa Haendel,et al.  A sea of standards for omics data: sink or swim? , 2013, J. Am. Medical Informatics Assoc..

[19]  Zuhair Bandar,et al.  Sentence similarity based on semantic nets and corpus statistics , 2006, IEEE Transactions on Knowledge and Data Engineering.

[20]  E. Prud hommeaux,et al.  SPARQL query language for RDF , 2011 .

[21]  Philip R. O. Payne,et al.  TRIAD: The Translational Research Informatics and Data Management Grid , 2011, Applied Clinical Informatics.

[22]  Christopher G. Chute,et al.  A Collaborative Framework for Representation and Harmonization of Clinical Study Data Elements Using Semantic MediaWiki , 2010, Summit on translational bioinformatics.

[23]  C A Brandt,et al.  Approaches and Informatics Tools to Assist in the Integration of Similar Clinical Research Questionnaires , 2004, Methods of Information in Medicine.

[24]  Kathryn Stone NerveCenter: NINDS common data element project: A long‐awaited break through in streamlining trials , 2010, Annals of neurology.

[25]  Shigemi Matsumoto,et al.  A Data Capture System for Outcomes Studies that Integrates with Electronic Health Records: Development and Potential Uses , 2008, Journal of Medical Systems.

[26]  Joel H. Saltz,et al.  caGrid: design and implementation of the core architecture of the cancer biomedical informatics grid , 2006, Bioinform..

[27]  Mária Bieliková,et al.  From Ambiguous Words to Key-Concept Extraction , 2013, 2013 24th International Workshop on Database and Expert Systems Applications.

[28]  Wen Nie,et al.  A web-based, meta-data drive, clinical research platform for managing multiple clinical research studies. , 2007, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[29]  Bridget M. Kuehn Parkinson Biomarker Program , 2013 .

[30]  Gobinda G. Chowdhury,et al.  Introduction to Modern Information Retrieval , 1999 .

[31]  Jesse James Garrett Ajax: A New Approach to Web Applications , 2007 .