Developing a systematic approach to assessing data quality in secondary use of clinical data based on intended use

Secondary use of electronic health record (EHR) data for research requires that the data are fit for use. Data quality (DQ) frameworks have traditionally focused on structural conformance and completeness of clinical data extracted from source systems. In this paper, we propose a framework for evaluating semantic DQ that will allow researchers to evaluate fitness for use prior to analyses.

[1]  Steven G. Johnson,et al.  A Data Quality Ontology for the Secondary Use of EHR Data , 2015, AMIA.

[2]  Shawn N Murphy,et al.  Semi-supervised Encoding for Outlier Detection in Clinical Observation Data , 2018, bioRxiv.

[3]  Tony Norris,et al.  The strategic management of data quality in healthcare , 2008, Health Informatics J..

[4]  David M. Hartley,et al.  Use of EHR-Based Pediatric Quality Measures: Views of Health System Leaders and Parents , 2020, American journal of medical quality : the official journal of the American College of Medical Quality.

[5]  K. Aleksandrova,et al.  Development and reliability assessment of a new quality appraisal tool for cross-sectional studies using biomarker data (BIOCROSS) , 2018, BMC Medical Research Methodology.

[6]  Serhan Dagtas,et al.  Rule-Based Data Quality Assessment and Monitoring System in Healthcare Facilities , 2019, ITCH.

[7]  Victoria Charnock,et al.  Electronic healthcare records and data quality , 2019, Health information and libraries journal.

[8]  Elmer V. Bernstam,et al.  A federated EHR network data completeness tracking system , 2019, J. Am. Medical Informatics Assoc..

[9]  Keith Marsolo,et al.  A longitudinal analysis of data quality in a large pediatric data research network , 2017, J. Am. Medical Informatics Assoc..

[10]  Peter J. Gates,et al.  Preventable Adverse Drug Events Among Inpatients: A Systematic Review , 2018, Pediatrics.

[11]  Helen Coelho,et al.  iCCM data quality: an approach to assessing iCCM reporting systems and data quality in 5 African countries , 2019, Journal of global health.

[12]  J J Cimino,et al.  The False Security of Blind Dates , 2012, Applied Clinical Informatics.

[13]  Michael Marschollek,et al.  Clinical Information Model Based Data Quality Checks: Theory and Example , 2019, EFMI-STC.

[14]  Giri N. K. Rangan,et al.  Committee , 2019, 2019 International Conference on Radar, Antenna, Microwave, Electronics, and Telecommunications (ICRAMET).

[15]  Haleh Ayatollahi,et al.  A Review of Data Quality Assessment in Emergency Medical Services , 2018, The open medical informatics journal.

[16]  Patrick B. Ryan,et al.  Transparent Reporting of Data Quality in Distributed Data Networks , 2015, EGEMS.

[17]  S. Bakken,et al.  A Data Quality Assessment Guideline for Electronic Health Record Data Reuse , 2017, EGEMS.

[18]  Keith Marsolo,et al.  PEDSnet: a National Pediatric Learning Health System , 2014, J. Am. Medical Informatics Assoc..

[19]  George Hripcsak,et al.  Caveats for the use of operational electronic health record data in comparative effectiveness research. , 2013, Medical care.

[20]  Steven G. Johnson,et al.  A Harmonized Data Quality Assessment Terminology and Framework for the Secondary Use of Electronic Health Record Data , 2016, EGEMS.

[21]  Matthias Eikermann,et al.  Codifying healthcare – big data and the issue of misclassification , 2015, BMC Anesthesiology.

[22]  Richard E Gliklich,et al.  Assessing Real-World Data Quality: The Application of Patient Registry Quality Criteria to Real-World Data and Real-World Evidence. , 2020, Therapeutic innovation & regulatory science.

[23]  Jyotishman Pathak,et al.  A Framework for Data Quality Assessment in Clinical Research Datasets , 2017, AMIA.

[24]  Anita Burgun-Parenthoine,et al.  The Need of an Open Data Quality Policy: The Case of the "Transparency - Health" Database in the Prevention of Conflict of Interest , 2018, MIE.

[25]  Juliane S. Reynolds,et al.  Early Antibiotic Exposure and Weight Outcomes in Young Children , 2018, Pediatrics.

[26]  Amanda Nolen,et al.  Thematic analysis of qualitative research data: Is it as easy as it sounds? , 2018, Currents in pharmacy teaching & learning.

[27]  Reesa Laws,et al.  Data Quality Assessment and Multi-Organizational Reporting: Tools to Enhance Network Knowledge , 2019, EGEMS.

[28]  John H. Zhang,et al.  Data Standardization and Quality Management , 2017, Translational Stroke Research.

[29]  Jerry Fowler,et al.  System for Quality‐Assured Data Analysis: Flexible, reproducible scientific workflows , 2019, Genetic epidemiology.

[30]  Hossein Estiri,et al.  DQe-v: A Database-Agnostic Framework for Exploring Variability in Electronic Health Record Data Across Time and Site Location , 2017, EGEMS.

[31]  George Hripcsak,et al.  Bias Associated with Mining Electronic Health Records , 2011, Journal of biomedical discovery and collaboration.

[32]  Dimitrios I Fotiadis,et al.  Enhancing medical data quality through data curation: a case study in primary Sjögren's syndrome. , 2019, Clinical and experimental rheumatology.

[33]  Hairong Yu,et al.  Structured data quality reports to improve EHR data quality , 2015, Int. J. Medical Informatics.

[34]  Lucy Savitz,et al.  Challenges in Using Electronic Health Record Data for CER: Experience of 4 Learning Organizations and Solutions Applied , 2013, Medical care.

[35]  Amardeep Thind,et al.  A basic model for assessing primary health care electronic medical record data quality , 2019, BMC Medical Informatics and Decision Making.

[36]  Lisiane Pruinelli,et al.  A Framework for Visualizing Data Quality for Predictive Models and Clinical Quality Measures. , 2019, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[37]  Dimitrios I. Fotiadis,et al.  Medical data quality assessment: On the development of an automated framework for medical data curation , 2019, Comput. Biol. Medicine.

[38]  Patrick B. Ryan,et al.  Multisite Evaluation of a Data Quality Tool for Patient-Level Clinical Data Sets , 2016, EGEMS.

[39]  Shelli L Feder,et al.  Data Quality in Electronic Health Records Research: Quality Domains and Assessment Methods , 2018, Western journal of nursing research.

[40]  Lauren Houston,et al.  Assessing data quality and the variability of source data verification auditing methods in clinical research settings , 2018, J. Biomed. Informatics.

[41]  C. Forrest,et al.  Using Electronic Health Record Data to Rapidly Identify Children with Glomerular Disease for Clinical Research. , 2019, Journal of the American Society of Nephrology : JASN.

[42]  Ping Yu,et al.  Defining and Developing a Generic Framework for Monitoring Data Quality in Clinical Research , 2018, AMIA.

[43]  L Charles Bailey,et al.  PCORnet Antibiotics and Childhood Growth Study: Process for Cohort Creation and Cohort Description. , 2018, Academic pediatrics.

[44]  Shuang Wang,et al.  Hidden in plain sight: bias towards sick patients when sampling patients with sufficient electronic health record data for research , 2014, BMC Medical Informatics and Decision Making.

[45]  Lina Palmér,et al.  Qualitative thematic analysis based on descriptive phenomenology , 2019, Nursing open.

[46]  Chunhua Weng,et al.  A Data Element-Function Conceptual Model for Data Quality Checks , 2019, EGEMS.

[47]  Naila A. Shaheen,et al.  Reducing defects in the datasets of clinical research studies: conformance with data quality metrics , 2019, BMC Medical Research Methodology.

[48]  V. Braun,et al.  Using thematic analysis in psychology , 2006 .

[49]  Jeffrey G. Klann,et al.  A clustering approach for detecting implausible observation values in electronic health records data , 2019, bioRxiv.

[50]  Andoni Beristain,et al.  TAQIH, a tool for tabular data quality assessment and improvement in the context of health data , 2019, Comput. Methods Programs Biomed..

[51]  Keith Marsolo,et al.  Evaluating Foundational Data Quality in the National Patient-Centered Clinical Research Network (PCORnet®) , 2018, EGEMS.