Multi-Center Healthcare Data Quality Measurement Model and Assessment Using OMOP CDM

Healthcare data has economic value and is evaluated as such. Therefore, it attracted global attention from observational and clinical studies alike. Recently, the importance of data quality research emerged in healthcare data research. Various studies are being conducted on this topic. In this study, we propose a DQ4HEALTH model that can be applied to healthcare when reviewing existing data quality literature. The model includes 5 dimensions and 415 validation rules. The four evaluation indicators include the net pass rate (NPR), weighted pass rate (WPR), net dimensional pass rate (NDPR), and weighted dimensional pass rate (WDPR). They were used to evaluate the Observational Medical Outcomes Partnership Common Data Model (OMOP CDM) at three medical institutions. These indicators identify differences in data quality between the institutions. The NPRs of the three institutions (A, B, and C) were 96.58%, 90.08%, and 90.87%, respectively, and the WPR was 98.52%, 94.26%, and 94.81%, respectively. In the quality evaluation of the dimensions, the consistency was 70.06% of the total error data. The WDPRs were 98.22%, 94.74%, and 95.05% for institutions A, B, and C, respectively. This study presented indices for comparing quality evaluation models and quality in the healthcare field. Using these indices, medical institutions can evaluate the quality of their data and suggest practical directions for decreasing errors.

[1]  Dibya Jyoti Bora Big Data Analytics in Healthcare: A Critical Analysis , 2019, Big Data Analytics for Intelligent Healthcare Management.

[2]  Patrick B. Ryan,et al.  Multisite Evaluation of a Data Quality Tool for Patient-Level Clinical Data Sets , 2016, EGEMS.

[3]  A. F. Bochner,et al.  Challenges in data quality: the influence of data quality assessments on data availability and completeness in a voluntary medical male circumcision programme in Zimbabwe , 2017, BMJ Open.

[4]  Carlo Batini,et al.  Methodologies for data quality assessment and improvement , 2009, CSUR.

[5]  L. Green,et al.  Limitations of the randomized controlled trial in evaluating population-based health interventions. , 2007, American journal of preventive medicine.

[6]  Jay Daniel,et al.  Data Completeness in Healthcare: A Literature Survey , 2017, Pac. Asia J. Assoc. Inf. Syst..

[7]  Chunhua Weng,et al.  Methods and dimensions of electronic health record data quality assessment: enabling reuse for clinical research , 2013, J. Am. Medical Informatics Assoc..

[8]  Diane M. Strong,et al.  Beyond Accuracy: What Data Quality Means to Data Consumers , 1996, J. Manag. Inf. Syst..

[9]  Richard Y. Wang,et al.  Anchoring data quality dimensions in ontological foundations , 1996, CACM.

[10]  Guillermina Noël,et al.  Improving the quality of healthcare data through information design , 2017 .

[11]  Samuel T. Savitz,et al.  How much can we trust electronic health record data? , 2020, Healthcare.

[12]  Kristine E Lynch,et al.  Incrementally Transforming Electronic Medical Records into the Observational Medical Outcomes Partnership Common Data Model: A Multidimensional Quality Assurance Approach , 2019, Applied Clinical Informatics.

[13]  Ajit Londhe,et al.  Extending Achilles Heel Data Quality Tool with New Rules Informed by Multi-Site Data Quality Comparison , 2019, MedInfo.

[14]  Andrew P. Reimer,et al.  Data quality assessment framework to assess electronic medical record data for use in research , 2016, Int. J. Medical Informatics.

[15]  Luigi Palmieri,et al.  Quality Assessment of Healthcare Databases , 2017 .

[16]  J. Steiner,et al.  A pragmatic framework for single-site and multisite data quality assessment in electronic health record-based clinical research. , 2012, Medical care.

[17]  Domenica Taruscio,et al.  Data Quality in Rare Diseases Registries. , 2017, Advances in experimental medicine and biology.

[18]  Patrick B. Ryan,et al.  A Comparison of Data Quality Assessment Checks in Six Data Sharing Networks , 2017, EGEMS.

[19]  Carlo Batini,et al.  Data Quality: Concepts, Methodologies and Techniques , 2006, Data-Centric Systems and Applications.

[20]  Patrick B. Ryan,et al.  Validation of a common data model for active safety surveillance research , 2012, J. Am. Medical Informatics Assoc..

[21]  Kenneth Sherr,et al.  An assessment of data quality in a multi-site electronic medical record system in Haiti , 2016, Int. J. Medical Informatics.

[22]  Nancy Puttkammer,et al.  The impact of routine data quality assessments on electronic medical record data quality in Kenya , 2018, PloS one.

[23]  Serhan Dagtas,et al.  A Rule-Based Data Quality Assessment System for Electronic Health Record Data , 2020, Applied Clinical Informatics.

[24]  Martijn J. Schuemie,et al.  Conversion and Data Quality Assessment of Electronic Health Record Data at a Korean Tertiary Teaching Hospital to a Common Data Model for Distributed Network Research , 2016, Healthcare informatics research.

[25]  P. Embí,et al.  Toward Reuse of Clinical Data for Research and Quality Improvement: The End of the Beginning? , 2009, Annals of Internal Medicine.

[26]  Jerry Zeyu Gao,et al.  Big Data Validation and Quality Assurance -- Issuses, Challenges, and Needs , 2016, 2016 IEEE Symposium on Service-Oriented System Engineering (SOSE).

[27]  Amardeep Thind,et al.  A basic model for assessing primary health care electronic medical record data quality , 2019, BMC Medical Informatics and Decision Making.

[28]  C. Maier,et al.  Towards Implementation of OMOP in a German University Hospital Consortium , 2018, Applied Clinical Informatics.

[29]  Elizabeth S. Chen,et al.  Quality Informatics: The Convergence of Healthcare Data, Analytics, and Clinical Excellence , 2019, Applied Clinical Informatics.

[30]  Yu-Chuan Li,et al.  Observational Health Data Sciences and Informatics (OHDSI): Opportunities for Observational Researchers , 2015, MedInfo.

[31]  A. Bouguettaya,et al.  Healthcare data warehousing and quality assurance , 2001 .

[32]  S. Bakken,et al.  A Data Quality Assessment Guideline for Electronic Health Record Data Reuse , 2017, EGEMS.

[33]  Steven G. Johnson,et al.  A Harmonized Data Quality Assessment Terminology and Framework for the Secondary Use of Electronic Health Record Data , 2016, EGEMS.

[34]  Shelli L Feder,et al.  Data Quality in Electronic Health Records Research: Quality Domains and Assessment Methods , 2018, Western journal of nursing research.