A Rule-Based Data Quality Assessment System for Electronic Health Record Data

OBJECTIVE  Rule-based data quality assessment in health care facilities was explored through compilation, implementation, and evaluation of 63,397 data quality rules in a single-center case study to assess the ability of rules-based data quality assessment to identify data errors of importance to physicians and system owners. METHODS  We applied a design science framework to design, demonstrate, test, and evaluate a scalable framework with which data quality rules can be managed and used in health care facilities for data quality assessment and monitoring. RESULTS  We identified 63,397 rules partitioned into 28 logic templates. A total of 819,683 discrepancies were identified by 4.5% of the rules. Nine out of 11 participating clinical and operational leaders indicated that the rules identified data quality problems and articulated next steps that they wanted to take based on the reported information. DISCUSSION  The combined rule template and knowledge table approach makes governance and maintenance of otherwise large rule sets manageable. Identified challenges to rule-based data quality monitoring included the lack of curated and maintained knowledge sources relevant to data error detection and lack of organizational resources to support clinical and operational leaders with investigation and characterization of data errors and pursuit of corrective and preventative actions. Limitations of our study included implementation within a single center and dependence of the results on the implemented rule set. CONCLUSION  This study demonstrates a scalable framework (up to 63,397 rules) with which data quality rules can be implemented and managed in health care facilities to identify data errors. The data quality problems identified at the implementation site were important enough to prompt action requests from clinical and operational leaders.

[1]  J. Steiner,et al.  A pragmatic framework for single-site and multisite data quality assessment in electronic health record-based clinical research. , 2012, Medical care.

[2]  Diane M. Strong,et al.  Beyond Accuracy: What Data Quality Means to Data Consumers , 1996, J. Manag. Inf. Syst..

[3]  Sabine Koch,et al.  Exploring Vital Sign Data Quality in Electronic Health Records with Focus on Emergency Care Warning Scores , 2017, Applied Clinical Informatics.

[4]  D. L. Rossmann,et al.  Data entry errors in an on-line operation. , 1981, Computers and biomedical research, an international journal.

[5]  Dimitrios I. Fotiadis,et al.  Medical data quality assessment: On the development of an automated framework for medical data curation , 2019, Comput. Biol. Medicine.

[6]  Stuart Speedie,et al.  Quantifying the Effect of Data Quality on the Validity of an eMeasure , 2017, Applied Clinical Informatics.

[7]  J W Bellville,et al.  The use of computers in clinical trials. , 1967, British journal of anaesthesia.

[8]  I K Crombie,et al.  An investigation of data entry methods with a personal computer. , 1986, Computers and biomedical research, an international journal.

[9]  Andrew P. Reimer,et al.  Data quality assessment framework to assess electronic medical record data for use in research , 2016, Int. J. Medical Informatics.

[10]  P. Grambsch,et al.  Forms control and error detection procedures used at the Coordinating Center of the Multiple Risk Factor Intervention Trial (MRFIT). , 1986, Controlled clinical trials.

[11]  Philip J. B. Brown,et al.  Data quality probes - exploiting and improving the quality of electronic patient record data and patient care , 2002, Int. J. Medical Informatics.

[12]  Maria W. G. Nijhuis-van der Sanden,et al.  Data extraction from electronic health records (EHRs) for quality measurement of the physical therapy process: comparison between EHR data and survey data , 2016, BMC Medical Informatics and Decision Making.

[13]  Ronald Cornet,et al.  Impact of Electronic versus Paper-Based Recording before EHR Implementation on Health Care Professionals' Perceptions of EHR Use, Data Quality, and Data Reuse , 2019, Applied Clinical Informatics.

[14]  Rowena J Dolor,et al.  The MURDOCK Study: a long-term initiative for disease reclassification through advanced biomarker discovery and integration with electronic health records. , 2012, American journal of translational research.

[15]  G. Knatterud,et al.  Methods of quality control and of continuous audit procedures for controlled clinical trials. , 1981, Controlled clinical trials.

[16]  S B Hulley,et al.  Community surveillance of cardiovascular diseases in the Stanford Five-City Project. Methods and initial experience. , 1986, American journal of epidemiology.

[17]  S Hulley,et al.  Data quality in a distributed data processing system: the SHEP Pilot Study. , 1986, Controlled clinical trials.

[18]  Ping Chen,et al.  C-A1-02: Developing a Structure for Programmatic Quality Assurance Checks on the Virtual Data Warehouse , 2011, Clinical Medicine & Research.

[19]  M J Gillespie,et al.  Data management for a large collaborative clinical trial (CASS: Coronary Artery Surgery Study). , 1978, Computers and biomedical research, an international journal.

[20]  Shelli L Feder,et al.  Data Quality in Electronic Health Records Research: Quality Domains and Assessment Methods , 2018, Western journal of nursing research.

[21]  Lauren Houston,et al.  Exploring Data Quality Management within Clinical Trials , 2018, Applied Clinical Informatics.