Data Quality in Clinical Research

Every scientist knows that research results are only as good as the data upon which the conclusions were formed. However, most scientists receive no training in methods for achieving, assessing, or controlling the quality of research data—topics central to clinical research informatics. This chapter covers the basics of acquiring or collecting and processing data for research given the available data sources, systems, and people. Data quality dimensions specific to the clinical research context are used, and a framework for data quality practice and planning is developed. Available research is summarized, providing estimates of data quality capability for common clinical research data collection and processing methods. This chapter provides researchers, informaticists, and clinical research data managers basic tools to assure, assess, and control the quality of data for research.

[1]  Jeremy Wyatt Dm Mrcp Acquisition and use of clinical data for audit and research. , 1995 .

[2]  Todd R. Johnson,et al.  Factors Affecting Accuracy of Data Abstracted from Medical Records , 2015, PloS one.

[3]  N C Andreasen,et al.  Effects of errors in a multicenter medical study: preventing misinterpreted data. , 1994, Journal of psychiatric research.

[4]  J. Bartko,et al.  Penny-wise and pound-foolish: the impact of measurement error on sample size requirements in clinical trials , 2000, Biological Psychiatry.

[5]  John Ladley Data Governance: How to Design, Deploy and Sustain an Effective Data Governance Program , 2012 .

[6]  Lucila Ohno-Machado,et al.  Validation of an Automated Safety Surveillance System with Prospective, Randomized Trial Data , 2009, Medical decision making : an international journal of the Society for Medical Decision Making.

[7]  M. Clarke,et al.  Increasing response rates to postal questionnaires: systematic review , 2002, BMJ : British Medical Journal.

[8]  Amy Trentham-Dietz,et al.  Quality of cancer registry data: findings from CDC-NPCR's Breast and Prostate Cancer Data Quality and Patterns of Care Study. , 2011, Journal of registry management.

[9]  Carl F. Pieper,et al.  Quantifying Data Quality for Clinical Trials Using Electronic Data Capture , 2008, PloS one.

[10]  Richard Y. Wang,et al.  Anchoring data quality dimensions in ontological foundations , 1996, CACM.

[11]  Carlo Batini,et al.  Data Quality: Concepts, Methodologies and Techniques , 2006, Data-Centric Systems and Applications.

[12]  F. Ahmed,et al.  Case completeness and data accuracy in the Centers for Disease Control and Prevention's National Program of Cancer Registries , 2007, Cancer.

[13]  Kwan Lee,et al.  Keen eye on core measures. Joint Commission data quality study offers insights into data collection, abstracting processes. , 2003, Journal of AHIMA.

[14]  Meredith Nahm,et al.  What can we learn from a decade of database audits? The Duke Clinical Research Institute experience, 1997—2006 , 2009, Clinical trials.

[15]  S D Stellman The case of the missing eights. An object lesson in data quality assurance. , 1989, American journal of epidemiology.

[16]  Giri Kumar Tayi,et al.  Examining data quality , 1998, CACM.

[17]  J. Steiner,et al.  A pragmatic framework for single-site and multisite data quality assessment in electronic health record-based clinical research. , 2012, Medical care.

[18]  Diane M. Strong,et al.  Beyond Accuracy: What Data Quality Means to Data Consumers , 1996, J. Manag. Inf. Syst..

[19]  A R Feinstein,et al.  The Epidemiology of Cancer Therapy: III. The Management of Imperfect Data , 1969 .

[20]  S S Stevens,et al.  On the Theory of Scales of Measurement. , 1946, Science.

[21]  I. Kohane,et al.  Finding the missing link for big biomedical data. , 2014, JAMA.

[22]  Michael J. Roszkowski,et al.  Believe it or not! longer questionnaires have lower response rates , 1990 .

[23]  Angela Colantonio,et al.  Medical Record Review Conduction Model for Improving Interrater Reliability of Abstracting Medical-Related Information , 2009, Evaluation & the health professions.

[24]  Roger Frost,et al.  International Organization for Standardization (ISO) , 2004 .

[25]  Michael M. Wagner,et al.  Review: Accuracy of Data in Computer-based Patient Records , 1997, J. Am. Medical Informatics Assoc..

[26]  V. Loerzel,et al.  Application of the CuSum Technique to Evaluate Changes in Recruitment Strategies , 2005, Nursing research.

[27]  T. To,et al.  Examining intra-rater and inter-rater response agreement: A medical chart abstraction study of a community-based asthma care program , 2008, BMC medical research methodology.

[28]  Richard W. Kobylinski,et al.  The art and science of chart review. , 2000, The Joint Commission journal on quality improvement.

[29]  Andy Koronios,et al.  IQM-CMM: A Framework For Assessing Organizational Information Quality Management Capability Maturity , 2007, ICIQ.

[30]  Carl J Stepnowsky,et al.  The effect of measurement unreliability on sleep and respiratory variables. , 2004, Sleep.

[31]  J. G. Hollands,et al.  Engineering Psychology and Human Performance , 1984 .

[32]  Rowena Jacobs,et al.  How Robust Are Hospital Ranks Based on Composite Performance Measures? , 2005, Medical care.

[33]  Peter Sandercock,et al.  Sensible approaches for reducing clinical trial costs , 2008, Clinical trials.

[34]  G Svolba,et al.  Statistical quality control in clinical trials. , 1999, Controlled clinical trials.

[35]  K C Stange,et al.  How valid are medical records and patient questionnaires for physician profiling and health services research? A comparison with direct observation of patients visits. , 1998, Medical care.

[36]  Patrick B. Ryan,et al.  A Comparison of Data Quality Assessment Checks in Six Data Sharing Networks , 2017, EGEMS.

[37]  D. Sarfati,et al.  An audit of colon cancer data on the New Zealand Cancer Registry. , 2008, The New Zealand medical journal.

[38]  Charles P. Friedman,et al.  Viewpoint Paper: A "Fundamental Theorem" of Biomedical Informatics , 2009, J. Am. Medical Informatics Assoc..

[39]  N. O’Farrell Letting Them Die—Why HIV/AIDs prevention programmes fail , 2004, Sexually Transmitted Infections.

[40]  Kitty S. Chan,et al.  Review: Electronic Health Records and the Reliability and Validity of Quality Measures: A Review of the Literature , 2010, Medical care research and review : MCRR.

[41]  Hossein Estiri,et al.  DQe-v: A Database-Agnostic Framework for Exploring Variability in Electronic Health Record Data Across Time and Site Location , 2017, EGEMS.

[42]  N J Banks,et al.  Designing medical record abstraction forms. , 1998, International journal for quality in health care : journal of the International Society for Quality in Health Care.

[43]  B. Yawn,et al.  Interrater reliability: completing the methods description in medical records review studies. , 2005, American journal of epidemiology.

[44]  Bruce G. Link,et al.  Impact of measurement error in the study of sexually transmitted infections , 2004, Sexually Transmitted Infections.

[45]  Carl F. Pieper,et al.  Analysis of professional competencies for the clinical research data management profession: implications for training and professional certification , 2017, J. Am. Medical Informatics Assoc..

[46]  Thomas Redman,et al.  Data quality for the information age , 1996 .

[47]  L. A. Falk A GUIDE TO MEDICAL CARE ADMINISTRATION. VOL. II. MEDICAL CARE APPRAISAL-QUALITY AND UTILIZATION , 1970 .

[48]  Chunhua Weng,et al.  Methods and dimensions of electronic health record data quality assessment: enabling reuse for clinical research , 2013, J. Am. Medical Informatics Assoc..

[49]  E. Wong,et al.  Comparing Bloodstream Infection Rates: The Effect of Indicator Specifications in the Evaluation of Processes and Indicators in Infection Control (EPIC) Study , 2006, Infection Control & Hospital Epidemiology.

[50]  K. Thiru,et al.  Systematic review of scope and quality of electronic patient record data in primary care , 2003, BMJ : British Medical Journal.

[51]  K. Dickersin,et al.  Comparison of information obtained by operative note abstraction with that recorded on a standardized data collection form. , 2003, Surgery.

[52]  Meredith Nahm Zozus,et al.  Data management plans: the missing perspective , 2017, J. Biomed. Informatics.

[53]  David F.M. Brown,et al.  The accuracy and completeness of data collected by prospective and retrospective methods. , 2005, Academic emergency medicine : official journal of the Society for Academic Emergency Medicine.

[54]  Meredith Zozus The Data Book: Collection and Management of Research Data , 2017 .

[55]  L S Freedman,et al.  The impact of dietary measurement error on planning sample size required in a cohort study. , 1990, American journal of epidemiology.

[56]  Christina Pagel,et al.  Exploring potential consequences on mortality estimates of errors in clinical databases , 2009 .

[57]  George Hripcsak,et al.  Defining and measuring completeness of electronic health records for secondary use , 2013, J. Biomed. Informatics.

[58]  Nicolette de Keizer,et al.  Model Formulation: Defining and Improving Data Quality in Medical Registries: A Literature Review, Case Study, and Generic Framework , 2002, J. Am. Medical Informatics Assoc..

[59]  M. Reeves,et al.  Inter-rater reliability of data elements from a prototype of the Paul Coverdell National Acute Stroke Registry , 2008, BMC neurology.

[60]  Eric J. Topol,et al.  The emerging field of mobile health , 2015, Science Translational Medicine.

[61]  Jerod M Loeb,et al.  Assessing the reliability of standardized performance indicators. , 2006, International journal for quality in health care : journal of the International Society for Quality in Health Care.

[62]  Eric L Eisenstein,et al.  Reducing the costs of phase III cardiovascular clinical trials. , 2005, American heart journal.

[63]  Danette McGilvray,et al.  Executing Data Quality Projects: Ten Steps to Quality Data and Trusted Information TM , 2008 .

[64]  C G Cayten,et al.  Interobserver and intraobserver reliability in the collection of emergency medical services data. , 1980, Health services research.

[65]  Ronald W. Helms Data Quality Issues in Electronic Data Capture , 2001 .

[66]  Patrick B. Ryan,et al.  Transparent Reporting of Data Quality in Distributed Data Networks , 2015, EGEMS.

[67]  Calum MacAulay,et al.  Quality Assurance System Using Statistical Process Control: An Implementation for Image Cytometry , 2004, Cellular oncology : the official journal of the International Society for Cellular Oncology.

[68]  J P Mullooly,et al.  The effects of data entry error: an analysis of partial verification. , 1990, Computers and biomedical research, an international journal.

[69]  D. Fergusson,et al.  Ensuring high accuracy of data abstracted from patient charts: the use of a standardized medical record as a training tool. , 2005, Journal of clinical epidemiology.

[70]  K. Liu,et al.  Measurement error and its impact on partial correlation and multiple linear regression analyses. , 1988, American journal of epidemiology.

[71]  Malcolm L. Schuyl,et al.  A Review of the Source Document Verification Process in Clinical Trials , 1995 .

[72]  W. Edwards Deming,et al.  On Sample Inspection in the Processing of Census Returns , 1941 .

[73]  D. Goldhill,et al.  APACHE II, data accuracy and outcome prediction , 1998, Anaesthesia.

[74]  Douglas G Altman,et al.  Ensuring trial validity by data quality assurance and diversification of monitoring methods , 2008, Clinical trials.

[75]  Michael J. Beatty,et al.  A Test of the Cognitive Load Hypothesis: Investigating the Impact of Number of Nonverbal Cues Coded and Length of Coding Session on Observer Accuracy , 2007 .

[76]  Swapaka Listya Trusthi Analisis Capability Maturity Model Integration for Development (CMMI-DEV) dengan IDEAL dalam Proses Pengembangan Human Resources Management Information System (HRMIS) Telkom University , 2014 .