What can we learn from a decade of database audits? The Duke Clinical Research Institute experience, 1997—2006

Background Despite a pressing and well-documented need for better sharing of information on clinical trials data quality assurance methods, many research organizations remain reluctant to publish descriptions of and results from their internal auditing and quality assessment methods. Purpose We present findings from a review of a decade of internal data quality audits performed at the Duke Clinical Research Institute, a large academic research organization that conducts data management for a diverse array of clinical studies, both academic and industry-sponsored. In so doing, we hope to stimulate discussions that could benefit the wider clinical research enterprise by providing insight into methods of optimizing data collection and cleaning, ultimately helping patients and furthering essential research. Methods We present our audit methodologies, including sampling methods, audit logistics, sample sizes, counting rules used for error rate calculations, and characteristics of audited trials. We also present database error rates as computed according to two analytical methods, which we address in detail, and discuss the advantages and drawbacks of two auditing methods used during this 10-year period. Results Our review of the DCRI audit program indicates that higher data quality may be achieved from a series of small audits throughout the trial rather than through a single large database audit at database lock. We found that error rates trended upward from year to year in the period characterized by traditional audits performed at database lock (1997—2000), but consistently trended downward after periodic statistical process control type audits were instituted (2001—2006). These increases in data quality were also associated with cost savings in auditing, estimated at 1000 h per year, or the efforts of one-half of a full time equivalent (FTE). Limitations Our findings are drawn from retrospective analyses and are not the result of controlled experiments, and may therefore be subject to unanticipated confounding. In addition, the scope and type of audits we examine here are specific to our institution, and our results may not be broadly generalizable. Conclusions Use of statistical process control methodologies may afford advantages over more traditional auditing methods, and further research will be necessary to confirm the reliability and usability of such techniques. We believe that open and candid discussion of data quality assurance issues among academic and clinical research organizations will ultimately benefit the entire research community in the coming era of increased data sharing and re-use. Clinical Trials 2009; 6: 141—150. http://ctj.sagepub.com

[1]  S L George,et al.  Guidelines for quality assurance in multicenter trials: a position paper. , 1998, Controlled clinical trials.

[2]  Jonathan R Davis,et al.  Assuring Data Quality and Validity in Clinical Trials for Regulatory Decision Making , 1999 .

[3]  E van der Putten,et al.  A pilot study on the quality of data management in a cancer clinical trial. , 1987, Controlled clinical trials.

[4]  L. Crocker,et al.  Introduction to Classical and Modern Test Theory , 1986 .

[5]  S Day,et al.  Double data entry: what value, what price? , 1998, Controlled clinical trials.

[6]  D Spiegelman,et al.  Measurement error correction for logistic regression models with an "alloyed gold standard". , 1997, American journal of epidemiology.

[7]  B A Blumenstein Verifying keyed medical research data. , 1993, Statistics in medicine.

[8]  Wayne A. Fuller,et al.  Measurement Error Models , 1988 .

[9]  A. G. Duchene,et al.  An examination of the efficiency of some quality assurance methods commonly employed in clinical trials. , 1990, Statistics in medicine.

[10]  Ronald W. Helms Data Quality Issues in Electronic Data Capture , 2001 .

[11]  J. Verweij,et al.  Chemotherapy administration and data collection in an EORTC collaborative group--can we trust the results? , 1993, European journal of cancer.

[12]  M Thoresen,et al.  A simulation study of measurement error correction methods in logistic regression. , 2000, Biometrics.

[13]  J P Mullooly,et al.  The effects of data entry error: an analysis of partial verification. , 1990, Computers and biomedical research, an international journal.

[14]  N C Andreasen,et al.  Effects of errors in a multicenter medical study: preventing misinterpreted data. , 1994, Journal of psychiatric research.

[15]  B Rosner,et al.  Correction of logistic regression relative risk estimates and confidence intervals for measurement error: the case of multiple covariates measured with error. , 1990, American journal of epidemiology.

[16]  A. Shamoo,et al.  Principles of research data audit , 1991 .

[17]  M. R. Novick,et al.  Statistical Theories of Mental Test Scores. , 1971 .

[18]  M. Hughes,et al.  Regression dilution in the proportional hazards model. , 1993, Biometrics.

[19]  G. Knatterud,et al.  Methods of quality control and of continuous audit procedures for controlled clinical trials. , 1981, Controlled clinical trials.

[20]  Eric L Eisenstein,et al.  Reducing the costs of phase III cardiovascular clinical trials. , 2005, American heart journal.

[21]  S D Stellman The case of the missing eights. An object lesson in data quality assurance. , 1989, American journal of epidemiology.

[22]  D. Ruppert,et al.  Measurement Error in Nonlinear Models , 1995 .

[23]  D. A. Kenny,et al.  The moderator-mediator variable distinction in social psychological research: conceptual, strategic, and statistical considerations. , 1986, Journal of personality and social psychology.

[24]  J. Gassman,et al.  Data quality assurance, monitoring, and reporting. , 1995, Controlled clinical trials.

[25]  Damian McEntegart,et al.  Checks of Case Record Forms versus the Database for Efficacy Variables when Validation Programs Exist , 1999 .

[26]  S Hulley,et al.  Data quality in a distributed data processing system: the SHEP Pilot Study. , 1986, Controlled clinical trials.

[27]  E. B. Godsey Short Run SPC , 1992 .

[28]  M. Christian,et al.  The National Cancer Institute audit of the National Surgical Adjuvant Breast and Bowel Project Protocol B-06. , 1995, The New England journal of medicine.