Evaluation of an automated method to assist with error detection in the ACCORD central laboratory

Background Errors in clinical laboratory data are rare but their potential high cost both in terms of harm to the subject as well as diluted statistical power results in a significant workload for experts, who must review large volumes of data in the search for these errors. Purpose The current research objective is to develop and evaluate a method to assist in detecting potential errors in laboratory data for an interventional clinical trial, such as Action to Control Cardiovascular Risk in Diabetes, where treatment effects may be influenced by errors in the data. Methods Utilizing data from a clinical trial investigating the effect of intensive glycemic control on major cardiovascular disease events, we constructed an algorithm that conducts probabilistic error detection called a ‘Bayesian network’. Using a synthetic error model, errors were introduced into a testing dataset, and the Bayesian network’s performance in identifying those errors was compared to laboratory experts. For each laboratory result we used the Bayesian network to compute the probability, the measured value was erroneous. This probability was then used to compute a receiver operating characteristic (ROC) curve. Three laboratory experts were recruited and took a survey consisting of 200 laboratory results. The task was to evaluate if these results were erroneous or not and to provide a confidence rating on a 6-point subjective probability scale. Results The Bayesian network’s overall area under the ROC curve was calculated to be 0.79, whereas the three laboratory experts had areas under the ROC curve of 0.73, 0.73, and 0.72. Perfect error prediction and random guessing yield a ROC of 1.00 and 0.50, respectively. This difference in performance was statistically significant for all three experts. Human experts were also generally overconfident in their ability to detect errors. Limitations The model is, by design, specific to a novel intervention in a specific diabetic population and, therefore, the specific Bayesian network discussed may not generalize to other interventions and populations. In addition, the study is limited by the small number of expert eligible to complete the survey. Conclusions The results of this study suggest that continuous Bayesian networks, suitably constructed, may serve as an effective tool to assist experts in the review of voluminous laboratory data by flagging unlikely results for more thorough review. Clinical Trials 2010; 7: 380—389. http://ctj.sagepub.com

[1]  David Maxwell Chickering,et al.  Learning Bayesian Networks: The Combination of Knowledge and Statistical Data , 1994, Machine Learning.

[2]  Mario Plebani,et al.  Errors in laboratory medicine. , 2002, Clinical chemistry.

[3]  Ira J. Kalet,et al.  A decision aid for intensity-modulated radiation-therapy plan selection in prostate cancer based on a prognostic Bayesian network and a Markov model , 2009, Artif. Intell. Medicine.

[4]  W. Cushman,et al.  Prevention of cardiovascular disease in persons with type 2 diabetes mellitus: current knowledge and rationale for the Action to Control Cardiovascular Risk in Diabetes (ACCORD) trial. , 2007, The American journal of cardiology.

[5]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[6]  Eugene S Pearlman,et al.  Implications of autoverification for the clinical laboratory. , 2002, Clinical leadership & management review : the journal of CLMA.

[7]  K Akazawa,et al.  Accuracy in the Diagnostic Prediction of Acute Appendicitis Based on the Bayesian Network Model , 2007, Methods of Information in Medicine.

[8]  Richard E. Neapolitan,et al.  Learning Bayesian networks , 2007, KDD '07.

[9]  Holger Fröhlich,et al.  A Bayesian Network View on Nested Effects Models , 2008, EURASIP J. Bioinform. Syst. Biol..

[10]  H. Gerstein,et al.  Glycemia treatment strategies in the Action to Control Cardiovascular Risk in Diabetes (ACCORD) trial. , 2007, The American journal of cardiology.

[11]  Judea Pearl,et al.  Bayesian Networks , 1998, Encyclopedia of Social Network Analysis and Mining. 2nd Ed..

[12]  Melvin Prince,et al.  The Diabetes Prevention Program. Design and methods for a clinical trial in the prevention of type 2 diabetes. , 1999, Diabetes care.

[13]  Afif Masmoudi,et al.  Causal inference in biomolecular pathways using a Bayesian network approach and an Implicit method. , 2008, Journal of theoretical biology.

[14]  D. Goldstein,et al.  Defining the relationship between plasma glucose and HbA(1c): analysis of glucose profiles and HbA(1c) in the Diabetes Control and Complications Trial. , 2002, Diabetes care.

[15]  J. Cutler,et al.  Rationale and design for the blood pressure intervention of the Action to Control Cardiovascular Risk in Diabetes (ACCORD) trial. , 2007, The American journal of cardiology.

[16]  Roger. T. Anderson,et al.  Health-related quality of life and cost-effectiveness components of the Action to Control Cardiovascular Risk in Diabetes (ACCORD) trial: rationale and design. , 2007, The American journal of cardiology.

[17]  Hamilton E. Link,et al.  Discrete dynamic Bayesian network analysis of fMRI data , 2009, Human brain mapping.

[18]  J. Hanley,et al.  A method of comparing the areas under receiver operating characteristic curves derived from the same cases. , 1983, Radiology.

[19]  Finn V. Jensen,et al.  Bayesian Networks and Decision Graphs , 2001, Statistics for Engineering and Information Science.

[20]  Nir Friedman,et al.  The Bayesian Structural EM Algorithm , 1998, UAI.

[21]  D. Goff,et al.  Action to Control Cardiovascular Risk in Diabetes (ACCORD) trial: design and methods. , 2007, The American journal of cardiology.

[22]  Lawrence A Leiter,et al.  Evolution of the lipid trial protocol of the Action to Control Cardiovascular Risk in Diabetes (ACCORD) trial. , 2007, The American journal of cardiology.

[23]  J. Williamson,et al.  Recruitment strategies in the Action to Control Cardiovascular Risk in Diabetes (ACCORD) trial. , 2007, The American journal of cardiology.

[24]  Mario Plebani,et al.  Errors in a stat laboratory: types and frequencies 10 years later. , 2007, Clinical chemistry.