Bayesian inferences for receiver operating characteristic curves in the absence of a gold standard

Sensitivity and specificity are used to characterize the accuracy of a diagnostic test. Receiver operating characteristic (ROC) analysis can be used more generally to plot the sensitivity versus (1-specificity) over all possible cutoff points. We develop an ROC analysis that can be applied to diagnostic tests with and without a gold standard. Moreover, the method can be applied to multiple correlated diagnostic tests that are used on the same individual. Simulation studies were performed to assess the discrimination ability of the no-gold-standard method compared with the situation where a gold standard exists. We used the area under the ROC curve (AUC) to quantify the diagnostic accuracy of tests and the difference between AUCs to compare their accuracies. In particular, we can estimate the prevalence of disease/infection under the no-gold-standard method. The method we proposed works well in the absence of a gold standard for correlated test data. Correlation affected the width of posterior probability intervals for these differences. The proposed method was used to analyze ELISA test scores for Johne’s disease in dairy cattle.

[1]  D. Cox,et al.  An Analysis of Transformations , 1964 .

[2]  Wesley O. Johnson,et al.  Correlation‐adjusted estimation of sensitivity and specificity of two diagnostic tests , 2003 .

[3]  R. F. Wagner,et al.  The problem of ROC analysis without truth: the EM algorithm and the information matrix , 2000, Medical Imaging.

[4]  W O Johnson,et al.  Estimation of sensitivity and specificity of diagnostic tests and disease prevalence when the true disease state is unknown. , 2000, Preventive veterinary medicine.

[5]  K. Zou,et al.  Comparison of correlated receiver operating characteristic curves derived from repeated diagnostic test data. , 2001, Academic radiology.

[6]  M. Zweig,et al.  Receiver-operating characteristic (ROC) plots: a fundamental evaluation tool in clinical medicine. , 1993, Clinical chemistry.

[7]  Margaret S. Pepe,et al.  A regression modelling framework for receiver operating characteristic curves in medical diagnostic testing , 1997 .

[8]  M. Greiner,et al.  Principles and practical application of the receiver-operating characteristic analysis for diagnostic tests. , 2000, Preventive veterinary medicine.

[9]  M. Pepe An Interpretation for the ROC Curve and Inference Using GLM Procedures , 2000, Biometrics.

[10]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Adrian F. M. Smith,et al.  Sampling-Based Approaches to Calculating Marginal Densities , 1990 .

[12]  Andrew Gelman,et al.  General methods for monitoring convergence of iterative simulations , 1998 .

[13]  L. Tierney Markov Chains for Exploring Posterior Distributions , 1994 .

[14]  M S Pepe,et al.  Three approaches to regression analysis of receiver operating characteristic curves for continuous test results. , 1998, Biometrics.

[15]  L. Joseph,et al.  Bayesian Approaches to Modeling the Conditional Dependence Between Multiple Diagnostic Tests , 2001, Biometrics.

[16]  E. DeLong,et al.  Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. , 1988, Biometrics.