Advances in statistical methodology for the evaluation of diagnostic and laboratory tests.

The ROC plot is a useful tool for evaluating how well a medical test separates two populations. For a two-state decision rule based on such a test, the empirical ROC plot is the graph of all observed (1 - specificity, sensitivity) pairs, and each point on it corresponds to a 2 × 2 contingency table. Two non-parametric statistics can be read directly from this plot: the (scaled) Mann-Whitney statistic, as the area under the plot, and the Kolmogorov-Smirnov statistic, as the maximum vertical distance between the plot and the diagonal. Local non-parametric confidence interval procedures for the theoretical ROC curve are briefly reviewed. For continuous data, two new simultaneous confidence regions for the ROC curve are presented, one based on Kolmogorov-Smirnov confidence bands for distribution functions and the other on bootstrapping. Two different tests applied to the same patients can be compared on the ROC scale. For continuous data, an important problem is the comparison of two ROC plots arising from two correlated diagnostic tests on each patient; here a sup norm is used, a metric that can detect differences that the ROC area cannot. The distribution of a statistic based on this norm is studied using the bootstrap. A biomedical example illustrates the methods.
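The quantities named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the convention that larger test values indicate disease, the 101-point grid, and the paired resampling scheme (drawing diseased and non-diseased patients separately and applying the same indices to both tests) are all assumptions made for illustration.

```python
import numpy as np

def empirical_roc(neg, pos):
    """(1 - specificity, sensitivity) pairs for the rule 'positive if value > c'."""
    neg, pos = np.asarray(neg, float), np.asarray(pos, float)
    # Every observed value is a candidate threshold; -inf adds the point (1, 1).
    cuts = np.concatenate([[-np.inf], np.unique(np.concatenate([neg, pos]))])
    fpr = np.array([(neg > c).mean() for c in cuts])
    tpr = np.array([(pos > c).mean() for c in cuts])
    return fpr, tpr

def mann_whitney_area(neg, pos):
    """Scaled Mann-Whitney statistic: P(pos > neg) + 0.5 * P(pos == neg)."""
    diff = np.asarray(pos, float)[:, None] - np.asarray(neg, float)[None, :]
    return ((diff > 0).sum() + 0.5 * (diff == 0).sum()) / diff.size

def ks_statistic(neg, pos):
    """Two-sample Kolmogorov-Smirnov statistic as the largest |sensitivity - (1 - specificity)|."""
    fpr, tpr = empirical_roc(neg, pos)
    return np.max(np.abs(tpr - fpr))

def roc_step(neg, pos, grid):
    """Empirical ROC as a step function evaluated on a grid of false-positive rates."""
    fpr, tpr = empirical_roc(neg, pos)
    return np.array([tpr[fpr <= t].max() for t in grid])

def sup_norm_distance(neg_a, pos_a, neg_b, pos_b, grid):
    """Largest vertical gap between the ROC plots of tests A and B."""
    return np.max(np.abs(roc_step(neg_a, pos_a, grid) - roc_step(neg_b, pos_b, grid)))

def bootstrap_sup_norm(neg_a, pos_a, neg_b, pos_b, n_boot=200, seed=0):
    """Bootstrap replicates of the sup-norm statistic for two tests on the same patients.

    Assumed scheme: resample patients within each group and reuse the same
    indices for both tests, so the pairing between tests is preserved.
    """
    neg_a, pos_a, neg_b, pos_b = (
        np.asarray(x, float) for x in (neg_a, pos_a, neg_b, pos_b)
    )
    rng = np.random.default_rng(seed)
    grid = np.linspace(0.0, 1.0, 101)
    stats = np.empty(n_boot)
    for b in range(n_boot):
        i = rng.integers(0, len(neg_a), len(neg_a))  # non-diseased patients
        j = rng.integers(0, len(pos_a), len(pos_a))  # diseased patients
        stats[b] = sup_norm_distance(neg_a[i], pos_a[j], neg_b[i], pos_b[j], grid)
    return stats

# Made-up illustrative values, not data from the paper.
neg_a, pos_a = [1, 2, 3, 4], [3, 5, 6, 7]   # test A
neg_b, pos_b = [2, 1, 4, 3], [2, 3, 6, 5]   # test B, same patients
replicates = bootstrap_sup_norm(neg_a, pos_a, neg_b, pos_b)
```

The replicates only describe the sampling variability of the sup-norm statistic; turning them into a formal test or a simultaneous confidence region requires further choices (for example, how to centre the statistic under the null hypothesis) that this sketch does not make.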
