Measuring and estimating diagnostic accuracy when there are three ordinal diagnostic groups

This article studies the problem of measuring and estimating the diagnostic accuracy when there are three ordinal diagnostic groups. We use a receiver operating characteristic (ROC) surface to describe the probabilities of correct classifications into three diagnostic groups based on various sets of diagnostic thresholds of a test and propose to use the entire and the partial volume under the surface to measure the diagnostic accuracy. Mathematical properties and probabilistic interpretations of the proposed measure of diagnostic accuracy are discussed. Under the assumption of normal distributions of the diagnostic test from three diagnostic groups, we present the maximum likelihood estimate to the volume under the ROC surface and give the asymptotic variance to the estimate. We further propose several asymptotic confidence interval estimates to the volume under the ROC surface. The performance of these confidence interval estimates is evaluated in terms of attaining the nominal coverage probability based on a simulation study. In addition, we develop a method of sample size determination to achieve an adequate accuracy of the confidence interval estimate. Finally, we demonstrate the proposed methodology by applying it to the clinical diagnosis of early stage Alzheimer's disease based on the neuropsychological database of the Washington University Alzheimer's Disease Research Center. Copyright © 2005 John Wiley & Sons, Ltd.

[1]  D. Dorfman,et al.  Maximum-likelihood estimation of parameters of signal-detection theory and determination of confidence intervals—Rating-method data , 1969 .

[2]  C. Metz Basic principles of ROC analysis. , 1978, Seminars in nuclear medicine.

[3]  John A. Swets,et al.  Evaluation of diagnostic systems : methods from signal detection theory , 1982 .

[4]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[5]  C. Metz,et al.  A New Approach for Testing the Significance of Differences Between ROC Curves Measured from Correlated Data , 1984 .

[6]  E. DeLong,et al.  Sensitivity and specificity of a monitoring test. , 1985, Biometrics.

[7]  William H. Press,et al.  Numerical recipes in C. The art of scientific computing , 1987 .

[8]  L. Wolfson,et al.  Clinico‐pathologic studies in dementia , 1988, Neurology.

[9]  J A Swets,et al.  Measuring the accuracy of diagnostic systems. , 1988, Science.

[10]  R. Katzman.,et al.  Clinical, pathological, and neurochemical changes in dementia: A subgroup with preserved mental status and numerous neocortical plaques , 1988, Annals of neurology.

[11]  E. DeLong,et al.  Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. , 1988, Biometrics.

[12]  R D Hill,et al.  Very mild senile dementia of the Alzheimer type. II. Psychometric test performance. , 1989, Archives of neurology.

[13]  Mitchell H. Gail,et al.  A family of nonparametric statistics for comparing diagnostic markers with paired or unpaired data , 1989 .

[14]  M. Albert,et al.  Prevalence of Alzheimer's disease in a community population of older persons. Higher than previously reported. , 1989, JAMA.

[15]  J. Price,et al.  Very mild Alzheimer's disease , 1991, Neurology.

[16]  J. Morris The Clinical Dementia Rating (CDR) , 1993, Neurology.

[17]  W. Hall,et al.  Confidence Bands for Receiver Operating Characteristic Curves , 1993, Medical decision making : an international journal of the Society for Medical Decision Making.

[18]  L A Beckett,et al.  Age-specific incidence of Alzheimer's disease in a community population. , 1995, JAMA.

[19]  E. S. Venkatraman,et al.  A distribution-free procedure for comparing receiver operating characteristic curves from a paired experiment , 1996 .

[20]  N A Obuchowski,et al.  Sample size determination for diagnostic accuracy studies involving binormal ROC curve indices. , 1997, Statistics in medicine.

[21]  J. Miller,et al.  A prospective study of cognitive function and onset of dementia in cognitively healthy elders. , 1998, Archives of neurology.

[22]  D. Balota,et al.  Relating anatomy to function in Alzheimer's disease , 1998, Neurology.

[23]  C A Roe,et al.  Statistical Comparison of Two ROC-curve Estimates Obtained from Partially-paired Datasets , 1998, Medical decision making : an international journal of the Society for Medical Decision Making.

[24]  D. Mossman Three-way ROCs , 1999, Medical decision making : an international journal of the Society for Medical Decision Making.

[25]  M. Binder,et al.  Comparing Three-class Diagnostic Tests by Three-way ROC Analysis , 2000, Medical decision making : an international journal of the Society for Medical Decision Making.

[26]  Xiao-Hua Zhou,et al.  Statistical Methods in Diagnostic Medicine , 2002 .

[27]  J Philip Miller,et al.  Rates of progression in mild cognitive impairment and early Alzheimer’s disease , 2002, Neurology.

[28]  J. Morris,et al.  Neuropathologic Criteria for Diagnosing Alzheimer Disease in Persons with Pure Dementia of Alzheimer Type , 2004, Journal of neuropathology and experimental neurology.

[29]  H. Braak,et al.  Neuropathological stageing of Alzheimer-related changes , 2004, Acta Neuropathologica.