What's under the ROC? An Introduction to Receiver Operating Characteristics Curves

It is often necessary to dichotomize a continuous scale to separate respondents into normal and abnormal groups. However, because the distributions of the scores in these 2 groups most often overlap, any cut point that is chosen will result in 2 types of errors: false negatives (that is, abnormal cases judged to be normal) and false positives (that is, normal cases placed in the abnormal group). Changing the cut point will alter the numbers of erroneous judgments but will not eliminate the problem. A technique called receiver operating characteristic (ROC) curves allows us to determine the ability of a test to discriminate between groups, to choose the optimal cut point, and to compare the performance of 2 or more tests. We discuss how to calculate and compare ROC curves and the factors that must be considered in choosing an optimal cut point.

[1]  Elisa T. Lee,et al.  Use of relative operating characteristic analysis in epidemiology. A method for dealing with subjective judgement. , 1981, American journal of epidemiology.

[2]  D. M. Green,et al.  Signal detection theory and psychophysics , 1966 .

[3]  D. Streiner,et al.  Evaluation of 2 Measures of Psychological Distress as Screeners for Depression in the General Population , 2007, Canadian journal of psychiatry. Revue canadienne de psychiatrie.

[4]  Lucila Ohno-Machado,et al.  The use of receiver operating characteristic curves in biomedical informatics , 2005, J. Biomed. Informatics.

[5]  D. Streiner,et al.  Health Measurement Scales: A practical guide to thier development and use , 1989 .

[6]  J. Swets ROC analysis applied to the evaluation of medical imaging techniques. , 1979, Investigative radiology.

[7]  J. Swets,et al.  Assessment of diagnostic technologies. , 1979, Science.

[8]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[9]  R. Jaeschke,et al.  A readers' guide to the interpretation of diagnostic test properties: clinical example of sepsis , 2003, Intensive Care Medicine.

[10]  N. Obuchowski Receiver operating characteristic curves and their use in radiology. , 2003, Radiology.

[11]  David L Streiner,et al.  Diagnosing Tests: Using and Misusing Diagnostic and Screening Tests , 2003, Journal of personality assessment.

[12]  L. Radloff The CES-D Scale , 1977 .

[13]  A. Bergman,et al.  The morbidity of cardiac nondisease in schoolchildren. , 1967, The New England journal of medicine.

[14]  L B Lusted,et al.  Radiographic applications of receiver operating characteristic (ROC) curves. , 1974, Radiology.

[15]  A. Popli,et al.  Can the MiniSCID improve the detection of bipolarity in private practice? , 2005, Journal of affective disorders.

[16]  R. Kessler,et al.  The performance of the K6 and K10 screening scales for psychological distress in the Australian National Survey of Mental Health and Well-Being , 2003, Psychological Medicine.

[17]  D. Streiner Building a Better Model: An Introduction to Structural Equation Modelling , 2006, Canadian journal of psychiatry. Revue canadienne de psychiatrie.

[18]  John A. Swets The science of choosing the right decision threshold in high-stakes diagnostics. , 1992 .

[19]  M. Charlson,et al.  Labelling and absenteeism: the Massachusetts Mutual experience. , 1981, Clinical and investigative medicine. Medecine clinique et experimentale.

[20]  D. Bamber The area above the ordinal dominance graph and the area below the receiver operating characteristic graph , 1975 .

[21]  R. Kessler,et al.  Short screening scales to monitor population prevalences and trends in non-specific psychological distress , 2002, Psychological Medicine.

[22]  David L Streiner,et al.  Breaking up is Hard to Do: The Heartbreak of Dichotomizing Continuous Data , 2002, Canadian journal of psychiatry. Revue canadienne de psychiatrie.