Receiver operating characteristic curve inference from a sample with a limit of detection.

The receiver operating characteristic curve is a commonly used tool for evaluating biomarker usefulness in clinical diagnosis of disease. Frequently, biomarkers being assessed have immeasurable or unreportable samples below some limit of detection. Ignoring observations below the limit of detection leads to negatively biased estimates of the area under the curve. Several correction methods are suggested in the areas of mean estimation and testing but nothing regarding the receiver operating characteristic curve or its summary measures. In this paper, the authors show that replacement values below the limit of detection, including those suggested, result in the same biased area under the curve when properly accounted for, but they also provide guidance on the usefulness of these values in limited situations. The authors demonstrate maximum likelihood techniques leading to asymptotically unbiased estimators of the area under the curve for both normally and gamma distributed biomarker levels. Confidence intervals are proposed, the coverage probability of which is scrutinized by simulation study. An example using polychlorinated biphenyl levels to classify women with and without endometriosis illustrates the potential benefits of these methods.

[1]  Enrique F Schisterman,et al.  Adjusting the generalized ROC curve for covariates , 2004, Statistics in medicine.

[2]  Enrique F Schisterman,et al.  ROC analysis for markers with mass at zero , 2006, Statistics in medicine.

[3]  A. Gupta,et al.  ESTIMATION OF THE MEAN AND STANDARD DEVIATION OF A NORMAL POPULATION FROM A CENSORED SAMPLE , 1952 .

[4]  W. Youden,et al.  Index for rating diagnostic tests , 1950, Cancer.

[5]  H. Harter,et al.  Iterative maximum-likelihood estimation of the parameters of normal populations from singly and doubly censored samples. , 1969, Biometrika.

[6]  R. Hornung,et al.  Estimation of Average Concentration in the Presence of Nondetectable Values , 1990 .

[7]  Albert H. Moore,et al.  Asymptotic Variances and Covariances of Maximum-Likelihood Estimators, from Censored Samples, of the Parameters of Weibull and Gamma Populations , 1967 .

[8]  B. Whitcomb,et al.  Environmental PCB exposure and risk of endometriosis. , 2005, Human reproduction.

[9]  Enrique F Schisterman,et al.  Roc Curve Analysis for Biomarkers Based on Pooled Assessments , 2022 .

[10]  C. Haas,et al.  Estimation of averages in truncated samples , 1990 .

[11]  M. Zweig,et al.  Receiver-operating characteristic (ROC) plots: a fundamental evaluation tool in clinical medicine. , 1993, Clinical chemistry.

[12]  Xiao-Hua Zhou,et al.  Statistical Methods in Diagnostic Medicine , 2002 .

[13]  John M. Nocerino,et al.  Robust estimation of mean and variance using environmental data sets with below detection limit observations , 2002 .

[14]  A. Cohen,et al.  Estimating the Mean and Variance of Normal Populations from Singly Truncated and Doubly Truncated Samples , 1950 .

[15]  H. Lynn,et al.  Maximum likelihood inference for left-censored HIV RNA data. , 2001, Statistics in medicine.