Comparative study of statistical methods for clustered ROC data: nonparametric methods and multiple outputation methods

In clustered receiver operating characteristic (ROC) data each patient has several normal and abnormal observations. Within the same cluster, observations are naturally correlated. Several nonparametric methods have been proposed in the literature to handle clustered data structure, but their performances on simulated and real datasets have not been compared. Recently, a multiple outputation method has been considered for clustered data in areas other than diagnostic accuracy to account for within-cluster correlation. The multiple outputation method offers a resampling-based alternative for one sample clustered data with or without covariates, or for hypothesis testing in two sample clustered data. The method does not require a specific within-cluster correlation structure and yields a valid estimator while accounting for the within-cluster correlations. This paper contributes to the literature by introducing the multiple outputation method to the ROC setting, and empirically comparing the performance of these clustered ROC curve methods. The performance of these methods is also evaluated through two real examples.

[1]  N A Obuchowski,et al.  Nonparametric analysis of clustered ROC curve data. , 1997, Biometrics.

[2]  R. Glynn,et al.  Incorporation of Clustering Effects for the Wilcoxon Rank Sum Test: A Large‐Sample Approach , 2003, Biometrics.

[3]  E. DeLong,et al.  Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. , 1988, Biometrics.

[4]  C. Metz Basic principles of ROC analysis. , 1978, Seminars in nuclear medicine.

[5]  Xiao-Hua Zhou,et al.  Statistical Methods in Diagnostic Medicine , 2002 .

[6]  Jon A. Wellner,et al.  Weak Convergence and Empirical Processes: With Applications to Statistics , 1996 .

[7]  Gang Li,et al.  A Unified Approach to Nonparametric Comparison of Receiver Operating Characteristic Curves for Longitudinal and Clustered Data , 2008, Journal of the American Statistical Association.

[8]  Pranab Kumar Sen,et al.  Within‐cluster resampling , 2001 .

[9]  K Y Liang,et al.  Longitudinal data analysis for discrete and continuous outcomes. , 1986, Biometrics.

[10]  Zhen Chen,et al.  Marginal analysis of measurement agreement among multiple raters with non-ignorable missing ratings , 2014 .

[11]  Nancy A Obuchowski,et al.  Receiver operating characteristic (ROC) curves: review of methods with applications in diagnostic medicine , 2018, Physics in medicine and biology.

[12]  Somnath Datta,et al.  Rank-Sum Tests for Clustered Data , 2005 .

[13]  Eleanor M Pullenayegum,et al.  Multiple outputation for the analysis of longitudinal data subject to irregular observation , 2016, Statistics in medicine.

[14]  E. Leifer,et al.  Multiple Outputation: Inference for Complex Clustered Data by Averaging Analyses from Independent Data , 2003, Biometrics.