A random-sum Wilcoxon statistic and its application to analysis of ROC and LROC data.

The Wilcoxon-Mann-Whitney statistic is commonly used for a distribution-free comparison of two groups. One requirement for its use is that the sample sizes of the two groups are fixed. This is violated in some of the applications such as medical imaging studies and diagnostic marker studies; in the former, the violation occurs since the number of correctly localized abnormal images is random, while in the latter the violation is due to some subjects not having observable measurements. For this reason, we propose here a random-sum Wilcoxon statistic for comparing two groups in the presence of ties, and derive its variance as well as its asymptotic distribution for large sample sizes. The proposed statistic includes the regular Wilcoxon rank-sum statistic. Finally, we apply the proposed statistic for summarizing location response operating characteristic data from a liver computed tomography study, and also for summarizing diagnostic accuracy of biomarker data.

[1]  P F Judy,et al.  Visualization and detection-localization on computed tomographic images. , 1991, Investigative radiology.

[2]  Enrique F Schisterman,et al.  Maximum Likelihood Ratio Tests for Comparing the Discriminatory Ability of Biomarkers Subject to Limit of Detection , 2008, Biometrics.

[3]  C. Metz,et al.  Visual detection and localization of radiographic images. , 1975, Radiology.

[4]  D. Chakraborty,et al.  Free-response methodology: alternate analysis and a new observer-performance experiment. , 1990, Radiology.

[5]  I Hinberg,et al.  Receiver operator characteristic (ROC) curves and non-normal data: an empirical study. , 1990, Statistics in medicine.

[6]  Enrique F Schisterman,et al.  ROC analysis for markers with mass at zero. , 2006, Statistics in medicine.

[7]  H. Robbins On the Asymptotic Distribution of the Sum of a Random Number of Random Variables. , 1948, Proceedings of the National Academy of Sciences of the United States of America.

[8]  S. Guiasu On the Asymptotic Distribution of the Sequences of Random Variables with Random Indices , 1971 .

[9]  D. Szasz Limit Theorems for the Distributions of the Sums of a Random Number of Random Variables , 1972 .

[10]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[11]  Chris Lloyd,et al.  Using Smoothed Receiver Operating Characteristic Curves to Summarize and Compare Diagnostic Systems , 1998 .

[12]  D P Chakraborty,et al.  Maximum likelihood analysis of free-response receiver operating characteristic (FROC) data. , 1989, Medical physics.

[13]  Mitchell J. Mergenthaler Nonparametrics: Statistical Methods Based on Ranks , 1979 .

[14]  B. Gnedenko,et al.  Random Summation: Limit Theorems and Applications , 1996 .

[15]  Enrique F Schisterman,et al.  Receiver operating characteristic curve inference from a sample with a limit of detection. , 2006, American journal of epidemiology.

[16]  Xiao-Hua Zhou,et al.  Statistical Methods in Diagnostic Medicine , 2002 .

[17]  Maria Kallergi,et al.  High-performance wavelet compression for mammography: localization response operating characteristic evaluation. , 2006, Radiology.

[18]  R. Swensson Unified measurement of observer performance in detecting and localizing target objects on images. , 1996, Medical physics.

[19]  Estimation of the mean of a K‐sample U‐statistic with missing outcomes and auxiliaries , 2001 .