Comparing observer performance with mixture distribution analysis when there is no external gold standard

Mixture distribution analysis (MDA) is proposed as a statistical methodology for comparing observer readings on different imaging modalities when the image findings cannot be independently verified. The study utilized a data set consisting of independent, blinded readings by 4 radiologists of a stratified sample of 95 bedside chest images obtained using computed radiography. Each case was rad on hard and soft copy. The area under the ROC curve (AUC) was calculated using ROCFIT and the relative percent correct (RPC) was calculated from point distributions estimated by the MDA. The expectation maximization algorithm was used to perform a maximum likelihood estimation of the fit to either 3, 4 or 5 point distributions. There was agreement between the AUC and the RPC based upon 3 point distributions representing easy normals, hard normals and abnormals, easy abnormals, hard normals, hard abnormals and easy abnormals. We conclude that the MDA may be a viable alternative to the ROC for evaluating observer performance on imaging modalities in clinical settings where image verification is either difficult or impossible.

[1]  K. Berbaum,et al.  Receiver operating characteristic rating analysis. Generalization to the population of readers and patients with the jackknife method. , 1992, Investigative radiology.

[2]  B. Efron Better Bootstrap Confidence Intervals , 1987 .

[3]  M. Bronskill,et al.  Receiver Operator characteristic (ROC) Analysis without Truth , 1990, Medical decision making : an international journal of the Society for Medical Decision Making.

[4]  M W Vannier,et al.  MR examination of the knee: interpretation with multiscreen digital workstation vs hardcopy format. , 1991, AJR. American journal of roentgenology.

[5]  M. Aickin Maximum likelihood estimation of agreement in the constant predictive probability model, and its relation to Cohen's kappa. , 1990, Biometrics.

[6]  R K Taira,et al.  Receiver-operating-characteristic study of chest radiographs in children: digital hard-copy film vs 2K x 2K soft-copy images. , 1992, AJR. American journal of roentgenology.

[7]  H E Rockette,et al.  Receiver operating characteristic analysis of chest image interpretation with conventional, laser-printed, and high-resolution workstation images. , 1990, Radiology.

[8]  H. Kundel,et al.  The Effect of Verification on the Assessment of Imaging Techniques , 1983, Investigative radiology.

[9]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[10]  H E Rockette,et al.  Practical issues of experimental ROC analysis. Selection of controls. , 1990, Investigative radiology.

[11]  B. McNeil,et al.  Assessment of radiologic tests: control of bias and other design considerations. , 1988, Radiology.

[12]  D C Sullivan,et al.  Chest radiography: comparison of high-resolution digital displays with conventional and digital film. , 1992, Investigative radiology.

[13]  G G Cox,et al.  Chest radiography: comparison of high-resolution digital displays with conventional and digital film. , 1990, Radiology.

[14]  John Uebersax,et al.  Statistical Modeling of Expert Ratings on Medical Treatment Appropriateness , 1993 .

[15]  C B Begg,et al.  Consensus Diagnoses and "Gold Standards" , 1990, Medical decision making : an international journal of the Society for Medical Decision Making.

[16]  J A Swets,et al.  Measuring the accuracy of diagnostic systems. , 1988, Science.

[17]  D Magid,et al.  Subtle orthopedic fractures: teleradiology workstation versus film interpretation. , 1993, Radiology.

[18]  H L Kundel,et al.  The evaluation of radiographic techniques by observer tests: problems, pitfalls, and procedures. , 1974, Investigative radiology.

[19]  M L Giger,et al.  Digital chest radiography: effect on diagnostic accuracy of hard copy, conventional video, and reversed gray scale video display formats. , 1988, Radiology.

[20]  H L Kundel,et al.  Mixture distribution and receiver operating characteristic analysis of bedside chest imaging with screen-film and computed radiography. , 1997, Academic radiology.