Inter-observer variation in masked and unmasked images for quality evaluation of clinical radiographs.

PURPOSE To investigate the influence of masking on the inter-observer variation in image quality evaluation of clinical radiographs of chest and lumbar spine. BACKGROUND Inter-observer variation is a big problem in image quality evaluation since this variation is often much bigger than the variation in image quality between, for example, two radiographic systems. In this study, we have evaluated the effect of masking on the inter-observer variation. The idea of the masking was to force every observer to view exactly the same part of the image and to avoid the effect of the overall 'first impression' of the image. A discussion with a group of European expert radiologists before the study indicated that masking might be a good way to reduce the inter-observer variation. METHODS Five chest and five lumbar spine radiographs were collected together with detailed information regarding exposure conditions. The radiographs were digitised with a high-performance scanner and five different manipulations were performed, simulating five different exposure conditions. The contrast, noise and spatial resolution were manipulated by this method. The images were printed onto the film and the individual masks were produced for each film, showing only the parts of the images that were necessary for the image quality evaluation. The quality of the images was evaluated on ordinary viewing boxes by a large group of experienced radiologists. The images were examined with and without the masks with a set of image criteria (if fulfilled, 1 point; and not fulfilled, 0 point), and the mean score was calculated for each simulated exposure condition. RESULTS The results of this study indicate that-contrary to what was supposed-the inter-observer variation increased when the images were masked. In some cases, especially for chest, this increase was statistically significant. CONCLUSIONS Based on the results of this study, image masking in studies of fulfilment of image criteria cannot be recommended.

[1]  C. Nodine,et al.  The Nature of Expertise in Radiology , 2000 .

[2]  M Ruschin,et al.  Clinical evaluation of a new set of image quality criteria for mammography. , 2005, Radiation protection dosimetry.

[3]  M Ruschin,et al.  Can the average glandular dose in routine digital mammography screening be reduced? A pilot study using revised image quality criteria. , 2005, Radiation protection dosimetry.

[4]  D R Dance,et al.  Influence of the characteristic curve on the clinical image quality of lumbar spine and chest radiographs. , 2004, The British journal of radiology.

[5]  D. Blanc,et al.  European guidelines on quality criteria for diagnostic radiographic images , 1998 .

[6]  M Båth,et al.  The influence of different technique factors on image quality of chest radiographs as evaluated by modified CEC image quality criteria. , 2002, The British journal of radiology.

[7]  M Zankl,et al.  The influence of different technique factors on image quality of lumbar spine radiographs as evaluated by established CEC image criteria. , 2000, The British journal of radiology.

[8]  E. Samei,et al.  Effects of Anatomical Structure on Signal Detection , 2000 .

[9]  Harold L. Kundel,et al.  Visual Search in Medical Images , 2000 .

[10]  P. Robinson,et al.  Radiology's Achilles' heel: error and variation in the interpretation of the Röntgen image. , 1997, The British journal of radiology.

[11]  S. Mattsson,et al.  Comparison of two methods for evaluating image quality of chest radiographs , 2000, Medical Imaging.

[12]  Anders Tingberg Quantifying the quality of medical x-ray images. An evaluation based on normal anatomy for lumbar spine and chest radiography. , 2000 .