Statistical Analysis for Radiologists’ Interpretations Variability in Mammograms

Conventional mammography is considered the modality of choice for the detection of breast cancer. The process involves a human radiologist visually diagnosing the mammogram, which causes limitations such as missing a cancer and/or diagnosing a false cancer. Another disadvantage of conventional mammography is the variability among screening radiologists in interpreting mammographic images. The objectives of this study are to verify this variability and to develop an image processing algorithm that can automatically detect benign tumors of the female breast. A sample of ten digital mammograms obtained from the MiniMIAS database was distributed to four different radiologists in order to verify the variability among them. Furthermore, three algorithms were developed in order to automatically detect benign tumors of the female breast. The proposed algorithms were based on combinations of certain statistical features and were tested on the same sample of images. Results showed that the detection mechanism using the proposed algorithms was acceptable despite the fact that they exhibited a few errors. It was concluded that the use of a combination of the mean and median statistical tools is effective in assisting radiologists in interpreting mammographic images containing benign tumors. DOI: 10.4018/ijsbbt.2012100103 International Journal of Systems Biology and Biomedical Technologies, 1(4), 28-46, October-December 2012 29 Copyright © 2012, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. of normal tissue development (Locasale & Cantley, 2010). The cell metabolism increases to meet the requirements of rapid cell proliferation, autonomous cell growth and to maintain its survival (Locasale & Cantley, 2010). The most common symptom of breast cancer is the presence of painless and slowly growth lump that may alter the contour or size of the breast. It is also characterized by skin changes, inverted nipple and bloodstained nipple discharge (Zou & Guo, 2003). The lymphatic nodes under the armpit may be swollen if affected by cancer. In late stage, the growth may ulcerate through the skin and infected (Zou & Guo, 2003). Bone pain, tenderness over the liver, headaches, shortness of breath and chronic cough may be an indication of the cancer spreading to other organs in the body. Early diagnosis requires an accurate and reliable diagnosis procedure that allows physicians to distinguish benign breast tumors from malignant ones without going for surgical biopsy (Cheng et al., 2003). Mammography is a specific type of imaging that uses a low-dose X-ray system to examine breasts. The dose is typically known to be around 0.7 mSv. Mammography plays a central part in early detection of breast cancers because it can show changes in the breast up to two years before a patient or physician can feel them. The detection process is based on the identification of areas of high intensities that indicate the presence of either benign or malignant tumors. Two types of mammography are known; Screen-Film Mammography (SFM) and Digital Mammography (DM). DM was introduced as an alternative diagnostic technique in order to overcome the problems of SFM which include the variability of diagnosis among the screening radiologists as well as limitations in the detection of benign tumors (Hambly et al., 2009; Skaane, 2009; Skaane et al., 2007; Van Ongeval et al., 2005). DM incorporates a new technique called Computer-Aided Diagnosis (CAD) which employs the tools of image processing for image enhancement and diagnosis (Balakumaran et al., 2010; Noble et al., 2009; Li et al., 2008; Doi, 2007; Zheng et al., 2004, 2002; Christoyianni et al., 2002; Doi et al., 1999). It takes an electronic image of the breast and stores it directly in a computer. DM uses less radiation than film mammography. The goal of CAD is to improve radiologists’ performance by indicating the sites of potential abnormalities, to reduce the number of missed lesions, and/or providing quantitative analysis of specific regions in an image to improve diagnosis (Visser et al., 2012; Bick & Diekmann 2007; Bazzocchi et al., 2001). Computer-aided methods for detecting malignant texture have been achieved using different techniques (Oliver et al., 2010, 2006; Houssami et al., 2009; Elter & Horsch 2009; Kim et al., 2008; Yang et al., 2007; Sakka et al., 2006; Arodz et al., 2005; Mavroforakis et al., 2005; Sajda et al., 2002; Gavrielides et al., 2002; Li et al., 2002; Verma & Zakos 2001; Kobatake et al., 1999). CAD systems typically operate as automated “second opinion” or “double reading” systems that indicate lesion location and/or type. Since individual human observers overlook different findings, it has been shown that “double reading” increases the detection rate of breast cancers by 5%-15% (Bazzocchi et al., 2001). Subjectivity among screening radiologists in the interpretation of mammograms results in a high percentage of misdiagnosed cancer cases and a high percentage of missed cancer cases (Cornford et al., 2011, 2005; Halladay et al., 2010; Molins et al., 2008; Gur et al., 2008; Elmore et al., 2009, 2003, 1998, 1994; Coldman et al., 2006; Beam et al., 1996, 2003; Moss et al., 2005; Blanks et al., 1998). This subjectivity is the end product of several factors including radiologists’ fatigue, incompetence and lack of training to name a few. The ultimate diagnosis of all types of breast disease depends on a biopsy. In most cases the decision for a biopsy is based on mammography findings. Biopsy results indicate that 65-90% of suspected cancer detected by mammography turned out to be benign (Cheng et al., 2006). Therefore, it would be valuable to develop a computer aided method for mass classification based on extracted features from the region of interests (ROI) in mammograms. This would reduce the number of unnecessary biopsies in patients with benign disease and thus avoid patients’ physical and mental suffering, 17 more pages are available in the full version of this document, which may be purchased using the "Add to Cart" button on the product's webpage: www.igi-global.com/article/statistical-analysis-radiologistsinterpretations-variability/75152?camid=4v1 This title is available in InfoSci-Journals, InfoSci-Journal Disciplines Medicine, Healthcare, and Life Science, InfoSciHealthcare Administration, Clinical Practice, and Bioinformatics eJournal Collection, InfoSci-Physical Sciences, Biological Sciences, and Engineering eJournal Collection, InfoSci-Journal Disciplines Computer Science, Security, and Information Technology, InfoSci-Journal Disciplines Engineering, Natural, and Physical Science. Recommend this product to your librarian: www.igi-global.com/e-resources/libraryrecommendation/?id=2

[1]  Lihua Li,et al.  Computer-aided diagnosis of masses with full-field digital mammography. , 2002, Academic radiology.

[2]  Emily F Conant,et al.  Association of volume and volume-independent factors with accuracy in screening mammogram interpretation. , 2003, Journal of the National Cancer Institute.

[3]  Marios A Gavrielides,et al.  Parameter optimization of a computer-aided diagnosis scheme for the segmentation of microcalcification clusters in mammograms. , 2002, Medical physics.

[4]  K. Doi,et al.  Computer-aided diagnosis in radiology: potential and pitfalls. , 1999, European journal of radiology.

[5]  H Bosmans,et al.  Current challenges of full field digital mammography. , 2005, Radiation protection dosimetry.

[6]  L. Cantley,et al.  Altered metabolism in cancer , 2010, BMC Biology.

[7]  Nico Karssemeijer,et al.  Increase in perceived case suspiciousness due to local contrast optimisation in digital screening mammography , 2011, European Radiology.

[8]  T. Balakumaran,et al.  Microcalcification detection in digital mammograms using novel filter bank , 2010, Biometrics Technology.

[9]  N Houssami,et al.  Early detection of breast cancer: Overview of the evidence on computer‐aided detection in mammography screening * , 2009, Journal of medical imaging and radiation oncology.

[10]  Robert Marti,et al.  A Comparison of Breast Tissue Classification Techniques , 2006, MICCAI.

[11]  D Koutsouris,et al.  Classification algorithms for microcalcifications in mammograms (Review). , 2006, Oncology reports.

[12]  C. D'Orsi,et al.  Accuracy of screening mammography interpretation by characteristics of radiologists. , 2004, Journal of the National Cancer Institute.

[13]  R. Blanks,et al.  Is radiologists' volume of mammography reading related to accuracy? A critical review of the literature. , 2005, Clinical radiology.

[14]  J. Elmore,et al.  Does diagnostic accuracy in mammography depend on radiologists' experience? , 1998, Journal of women's health.

[15]  David Gur,et al.  A method to test the reproducibility and to improve performance of computer-aided detection schemes for digitized mammograms. , 2004, Medical physics.

[16]  Craig A. Beam,et al.  Variability in the interpretation of screening mammograms by US radiologists. Findings from a national sample. , 1996, Archives of internal medicine.

[17]  Luisa P. Wallace,et al.  The "laboratory" effect: comparing radiologists' performance and variability during prospective clinical and laboratory mammography interpretations. , 2008, Radiology.

[18]  Brijesh Verma,et al.  A computer-aided diagnosis system for digital mammograms based on fuzzy-neural and feature extraction techniques , 2001, IEEE Transactions on Information Technology in Biomedicine.

[19]  Wendy Bruening,et al.  Computer-aided detection mammography for breast cancer screening: systematic review and meta-analysis , 2009, Archives of Gynecology and Obstetrics.

[20]  R. Blanks,et al.  A comparison of cancer detection rates achieved by breast cancer screening programmes by number of readers, for one and two view mammography: results from the UK National Health Service breast screening programme , 1998, Journal of medical screening.

[21]  Masayuki Murakami,et al.  Computerized detection of malignant tumors on digital mammograms , 1999, IEEE Transactions on Medical Imaging.

[22]  A. Evans,et al.  Optimal screening mammography reading volumes; evidence from real life in the East Midlands region of the NHS Breast Screening Programme. , 2011, Clinical radiology.

[23]  David A. Yuen,et al.  Pattern recognition techniques for automatic detection of suspicious-looking anomalies in mammograms , 2005, Comput. Methods Programs Biomed..

[24]  Woo Kyung Moon,et al.  Computer-aided detection in full-field digital mammography: sensitivity and reproducibility in serial examinations. , 2008, Radiology.

[25]  G. Kokkinakis,et al.  Computer aided diagnosis of breast cancer in digitized mammograms. , 2002, Computerized medical imaging and graphics : the official journal of the Computerized Medical Imaging Society.

[26]  Woo Kyung Moon,et al.  Screening mammography-detected cancers: sensitivity of a computer-aided detection system applied to full-field digital mammograms. , 2007, Radiology.

[27]  Harris Georgiou,et al.  Significance analysis of qualitative mammographic features, using linear classifiers, neural networks and support vector machines. , 2005, European journal of radiology.

[28]  R. Shumak,et al.  Organized breast screening programs in Canada: effect of radiologist reading volumes on outcomes. , 2006, Radiology.

[29]  U Bottigli,et al.  [Application of a computer-aided detection (CAD) system to digitalized mammograms for identifying microcalcifications]. , 2001, La Radiologia medica.

[30]  Paul Sajda,et al.  Learning contextual relationships in mammograms using a hierarchical pyramid neural network , 2002, IEEE Transactions on Medical Imaging.

[31]  Luisa P. Wallace,et al.  Computer-aided detection in mammography: an assessment of performance on current and prior images. , 2002, Academic radiology.

[32]  Kunio Doi,et al.  Computer-aided diagnosis in medical imaging: Historical review, current status and future potential , 2007, Comput. Medical Imaging Graph..

[33]  Jacqueline R. Halladay,et al.  Positive predictive value of mammography: comparison of interpretations of screening and diagnostic images by the same radiologist and by different radiologists. , 2010, AJR. American journal of roentgenology.

[34]  X. Castells,et al.  Association between Radiologists' Experience and Accuracy in Interpreting Screening Mammograms , 2008, BMC health services research.

[35]  Niall Phelan,et al.  Comparison of digital mammography and screen-film mammography in breast cancer screening: a review in the Irish breast screening program. , 2009, AJR. American journal of roentgenology.

[36]  P. Skaane,et al.  Randomized trial of screen-film versus full-field digital mammography with soft-copy reading in population-based screening program: follow-up and final results of Oslo II study. , 2007, Radiology.

[37]  Arnau Oliver,et al.  A review of automatic mass detection and segmentation in mammographic images , 2010, Medical Image Anal..

[38]  P. Skaane Studies comparing screen-film mammography and full-field digital mammography in breast cancer screening: Updated review , 2009, Acta radiologica.

[39]  J. Elmore,et al.  Variability in radiologists' interpretations of mammograms. , 1994, The New England journal of medicine.

[40]  Zhenyu Guo,et al.  A review of electrical impedance techniques for breast cancer detection. , 2003, Medical engineering & physics.

[41]  M. Elter,et al.  CADx of mammographic masses and clustered microcalcifications: a review. , 2009, Medical physics.

[42]  Li Lan,et al.  Evaluation of computer-aided diagnosis on a large clinical full-field digital mammographic dataset. , 2008, Academic radiology.

[43]  J. Elmore,et al.  Variability in interpretive performance at screening mammography and radiologists' characteristics associated with accuracy. , 2009, Radiology.

[44]  C. D'Orsi,et al.  International variation in screening mammography interpretations in community-based programs. , 2003, Journal of the National Cancer Institute.

[45]  Felix Diekmann,et al.  Digital mammography: what do we and what don’t we know? , 2007, European Radiology.

[46]  Heng-Da Cheng,et al.  Approaches for automated detection and classification of masses in mammograms , 2006, Pattern Recognit..

[47]  Alan C. Evans,et al.  The pathological and radiological features of screen-detected breast cancers diagnosed following arbitration of discordant double reading opinions. , 2005, Clinical radiology.

[48]  Heng-Da Cheng,et al.  Computer-aided detection and classification of microcalcifications in mammograms: a survey , 2003, Pattern Recognit..