Preprocessing of Breast Cancer Images to Create Datasets for Deep-CNN

Breast cancer is the most diagnosed cancer in Australia with crude incidence rates increasing drastically from 62.8 at ages 35–39 to 271.4 at ages 50–54 (cases per 100,000 women). Various researchers have proposed methods and tools based on Machine Learning and Convolutional Neural Networks for assessing mammographic images, but these methods have produced detection and interpretation errors resulting in false-positive and false-negative cases when used in the real world. We believe that this problem can potentially be resolved by implementing effective image pre-processing techniques to create training data for Deep-CNN. Therefore, the main aim of this research is to propose effective image pre-processing methods to create datasets that can save computational time for the neural network and improve accuracy and classification rates. To do so, this research proposes methods for background removal, pectoral muscle removal, adding noise to the images, and image enhancements. Adding noise without affecting the quality of details in the images makes the input images for the neural network more representative, which may improve the performance of the neural network model when used in the real world. The proposed method for background removal is the “Rolling Ball Algorithm” and “Huang’s Fuzzy Thresholding”, which succeed in removing background from 100% of the images. For pectoral muscle removal “Canny Edge Detection” and “Hough’s Line Transform” are used, which removed muscle from 99.06% of the images. “Invert”, “CTI_RAS” and “ISOCONTOUR” lookup tables (LUTs) were used for image enhancements to outline the ROIs and regions within the ROIs.

[1]  N. M. Mat Isa,et al.  A fully automated breast separation For mammographic images , 2015, 2015 International Conference on BioSignal Analysis, Processing and Systems (ICBAPS).

[2]  Richard O. Duda,et al.  Use of the Hough transformation to detect lines and curves in pictures , 1972, CACM.

[3]  Thomas Martin Deserno,et al.  A simple parametric equation for pseudocoloring grey scale images keeping their original brightness progression , 1997, Image Vis. Comput..

[4]  R. J. Ferrari,et al.  Identification of the breast boundary in mammograms using active contour models , 2004, Medical and Biological Engineering and Computing.

[5]  Savita Gupta,et al.  Integrating saliency with fuzzy thresholding for brain tumor extraction in MR images , 2021, J. Vis. Commun. Image Represent..

[6]  Arnaldo de Albuquerque Araújo,et al.  MammoSys: A content-based image retrieval system using breast density patterns , 2010, Comput. Methods Programs Biomed..

[7]  István Csabai,et al.  Detecting and classifying lesions in mammograms with Deep Learning , 2017, Scientific Reports.

[8]  Geert J. S. Litjens,et al.  Quantifying the effects of data augmentation and stain color normalization in convolutional neural networks for computational pathology , 2019, Medical Image Anal..

[9]  Haza Nuzly Abdul Hamed,et al.  Improved Threshold Based and Trainable Fully Automated Segmentation for Breast Cancer Boundary and Pectoral Muscle in Mammogram Images , 2020, IEEE Access.

[10]  Shadrokh Samavi,et al.  Detection of abnormalities in mammograms using deep features , 2019, Journal of Ambient Intelligence and Humanized Computing.

[11]  Jaime S. Cardoso,et al.  INbreast: toward a full-field digital mammographic database. , 2012, Academic radiology.

[12]  Simon K. Warfield,et al.  Deep learning with noisy labels: exploring techniques and remedies in medical image analysis , 2020, Medical Image Anal..

[13]  V. R. Thool,et al.  Detection and Segmentation of Pectoral Muscle on MLO-View Mammogram Using Enhancement Filter , 2017, Journal of Medical Systems.

[14]  Santiago Aja-Fernández,et al.  A local fuzzy thresholding methodology for multiregion image segmentation , 2015, Knowl. Based Syst..

[15]  Philip J. Morrow,et al.  Fully automated breast boundary and pectoral muscle segmentation in mammograms , 2017, Artif. Intell. Medicine.

[16]  Lorenzo L. Pesce,et al.  Noise injection for training artificial neural networks: a comparison with weight decay and early stopping. , 2009, Medical physics.

[17]  K Doi,et al.  Automated segmentation of digitized mammograms. , 1995, Academic radiology.

[18]  Thomas C. Redman,et al.  Data Driven: Profiting from Your Most Important Business Asset , 2008 .

[19]  Saranyaraj D.,et al.  A deep convolutional neural network for the early detection of breast carcinoma with respect to hyper- parameter tuning , 2018, Multimedia Tools and Applications.

[20]  Madasu Hanmandlu,et al.  Segmentation Techniques for Breast Cancer Imaging Modalities-A Review , 2019, 2019 9th International Conference on Cloud Computing, Data Science & Engineering (Confluence).

[21]  Soumen Biswas,et al.  Blood Cell Detection Using Thresholding Estimation Based Watershed Transformation with Sobel Filter in Frequency Domain , 2016 .

[22]  Lipi B Mahanta,et al.  A New Breast Border Extraction and Contrast Enhancement Technique with Digital Mammogram Images for Improved Detection of Breast Cancer , 2018, Asian Pacific journal of cancer prevention : APJCP.

[23]  Mao-Jiun J. Wang,et al.  Image thresholding by minimizing the measures of fuzzines , 1995, Pattern Recognit..

[24]  Avinash Keskar,et al.  Robust Automatic Pectoral Muscle Segmentation from Mammograms Using Texture Gradient and Euclidean Distance Regression , 2016, Journal of Digital Imaging.

[25]  Stanley R. Sternberg,et al.  Biomedical Image Processing , 1983, Computer.

[26]  M. Yaffe,et al.  American Cancer Society Guidelines for Breast Screening with MRI as an Adjunct to Mammography , 2007, CA: a cancer journal for clinicians.

[27]  Samir Kumar Bandyopadhyay,et al.  Technique for preprocessing of digital mammogram , 2012, Comput. Methods Programs Biomed..

[28]  G. Johnson,et al.  Regionally Adaptive Histogram Equalization of the Chest , 1987, IEEE Transactions on Medical Imaging.

[29]  Quoc V. Le,et al.  Adding Gradient Noise Improves Learning for Very Deep Networks , 2015, ArXiv.

[30]  Ian W. Ricketts,et al.  The Mammographic Image Analysis Society digital mammogram database , 1994 .

[31]  R. Yager ON THE MEASURE OF FUZZINESS AND NEGATION Part I: Membership in the Unit Interval , 1979 .

[32]  Jim R. Parker,et al.  Algorithms for image processing and computer vision , 1996 .

[33]  Miguel Ángel Guevara-López,et al.  Representation learning for mammography mass lesion classification with convolutional neural networks , 2016, Comput. Methods Programs Biomed..

[34]  C. K. Lee,et al.  A mathematical morphological approach for segmenting heavily noise-corrupted images , 1996, Pattern Recognit..

[35]  Philip J. Morrow,et al.  Breast pectoral muscle segmentation in mammograms using a modified holistically-nested edge detection network , 2019, Medical Image Anal..

[36]  Hui Wang,et al.  A hierarchical pipeline for breast boundary segmentation and calcification detection in mammograms , 2018, Comput. Biol. Medicine.

[37]  Yianni Attikiouzel,et al.  Automatic pectoral muscle segmentation on mediolateral oblique view mammograms , 2004, IEEE Transactions on Medical Imaging.

[38]  Oleg V. Michailovich,et al.  On Segmentation of Pectoral Muscle in Digital Mammograms by Means of Deep Learning , 2020, IEEE Access.

[39]  Jing Li,et al.  SD-CNN: a Shallow-Deep CNN for Improved Breast Cancer Diagnosis , 2018, Comput. Medical Imaging Graph..

[40]  E MavroforakisMichael,et al.  A fully automated scheme for mammographic segmentation and classification based on breast density and asymmetry , 2011 .

[41]  Pankaj Kandhway,et al.  A novel reformed histogram equalization based medical image contrast enhancement using krill herd optimization , 2020, Biomed. Signal Process. Control..

[42]  Ghassan Hamarneh,et al.  Geometry-Based Pectoral Muscle Segmentation From MLO Mammogram Views , 2017, IEEE Transactions on Biomedical Engineering.

[43]  A. John,et al.  Performance of a single lookup table (LUT) for displaying chest CT images. , 2004, Academic radiology.

[44]  Basit Raza,et al.  Enhancing breast pectoral muscle segmentation performance by using skip connections in fully convolutional network , 2020, Int. J. Imaging Syst. Technol..

[45]  L. R. Dice Measures of the Amount of Ecologic Association Between Species , 1945 .

[46]  Ron Kimmel,et al.  Computational mammography using deep neural networks , 2018, Comput. methods Biomech. Biomed. Eng. Imaging Vis..

[47]  Dejan Drajić,et al.  An Algorithm for Detection of Electromagnetic Interference in High Frequency Radar Range-Doppler Images Caused by LEDs , 2019, IEEE Access.

[48]  Ali Gooya,et al.  Fully automatic detection of lung nodules in CT images using a hybrid feature set , 2017, Medical physics.

[49]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[50]  Nikos Dimitropoulos,et al.  A fully automated scheme for mammographic segmentation and classification based on breast density and asymmetry , 2011, Comput. Methods Programs Biomed..

[51]  P. Jaccard THE DISTRIBUTION OF THE FLORA IN THE ALPINE ZONE.1 , 1912 .

[52]  Christopher M. Bishop,et al.  Current address: Microsoft Research, , 2022 .

[53]  Mohamed Roushdy,et al.  Multi-view Convolutional Neural Network for lung nodule false positive reduction , 2020, Expert Syst. Appl..

[54]  Yu-Dong Yao,et al.  Breast Cancer Detection Using Extreme Learning Machine Based on Feature Fusion With CNN Deep Features , 2019, IEEE Access.

[55]  K. Vaidehi,et al.  Automatic Identification and Elimination of Pectoral Muscle in Digital Mammograms , 2013 .

[56]  T. Pavlidis,et al.  Fuzzy sets and their applications to cognitive and decision processes , 1977 .

[57]  Jing Wang,et al.  Shape-based Automatic Detection of Pectoral Muscle Boundary in Mammograms , 2015, Journal of Medical and Biological Engineering.

[58]  S Tangaro,et al.  Microcalcification detection in full-field digital mammograms: A fully automated computer-aided system. , 2019, Physica medica : PM : an international journal devoted to the applications of physics to medicine and biology : official journal of the Italian Association of Biomedical Physics.

[59]  Rodney Hull,et al.  Breast cancer in low-middle income countries: abnormality in splicing and lack of targeted treatment options. , 2020, American journal of cancer research.

[60]  Friedhelm Schwenker,et al.  Automatic pectoral muscle removal in mammograms , 2019, Evolving Systems.

[61]  T. Sørensen,et al.  A method of establishing group of equal amplitude in plant sociobiology based on similarity of species content and its application to analyses of the vegetation on Danish commons , 1948 .