A Deep Learning Method for Breast Cancer Classification in the Pathology Images

Objective: Breast cancer is the most common female cancer in the world, and it poses a huge threat to women's health. There is currently promising research concerning its early diagnosis using deep learning methodologies. However, some commonly used Convolutional Neural Network (CNN) and their variations, such as AlexNet, VGGNet, GoogleNet and so on, are prone to overfitting in breast cancer classification, due to both small-scale breast pathology image datasets and overconfident softmax-cross-entropy loss. To alleviate the overfitting issue for better classification accuracy, we propose a novel framework for breast pathology classification, called the AlexNet-BC model. The model is pre-trained using the ImageNet dataset and fine-tuned using an augmented dataset. We also devise an improved cross-entropy loss function to penalize overconfident low-entropy output distributions and make the predictions suitable for uniform distributions. The proposed approach is then validated through a series of comparative experiments on BreaKHis, IDC and UCSB datasets. The experimental results show that the proposed method outperforms the state-of-the-art methods at different magnifications. Its strong robustness and generalization capabilities make it suitable for histopathology clinical computer-aided diagnosis systems.

[1]  A. Jemal,et al.  Cancer statistics for African American/Black People 2022 , 2022, CA: a cancer journal for clinicians.

[2]  Mengchu Zhou,et al.  Dynamic Embedding Projection-Gated Convolutional Neural Networks for Text Classification , 2021, IEEE Transactions on Neural Networks and Learning Systems.

[3]  Abdullah Abusorrah,et al.  Effective Visual Domain Adaptation via Generative Adversarial Distribution Matching , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[4]  Zhe Xu,et al.  Tumor detection for whole slide image of liver based on patch-based convolutional neural network , 2020, Multimedia Tools and Applications.

[5]  Yi Xu,et al.  Towards Understanding Label Smoothing , 2020, ArXiv.

[6]  Ruchika Gupta,et al.  Whole Slide Imaging (WSI) in Pathology: Current Perspectives and Future Directions , 2020, Journal of Digital Imaging.

[7]  Maria Trocan,et al.  Benign and Malignant Skin Lesion Classification Comparison for Three Deep-Learning Architectures , 2020, ACIIDS.

[8]  Aditya Krishna Menon,et al.  Does label smoothing mitigate label noise? , 2020, ICML.

[9]  Zihao Liu,et al.  Soft-shell Shrimp Recognition Based on an Improved AlexNet for Quality Evaluations , 2020 .

[10]  Visar Berisha,et al.  Regularization via Structural Label Smoothing , 2020, AISTATS.

[11]  Neofytos Dimitriou,et al.  Deep Learning for Whole Slide Image Analysis: An Overview , 2019, Front. Med..

[12]  Kevin A. Schneider,et al.  Breast Cancer Diagnosis with Transfer Learning and Global Pooling , 2019, 2019 International Conference on Information and Communication Technology Convergence (ICTC).

[13]  Taghi M. Khoshgoftaar,et al.  A survey on Image Data Augmentation for Deep Learning , 2019, Journal of Big Data.

[14]  Geoffrey E. Hinton,et al.  When Does Label Smoothing Help? , 2019, NeurIPS.

[15]  Peter König,et al.  Further Advantages of Data Augmentation on Convolutional Neural Networks , 2018, ICANN.

[16]  MengChu Zhou,et al.  A Distance-Based Weighted Undersampling Scheme for Support Vector Machines and its Application to Imbalanced Classification , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[17]  Zidong Wang,et al.  Machine Learning with Applications in Breast Cancer Diagnosis and Prognosis , 2018 .

[18]  Andreas K. Maier,et al.  Classification of breast cancer histology images using transfer learning , 2018, ICIAR.

[19]  Yinan Kong,et al.  Involvement of Machine Learning for Breast Cancer Image Classification: A Survey , 2017, Comput. Math. Methods Medicine.

[20]  MengChu Zhou,et al.  A Noise-Filtered Under-Sampling Scheme for Imbalanced Classification , 2017, IEEE Transactions on Cybernetics.

[21]  Luis Perez,et al.  The Effectiveness of Data Augmentation in Image Classification using Deep Learning , 2017, ArXiv.

[22]  Zhenghao Chen,et al.  Using Transfer Learning with Convolutional Neural Networks to Diagnose Breast Cancer from Histopathological Images , 2017, ICONIP.

[23]  Daisuke Komura,et al.  Machine Learning Methods for Histopathological Image Analysis , 2017, Computational and structural biotechnology journal.

[24]  Yilong Yin,et al.  Deep learning model based breast cancer histopathological image classification , 2017, 2017 IEEE 2nd International Conference on Cloud Computing and Big Data Analysis (ICCCBDA).

[25]  Geoffrey E. Hinton,et al.  Regularizing Neural Networks by Penalizing Confident Output Distributions , 2017, ICLR.

[26]  Luiz Eduardo Soares de Oliveira,et al.  Breast cancer histopathological image classification using Convolutional Neural Networks , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[27]  Luiz Eduardo Soares de Oliveira,et al.  A Dataset for Breast Cancer Histopathological Image Classification , 2016, IEEE Transactions on Biomedical Engineering.

[28]  Mariana Belgiu,et al.  Random forest in remote sensing: A review of applications and future directions , 2016 .

[29]  Nima Tajbakhsh,et al.  Convolutional Neural Networks for Medical Image Analysis: Full Training or Fine Tuning? , 2016, IEEE Transactions on Medical Imaging.

[30]  Yoichiro Yamamoto,et al.  A novel method for morphological pleomorphism and heterogeneity quantitative measurement: Named cell feature level co-occurrence matrix , 2016, Journal of pathology informatics.

[31]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[32]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[33]  Navid Farahani,et al.  whole slide imaging in pathology: advantages, limitations, and emerging perspectives , 2015 .

[34]  Joel H. Saltz,et al.  Patch-Based Convolutional Neural Network for Whole Slide Tissue Image Classification , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[35]  Ying LU,et al.  Decision tree methods: applications for classification and prediction , 2015, Shanghai archives of psychiatry.

[36]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[37]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[38]  Fabio A. González,et al.  Automatic detection of invasive ductal carcinoma in whole slide images with convolutional neural networks , 2014, Medical Imaging.

[39]  Aditya Bhaskara,et al.  Provable Bounds for Learning Some Deep Representations , 2013, ICML.

[40]  Marek Kowal,et al.  Computer-aided diagnosis of breast cancer based on fine needle biopsy microscopic images , 2013, Comput. Biol. Medicine.

[41]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[42]  B. S. Manjunath,et al.  A biosegmentation benchmark for evaluation of bioimage analysis methods , 2009, BMC Bioinformatics.

[43]  Ledan Qian,et al.  Sequence-Dropout Block for Reducing Overfitting Problem in Image Classification , 2020, IEEE Access.

[44]  Yahong Luo,et al.  A deep learning method for classifying mammographic breast density categories , 2018, Medical physics.

[45]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[46]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..