A Novel Transparency Strategy-based Data Augmentation Approach for BI-RADS Classification of Mammograms

Image augmentation techniques have been widely investigated as a way to improve the performance of deep learning (DL) algorithms on mammography classification tasks, and recent methods have demonstrated their effectiveness in addressing data scarcity and class imbalance. In this paper, we propose a novel transparency strategy to improve the performance of mammogram classifiers in predicting Breast Imaging Reporting and Data System (BI-RADS) scores. The proposed approach uses Region of Interest (ROI) information to generate additional training examples for the high-risk categories (BI-RADS 3, 4, and 5) from the original images. Our extensive experiments on three different datasets show that the proposed approach significantly improves mammogram classification performance and surpasses CutMix, a state-of-the-art data augmentation technique. The study also indicates that the transparency method is more effective than the other augmentation strategies evaluated for BI-RADS classification and could be applied to other computer vision tasks.
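
The abstract does not spell out how the transparency strategy combines images, so the following is only a minimal sketch under stated assumptions: the ROI is an axis-aligned box, the two images have the same shape, and "transparency" means alpha-blending the ROI of a high-risk image into another image at the same coordinates. The function name `blend_roi`, the `box` layout, and the `alpha` parameter are hypothetical and not taken from the paper.

```python
import numpy as np


def blend_roi(src, dst, box, alpha=0.5):
    """Alpha-blend the ROI of `src` into a copy of `dst` (hypothetical helper).

    src, dst : 2-D grayscale arrays of identical shape (assumption).
    box      : (x0, y0, x1, y1) ROI in pixel coordinates, end-exclusive (assumption).
    alpha    : opacity of the pasted ROI; 1.0 would be a hard, CutMix-style paste.
    """
    if src.shape != dst.shape:
        raise ValueError("src and dst must have the same shape")
    x0, y0, x1, y1 = box
    # astype() returns a new array, so the caller's `dst` is left untouched.
    out = dst.astype(np.float32)
    roi = src[y0:y1, x0:x1].astype(np.float32)
    # Convex combination inside the box; pixels outside the box keep dst's values.
    out[y0:y1, x0:x1] = alpha * roi + (1.0 - alpha) * out[y0:y1, x0:x1]
    return out.astype(dst.dtype)


# Toy usage: a uniform "high-risk" image blended into a uniform background.
high_risk = np.full((4, 4), 200, dtype=np.uint8)
background = np.full((4, 4), 100, dtype=np.uint8)
augmented = blend_roi(high_risk, background, box=(1, 1, 3, 3), alpha=0.5)
```

How the augmented image is labelled (for example, whether it inherits the BI-RADS category of the ROI's source image) is not stated in the abstract and is left to the caller in this sketch.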
