CABNet: Category Attention Block for Imbalanced Diabetic Retinopathy Grading

Diabetic Retinopathy (DR) grading is challenging due to the presence of intra-class variations, small lesions and imbalanced data distributions. The key for solving fine-grained DR grading is to find more discriminative features corresponding to subtle visual differences, such as microaneurysms, hemorrhages and soft exudates. However, small lesions are quite difficult to identify using traditional convolutional neural networks (CNNs), and an imbalanced DR data distribution will cause the model to pay too much attention to DR grades with more samples, greatly affecting the final grading performance. In this article, we focus on developing an attention module to address these issues. Specifically, for imbalanced DR data distributions, we propose a novel Category Attention Block (CAB), which explores more discriminative region-wise features for each DR grade and treats each category equally. In order to capture more detailed small lesion information, we also propose the Global Attention Block (GAB), which can exploit detailed and class-agnostic global attention feature maps for fundus images. By aggregating the attention blocks with a backbone network, the CABNet is constructed for DR grading. The attention blocks can be applied to a wide range of backbone networks and trained efficiently in an end-to-end manner. Comprehensive experiments are conducted on three publicly available datasets, showing that CABNet produces significant performance improvements for existing state-of-the-art deep architectures with few additional parameters and achieves the state-of-the-art results for DR grading. Code and models will be available at https://github.com/he2016012996/CABnet.

[1]  Yi Yang,et al.  Attention to Scale: Scale-Aware Semantic Image Segmentation , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Hubert Cardot,et al.  From Patch to Image Segmentation using Fully Convolutional Networks - Application to Retinal Images , 2019, ArXiv.

[3]  Tao Mei,et al.  Look Closer to See Better: Recurrent Attention Convolutional Neural Network for Fine-Grained Image Recognition , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Wei Xu,et al.  Look and Think Twice: Capturing Top-Down Visual Attention with Feedback Convolutional Neural Networks , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[5]  Xiaochun Cao,et al.  Disc-Aware Ensemble Network for Glaucoma Screening From Fundus Image , 2018, IEEE Transactions on Medical Imaging.

[6]  Aïda Valls,et al.  A Deep Learning Interpretable Classifier for Diabetic Retinopathy Disease Grading , 2017, Neurocomputing.

[7]  Subhashini Venugopalan,et al.  Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs. , 2016, JAMA.

[8]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[9]  Abhishek Verma,et al.  New Deep Neural Nets for Fine-Grained Diabetic Retinopathy Recognition on Hybrid Color Space , 2016, 2016 IEEE International Symposium on Multimedia (ISM).

[10]  Xiaogang Li,et al.  Convolutional neural networks based transfer learning for diabetic retinopathy fundus image classification , 2017, 2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI).

[11]  J. Shaw,et al.  IDF Diabetes Atlas: Global estimates of diabetes prevalence for 2017 and projections for 2045. , 2018, Diabetes research and clinical practice.

[12]  Rishab Gargeya,et al.  Automated Identification of Diabetic Retinopathy Using Deep Learning. , 2017, Ophthalmology.

[13]  Shenghua Gao,et al.  Multi-Cell Multi-Task Convolutional Neural Networks for Diabetic Retinopathy Grading , 2018, 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[14]  Jianyuan Guo,et al.  GhostNet: More Features From Cheap Operations , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Stephen Lin,et al.  GCNet: Non-Local Networks Meet Squeeze-Excitation Networks and Beyond , 2019, 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW).

[16]  Yun Fu,et al.  Tell Me Where to Look: Guided Attention Inference Network , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[17]  Jacques Wainer,et al.  Beyond Lesion-Based Diabetic Retinopathy: A Direct Approach for Referral , 2017, IEEE Journal of Biomedical and Health Informatics.

[18]  In-So Kweon,et al.  CBAM: Convolutional Block Attention Module , 2018, ECCV.

[19]  Tien Yin Wong,et al.  Current Epidemiology of Diabetic Retinopathy and Diabetic Macular Edema , 2012, Current Diabetes Reports.

[20]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Tao Li,et al.  Lesion Detection and Grading of Diabetic Retinopathy via Two-Stages Deep Convolutional Neural Networks , 2017, MICCAI.

[22]  Laude,et al.  FEEDBACK ON A PUBLICLY DISTRIBUTED IMAGE DATABASE: THE MESSIDOR DATABASE , 2014 .

[23]  Farida Cheriet,et al.  Red Lesion Detection Using Dynamic Shape Features for Diabetic Retinopathy Screening , 2016, IEEE Transactions on Medical Imaging.

[24]  Xiaochun Cao,et al.  Joint Optic Disc and Cup Segmentation Based on Multi-Label Deep Network and Polar Transformation , 2018, IEEE Transactions on Medical Imaging.

[25]  François Chollet,et al.  Xception: Deep Learning with Depthwise Separable Convolutions , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Shaoguo Wen,et al.  Weakly Supervised Attention Pyramid Convolutional Neural Network for Fine-Grained Visual Classification , 2020, ArXiv.

[27]  Matthew D. Davis,et al.  Proposed international clinical diabetic retinopathy and diabetic macular edema disease severity scales. , 2003, Ophthalmology.

[28]  Wei Zhang,et al.  Automated identification and grading system of diabetic retinopathy using deep neural networks , 2019, Knowl. Based Syst..

[29]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  Yu Zhang,et al.  Attention-guided Chained Context Aggregation for Semantic Segmentation , 2020, Image Vis. Comput..

[31]  Ross B. Girshick,et al.  Mask R-CNN , 2017, 1703.06870.

[32]  Hyunjung Shim,et al.  Attention-Based Dropout Layer for Weakly Supervised Object Localization , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[33]  Bo Chen,et al.  MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications , 2017, ArXiv.

[34]  Anjan Gudigar,et al.  Deep convolution neural network for accurate diagnosis of glaucoma using digital fundus images , 2018, Inf. Sci..

[35]  Bian Wu,et al.  A Framework for Identifying Diabetic Retinopathy Based on Anti-noise Detection and Attention-Based Fusion , 2018, MICCAI.

[36]  Song Guo,et al.  Diagnostic assessment of deep learning algorithms for diabetic retinopathy screening , 2019, Inf. Sci..

[37]  Bram van Ginneken,et al.  Fast Convolutional Neural Network Training Using Selective Data Sampling: Application to Hemorrhage Detection in Color Fundus Images , 2016, IEEE Transactions on Medical Imaging.

[38]  Xiaogang Wang,et al.  Zoom-in-Net: Deep Mining Lesions for Diabetic Retinopathy Detection , 2017, MICCAI.

[39]  Enhua Wu,et al.  Squeeze-and-Excitation Networks , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[40]  Qiang Wu,et al.  Retinal Microaneurysm Detection Using Clinical Report Guided Multi-sieving CNN , 2017, MICCAI.

[41]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[42]  Pheng-Ann Heng,et al.  CANet: Cross-Disease Attention Network for Joint Diabetic Retinopathy and Diabetic Macular Edema Grading , 2019, IEEE Transactions on Medical Imaging.

[43]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[44]  Jianguo Zhang,et al.  CARE: Class Attention to Regions of Lesion for Classification on Imbalanced Data , 2019, MIDL.