Structurally Guided Channel Attention Networks: SGCA-Net

In this paper, we propose Structurally Guided Channel Attention Networks (SGCA-Net), a principled way to guide the channel attention of CNNs. Convolution operator constructs features maps by using both channel and spatial information within the receptive fields of its filters. Prior research has investigated the impact of strengthening the representational power of CNNs using channel attention modules. In this work, we guide the channel attention of networks using feature vectors that contain clinically relevant information. We do so by attaching guided attention modules into a state-of-the-art network architecture, and guiding these attention modules with domain knowledge using feature vectors. Experiments on a dataset of 5512 posterior retinal images, taken using a wide angle fundus camera, show that SGCA-Net achieves 0.983 and 0.948 AUC to predict plus and normal categories, respectively. SGCA-Net achieves higher performance than CNNs without attention modules and CNNs with unguided attention modules.

[1]  Sang Jun Park,et al.  Classification of Findings with Localized Lesions in Fundoscopic Images Using a Regionally Guided CNN , 2018, COMPAY/OMIA@MICCAI.

[2]  James M. Brown,et al.  Automated Diagnosis of Plus Disease in Retinopathy of Prematurity Using Deep Convolutional Neural Networks , 2018, JAMA ophthalmology.

[3]  In-So Kweon,et al.  CBAM: Convolutional Block Attention Module , 2018, ECCV.

[4]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Mohammad Riazi Esfahani,et al.  Retinopathy of Prematurity-assist: Novel Software for Detecting Plus Disease , 2017, Korean journal of ophthalmology : KJO.

[6]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[7]  Anna L. Ells,et al.  The International Classification of Retinopathy of Prematurity revisited. , 2005, Archives of ophthalmology.

[8]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[9]  Sicheng Zhao,et al.  Attention-Aware Polarity Sensitive Embedding for Affective Image Retrieval , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[10]  Heechan Yang,et al.  Guided Soft Attention Network for Classification of Breast Cancer Histopathology Images , 2020, IEEE Transactions on Medical Imaging.

[11]  Yi Yang,et al.  Diagnose like a Radiologist: Attention Guided Convolutional Neural Network for Thorax Disease Classification , 2018, ArXiv.

[12]  Sebastian Thrun,et al.  Dermatologist-level classification of skin cancer with deep neural networks , 2017, Nature.

[13]  Julien Jomier,et al.  Computer-automated quantification of plus disease in retinopathy of prematurity. , 2003, Journal of AAPOS : the official publication of the American Association for Pediatric Ophthalmology and Strabismus.

[14]  Michael F. Chiang,et al.  Development and Evaluation of Reference Standards for Image-based Telemedicine Diagnosis and Clinical Research Studies in Ophthalmology , 2014, AMIA.

[15]  Stratis Ioannidis,et al.  Plus Disease in Retinopathy of Prematurity: Convolutional Neural Network Performance Using a Combined Neural Network and Feature Extraction Approach , 2020, Translational vision science & technology.

[16]  Deniz Erdoğmuş,et al.  Computer-Based Image Analysis for Plus Disease Diagnosis in Retinopathy of Prematurity: Performance of the "i-ROP" System and Image Features Associated With Expert Diagnosis. , 2015, Translational vision science & technology.

[17]  C. Gilbert,et al.  Childhood blindness in the context of VISION 2020--the right to sight. , 2001, Bulletin of the World Health Organization.

[18]  Xiaogang Wang,et al.  Residual Attention Network for Image Classification , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[19]  Joel H. Saltz,et al.  Patch-Based Convolutional Neural Network for Whole Slide Tissue Image Classification , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[21]  Gabriel J. Brostow,et al.  Automated Retinopathy of Prematurity Case Detection with Convolutional Neural Networks , 2016, LABELS/DLMIA@MICCAI.

[22]  Andrew Y. Ng,et al.  CheXNet: Radiologist-Level Pneumonia Detection on Chest X-Rays with Deep Learning , 2017, ArXiv.

[23]  G. Quinn,et al.  Characteristics of Infants With Severe Retinopathy of Prematurity in Countries With Low, Moderate, and High Levels of Development: Implications for Screening Programs , 2005, Pediatrics.

[24]  Stratis Ioannidis,et al.  Fully automated disease severity assessment and treatment monitoring in retinopathy of prematurity using deep learning , 2018, Medical Imaging.

[25]  Enhua Wu,et al.  Squeeze-and-Excitation Networks , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Stratis Ioannidis,et al.  Structural Visual Guidance Attention Networks In Retinopathy Of Prematurity , 2021, 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI).

[27]  Yun Fu,et al.  Tell Me Where to Look: Guided Attention Inference Network , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.