Attention W-Net: Improved Skip Connections for better Representations

Segmentation of macro and microvascular structures in fundoscopic retinal images plays a crucial role in detection of multiple retinal and systemic diseases, yet it is a difficult problem to solve. Most deep learning approaches for this task involve an autoencoder based architecture, but they face several issues such as lack of enough parameters, overfitting when there are enough parameters and incompatibility between internal feature-spaces. Due to such issues, these techniques are hence not able to extract the best semantic information from the limited data present for such tasks. We propose Attention W-Net, a new U-Net based architecture for retinal vessel segmentation to address these problems. In this architecture with a LadderNet backbone, we have two main contributions: Attention Block and regularisation measures. Our Attention Block uses decoder features to attend over the encoder features from skip-connections during upsampling, resulting in higher compatibility when the encoder and decoder features are added. Our regularisation measures include image augmentation and modifications to the ResNet Block used, which prevent overfitting. With these additions, we observe an AUC and F1-Score of 0.8407 and 0.9833 — a sizeable improvement over its LadderNet backbone as well as competitive performance among the contemporary state-of-the-art methods.

[1]  Tao Shen,et al.  DiSAN: Directional Self-Attention Network for RNN/CNN-free Language Understanding , 2017, AAAI.

[2]  A Hoover,et al.  Locating blood vessels in retinal images by piece-wise threshold probing of a matched filter response , 1998, AMIA.

[3]  Loïc Le Folgoc,et al.  Attention U-Net: Learning Where to Look for the Pancreas , 2018, ArXiv.

[4]  Hanchao Yu,et al.  Study Group Learning: Improving Retinal Vessel Segmentation Trained with Noisy Labels , 2021, MICCAI.

[5]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Bunyarit Uyyanonvara,et al.  An Ensemble Classification-Based Approach Applied to Retinal Blood Vessel Segmentation , 2012, IEEE Transactions on Biomedical Engineering.

[7]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Yuta Nakashima,et al.  IterNet: Retinal Image Segmentation Utilizing Structural Redundancy in Vessel Networks , 2019, 2020 IEEE Winter Conference on Applications of Computer Vision (WACV).

[9]  Max A. Viergever,et al.  Ridge-based vessel segmentation in color images of the retina , 2004, IEEE Transactions on Medical Imaging.

[10]  Iasonas Kokkinos,et al.  DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Yugen Yi,et al.  SA-UNet: Spatial Attention U-Net for Retinal Vessel Segmentation , 2020, 2020 25th International Conference on Pattern Recognition (ICPR).

[12]  Enes Sadi Uysal,et al.  Exploring The Limits Of Data Augmentation For Retinal Vessel Segmentation , 2021, ArXiv.

[13]  Tuan D. Pham,et al.  DUNet: A deformable network for retinal vessel segmentation , 2018, Knowl. Based Syst..

[14]  A.D. Hoover,et al.  Locating blood vessels in retinal images by piecewise threshold probing of a matched filter response , 2000, IEEE Transactions on Medical Imaging.

[15]  Shihao Zhang,et al.  Attention Guided Network for Retinal Image Segmentation , 2019, MICCAI.

[16]  Alireza Tavakkoli,et al.  RV-GAN : Retinal Vessel Segmentation from Fundus Images using Multi-scale Generative Adversarial Networks , 2021, ArXiv.

[17]  Juntang Zhuang,et al.  LadderNet: Multi-path networks based on U-Net for medical image segmentation , 2018, ArXiv.

[18]  Vijayan K. Asari,et al.  Recurrent Residual Convolutional Neural Network based on U-Net (R2U-Net) for Medical Image Segmentation , 2018, ArXiv.

[19]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[20]  Lei Zhang,et al.  Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.