Atrous Convolutional Neural Network (ACNN) for Biomedical Semantic Segmentation with Dimensionally Lossless Feature Maps

Deep Convolutional Neural Networks (DCNNs) are showing impressive performances in biomedical semantic segmentation. However, current DCNNs usually use down-sampling layers to achieve significant receptive field increasing and to gain abstract semantic information. These down-sampling layers decrease the spatial dimension of feature maps as well, which is harmful for semantic segmentation. Atrous convolution is an alternative for the down-sampling layer. It could increase the receptive field significantly but also maintain the spatial dimension of feature maps. In this paper, firstly, an atrous rate setting is proposed to achieve the largest and fully-covered receptive field with a minimum number of layers. Secondly, six atrous blocks, three shortcut connections and four normalization methods are explored to select the optimal atrous block, shortcut connection and normalization method. Finally, a new and dimensionally lossless DCNN - Atrous Convolutional Neural Network (ACNN) is proposed with using cascaded atrous II-blocks, residual learning and Fine Group Normalization (FGN). The Right Ventricle (RV), Left Ventricle (LV) and aorta data are used for the validation. The results show that the proposed ACNN achieves comparable segmentation Dice Similarity Coefficients (DSCs) with U-Net, optimized U-Net and the hybrid network, but uses much less parameters. This advantage is considered to benefit from the dimensionally lossless feature maps.

[1]  Guang-Zhong Yang,et al.  Normalization in Training Deep Convolutional Neural Networks for 2D Bio-medical Semantic Segmentation , 2018, ArXiv.

[2]  Vladlen Koltun,et al.  Multi-Scale Context Aggregation by Dilated Convolutions , 2015, ICLR.

[3]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Iasonas Kokkinos,et al.  DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Kaiming He,et al.  Group Normalization , 2018, ECCV.

[6]  Tobias Gass,et al.  Cloud-Based Evaluation of Anatomical Structure Segmentation and Landmark Detection Algorithms: VISCERAL Anatomy Benchmarks , 2016, IEEE Transactions on Medical Imaging.

[7]  Ross B. Girshick,et al.  Focal Loss for Dense Object Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Su-Lin Lee,et al.  Towards Automatic 3D Shape Instantiation for Deployed Stent Grafts: 2D Multiple-class and Class-imbalance Marker Segmentation with Equally-weighted Focal U-Net , 2017, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[9]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[10]  Su-Lin Lee,et al.  A real‐time and registration‐free framework for dynamic shape instantiation , 2017, Medical Image Anal..

[11]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[12]  Andrea Vedaldi,et al.  Instance Normalization: The Missing Ingredient for Fast Stylization , 2016, ArXiv.

[13]  Roberto Cipolla,et al.  SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Raquel Urtasun,et al.  Understanding the Effective Receptive Field in Deep Convolutional Neural Networks , 2016, NIPS.

[15]  Vivienne Sze,et al.  Efficient Processing of Deep Neural Networks: A Tutorial and Survey , 2017, Proceedings of the IEEE.

[16]  Su-Lin Lee,et al.  Real-Time 3-D Shape Instantiation From Single Fluoroscopy Projection for Fenestrated Stent Graft Deployment , 2017, IEEE Robotics and Automation Letters.

[17]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[18]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[19]  Geoffrey E. Hinton,et al.  Layer Normalization , 2016, ArXiv.

[20]  George Papandreou,et al.  Rethinking Atrous Convolution for Semantic Image Segmentation , 2017, ArXiv.

[21]  Jian Sun,et al.  Identity Mappings in Deep Residual Networks , 2016, ECCV.

[22]  Seyed-Ahmad Ahmadi,et al.  V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation , 2016, 2016 Fourth International Conference on 3D Vision (3DV).

[23]  Garrison W. Cottrell,et al.  Understanding Convolution for Semantic Segmentation , 2017, 2018 IEEE Winter Conference on Applications of Computer Vision (WACV).

[24]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[26]  Bastian Leibe,et al.  Full-Resolution Residual Networks for Semantic Segmentation in Street Scenes , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Shuicheng Yan,et al.  Dual Path Networks , 2017, NIPS.