Semantic Image Segmentation With Propagating Deep Aggregation

In this article, we propose a new architecture of feature aggregation, which is designed to deal with the problem that the information of each convolutional layer cannot be used reasonably and the shallow layer information is lost in the process of transmission. In this method, propagating deep aggregation (PDA) is transplanted into the DeepLab-ASPP model to generate a new model combining structural continuity with feature aggregation. This new model can be divided into several stages according to the output resolution. In these stages, more possible feature fusion combinations can be realized, and the result from aggregation propagated among stages can be optimized and updated to get the best results. We demonstrate the effectiveness of the proposed model on the PASCAL VOC 2012 data set and the PASCAL-Context data set. Our method achieves state-of-the-art performance on two public benchmarks and significantly outperforms the previous results, 78.8% (versus 62.4%) on the PASCAL VOC 2012 data set and 46.2% (versus 37.8%) on the PASCAL-Context data set.

[1]  Satish Kumar Injeti,et al.  Optimal multilevel thresholding selection for brain MRI image segmentation based on adaptive wind driven optimization , 2018, Measurement.

[2]  Ayed Alwadain,et al.  Computer-aided cancer classification system using a hybrid level-set image segmentation , 2019 .

[3]  Ronan Collobert,et al.  Recurrent Convolutional Neural Networks for Scene Parsing , 2013, ArXiv.

[4]  Sanja Fidler,et al.  The Role of Context for Object Detection and Semantic Segmentation in the Wild , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Ronan Collobert,et al.  Recurrent Convolutional Neural Networks for Scene Labeling , 2014, ICML.

[6]  Ron Kikinis,et al.  Improved watershed transform for medical image segmentation using prior information , 2004, IEEE Transactions on Medical Imaging.

[7]  Yanhui Guo,et al.  An effective color image segmentation approach using neutrosophic adaptive mean shift clustering , 2018 .

[8]  Gregory Shakhnarovich,et al.  Feedforward semantic segmentation with zoom-out features , 2014, CVPR.

[9]  Xiaogang Wang,et al.  Pyramid Scene Parsing Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Luc Van Gool,et al.  The Pascal Visual Object Classes Challenge: A Retrospective , 2014, International Journal of Computer Vision.

[11]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[12]  Kaiming He,et al.  Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Wei Liu,et al.  ParseNet: Looking Wider to See Better , 2015, ArXiv.

[14]  Yunchao Wei,et al.  Self-explanatory Deep Salient Object Detection , 2017, ArXiv.

[15]  Rob Fergus,et al.  Predicting Depth, Surface Normals and Semantic Labels with a Common Multi-scale Convolutional Architecture , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[16]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[17]  Zhaohui Tang,et al.  A watershed segmentation algorithm based on an optimal marker for bubble size measurement , 2019, Measurement.

[18]  Nikhil Ketkar,et al.  Convolutional Neural Networks , 2021, Deep Learning with Python.

[19]  Iasonas Kokkinos,et al.  Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs , 2014, ICLR.

[20]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Sinisa Todorovic,et al.  A Multi-scale CNN for Affordance Segmentation in RGB Images , 2016, ECCV.

[22]  Song Wang,et al.  Degraded Image Semantic Segmentation With Dense-Gram Networks , 2020, IEEE Transactions on Image Processing.

[23]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[24]  Vladlen Koltun,et al.  Multi-Scale Context Aggregation by Dilated Convolutions , 2015, ICLR.

[25]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[26]  Yingli Tian,et al.  Coarse-to-Fine Semantic Segmentation From Image-Level Labels , 2018, IEEE Transactions on Image Processing.

[27]  Bolei Zhou,et al.  Semantic Understanding of Scenes Through the ADE20K Dataset , 2016, International Journal of Computer Vision.

[28]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[29]  Yao Zhao,et al.  Object Region Mining with Adversarial Erasing: A Simple Classification to Semantic Segmentation Approach , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  Daniel Maturana,et al.  Multi-Scale Convolutional Architecture for Semantic Segmentation , 2015 .

[31]  Jian Sun,et al.  Identity Mappings in Deep Residual Networks , 2016, ECCV.

[32]  Zhuowen Tu,et al.  Aggregated Residual Transformations for Deep Neural Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[33]  Trevor Darrell,et al.  Deep Layer Aggregation , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[34]  Hao Gao,et al.  Multilevel Thresholding for Image Segmentation Through an Improved Quantum-Behaved Particle Swarm Algorithm , 2010, IEEE Transactions on Instrumentation and Measurement.

[35]  Sanja Fidler,et al.  Detect What You Can: Detecting and Representing Objects Using Holistic Models and Body Parts , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[36]  Iasonas Kokkinos,et al.  DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[37]  Ning Zhou,et al.  Multiscale fully convolutional network with application to industrial inspection , 2016, 2016 IEEE Winter Conference on Applications of Computer Vision (WACV).