D-Resunet: Resunet and Dilated Convolution for High Resolution Satellite Imagery Road Extraction

Reliably extracting information from satellite imagery is a difficult problem with many practical applications. One specific case of this problem is the task of automatically detecting roads. Road extraction from satellite images has been a hot research topic in the past decade. In this paper, we propose a semantic segmentation neural network, named D-ResUnet, which adopts U-Net structure, residual learning, and dilated convolutions for road area extraction. The network is built with ResUnet architecture and has dilated convolution layers in its center part. ResUnet architecture combines the strengths of residual units and feature concatenate, which help to ease training of networks and facilitate information propagation. Dilation convolution is a powerful tool that can enlarge the receptive field of feature points without reducing the resolution of the feature maps. We test our network and compare it with U-Net and ResUnet based road extraction methods. The proposed approach outperforms all the comparing methods, which demonstrates its superiority over recently developed state of the arts.

[1]  Xiaogang Wang,et al.  Pyramid Scene Parsing Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[3]  Qingjie Liu,et al.  Road Extraction by Deep Residual U-Net , 2017, IEEE Geoscience and Remote Sensing Letters.

[4]  Takayoshi Yamashita,et al.  Multiple Object Extraction from Aerial Imagery with Convolutional Neural Networks , 2016, IRIACV.

[5]  David J. DeWitt,et al.  RoadTracer: Automatic Extraction of Road Networks from Aerial Images , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[6]  Vladlen Koltun,et al.  Multi-Scale Context Aggregation by Dilated Convolutions , 2015, ICLR.

[7]  Chinnathevar Sujatha,et al.  Connected component-based technique for automatic extraction of road centerline in high resolution satellite images , 2015, EURASIP Journal on Image and Video Processing.

[8]  George Papandreou,et al.  Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation , 2018, ECCV.

[9]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).