论文信息 - Road Segmentation for Remote Sensing Images Using Adversarial Spatial Pyramid Networks

Road Segmentation for Remote Sensing Images Using Adversarial Spatial Pyramid Networks

Road extraction in remote sensing images is of great importance for a wide range of applications. Because of the complex background, and high density, most of the existing methods fail to accurately extract a road network that appears correct and complete. Moreover, they suffer from either insufficient training data or high costs of manual annotation. To address these problems, we introduce a new model to apply structured domain adaption for synthetic image generation and road segmentation. We incorporate a feature pyramid network into generative adversarial networks to minimize the difference between the source and target domains. A generator is learned to produce quality synthetic images, and the discriminator attempts to distinguish them. We also propose a feature pyramid network that improves the performance of the proposed model by extracting effective features from all the layers of the network for describing different scales objects. Indeed, a novel scale-wise architecture is introduced to learn from the multi-level feature maps and improve the semantics of the features. For optimization, the model is trained by a joint reconstruction loss function, which minimizes the difference between the fake images and the real ones. A wide range of experiments on three datasets prove the superior performance of the proposed approach in terms of accuracy and efficiency. In particular, our model achieves state-of-the-art 78.86 IOU on the Massachusetts dataset with 14.89M parameters and 86.78B FLOPs, with 4x fewer FLOPs but higher accuracy (+3.47% IOU) than the top performer among state-of-the-art approaches used in the evaluation.

[1] Seyed Majid Azimi,et al. Road Segmentation in SAR Satellite Images With Deep Fully Convolutional Neural Networks , 2018, IEEE Geoscience and Remote Sensing Letters.

[2] Dario Augusto Borges Oliveira,et al. Synthesis of Multispectral Optical Images From SAR/Optical Multitemporal Data Using Conditional Generative Adversarial Networks , 2019, IEEE Geoscience and Remote Sensing Letters.

[3] Marina Meila,et al. Comparing clusterings: an axiomatic view , 2005, ICML.

[4] Wei Sun,et al. Learning Spatial Pyramid Attentive Pooling in Image Synthesis and Image-to-Image Translation , 2019, ArXiv.

[5] Lianfa Bai,et al. Residual Pyramid Learning for Single-Shot Semantic Segmentation , 2020, IEEE Transactions on Intelligent Transportation Systems.

[6] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7] Kai Zhang,et al. Simultaneous Road Surface and Centerline Extraction From Large-Scale Remote Sensing Images Using CNN-Based Segmentation and Tracing , 2020, IEEE Transactions on Geoscience and Remote Sensing.

[8] François Laviolette,et al. Domain-Adversarial Training of Neural Networks , 2015, J. Mach. Learn. Res..

[9] Martial Hebert,et al. Toward Objective Evaluation of Image Segmentation Algorithms , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10] Tomas Pfister,et al. Learning from Simulated and Unsupervised Images through Adversarial Training , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11] Nicholas D. Lane,et al. FlexAdapt: Flexible Cycle-Consistent Adversarial Domain Adaptation , 2019, 2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA).

[12] Dazhuan Xu,et al. Sig-NMS-Based Faster R-CNN Combining Transfer Learning for Small Target Detection in VHR Optical Remote Sensing Imagery , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[13] Pourya Shamsolmoali,et al. A Novel Deep Structure U-Net for Sea-Land Segmentation in Remote Sensing Images , 2019, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[14] Xuelong Li,et al. Weakly Supervised Adversarial Domain Adaptation for Semantic Segmentation in Urban Scenes , 2019, IEEE Transactions on Image Processing.

[15] Sam Kwong,et al. Nested Network With Two-Stream Pyramid for Salient Object Detection in Optical Remote Sensing Images , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[16] Jitendra Malik,et al. A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[17] Xuelong Li,et al. Spectral Embedded Adaptive Neighbors Clustering , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[18] Jun Rao,et al. Road Segmentation Based on Hybrid Convolutional Network for High-Resolution Visible Remote Sensing Image , 2019, IEEE Geoscience and Remote Sensing Letters.

[19] Bo Zhao,et al. Satellite Image Spoofing: Creating Remote Sensing Dataset with Generative Adversarial Networks (Short Paper) , 2018, GIScience.

[20] Xavier Cufí,et al. Yet Another Survey on Image Segmentation: Region and Boundary Information Integration , 2002, ECCV.

[21] Bo Huang,et al. Unified fusion of remote-sensing imagery: generating simultaneously high-resolution synthetic spatial–temporal–spectral earth observations , 2013 .

[22] Richard Lepage,et al. Road Extraction From Very High Resolution Remote Sensing Optical Images Based on Texture Analysis and Beamlet Transform , 2016, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[23] Jian Sun,et al. Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24] Enhua Wu,et al. Squeeze-and-Excitation Networks , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25] Sepp Hochreiter,et al. GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium , 2017, NIPS.

[26] Ting Mao,et al. Unsupervised Classification of Multispectral Images Embedded With a Segmentation of Panchromatic Images Using Localized Clusters , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[27] Jonathon Shlens,et al. Conditional Image Synthesis with Auxiliary Classifier GANs , 2016, ICML.

[28] Taesung Park,et al. CyCADA: Cycle-Consistent Adversarial Domain Adaptation , 2017, ICML.

[29] Shiming Xiang,et al. Automatic Road Detection and Centerline Extraction via Cascaded End-to-End Convolutional Neural Network , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[30] Xiangtao Zheng,et al. Remote Sensing Scene Classification by Unsupervised Representation Learning , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[31] Kaiming He,et al. Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[32] Mingwei Sun,et al. Pan-Sharpening Using an Efficient Bidirectional Pyramid Network , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[33] Han Zhang,et al. Improving GANs Using Optimal Transport , 2018, ICLR.

[34] Jonathan Cheung-Wai Chan,et al. Improved Classification of VHR Images of Urban Areas Using Directional Morphological Profiles , 2008, IEEE Transactions on Geoscience and Remote Sensing.

[35] Jing Wang,et al. Classifier-Constrained Deep Adversarial Domain Adaptation for Cross-Domain Semisupervised Classification in Remote Sensing Images , 2020, IEEE Geoscience and Remote Sensing Letters.

[36] Jimeng Sun,et al. Generating Multi-label Discrete Patient Records using Generative Adversarial Networks , 2017, MLHC.

[37] Pourya Shamsolmoali,et al. G-GANISR: Gradual generative adversarial network for image super resolution , 2019, Neurocomputing.

[38] Lu Yang,et al. Semantic Segmentation for High Spatial Resolution Remote Sensing Images Based on Convolution Neural Network and Pyramid Pooling Module , 2018, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[39] 拓海杉山,et al. “Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks”の学習報告 , 2017 .

[40] Jian Yao,et al. RoadNet: Learning to Comprehensively Analyze Road Networks in Complex Urban Scenes From High-Resolution Remotely Sensed Images , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[41] Ying Chen,et al. M2Det: A Single-Shot Object Detector based on Multi-Level Feature Pyramid Network , 2018, AAAI.

[42] Lei He,et al. Road Segmentation of Unmanned Aerial Vehicle Remote Sensing Images Using Adversarial Network With Multiscale Context Aggregation , 2019, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[43] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[44] Andrew Zisserman,et al. Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[45] Sebastian Ramos,et al. The Cityscapes Dataset for Semantic Urban Scene Understanding , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[46] Ling Shao,et al. Efficient Featurized Image Pyramid Network for Single Shot Detector , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[47] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[48] Ming Yang,et al. Conditional Generative Adversarial Network for Structured Domain Adaptation , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.