Cascaded panoptic segmentation method for high resolution remote sensing image

Abstract Deep Convolutional Neural Networks (DCNNs) have driven great progress in remote sensing image segmentation. However, repeated convolutions significantly reduce feature resolution and discard key information, which lowers the accuracy of per-pixel category prediction. DCNNs also accumulate context information over a large receptive field, which blurs the segmentation of object boundaries. This paper proposes a cascaded panoptic segmentation network to address these problems. First, a shared feature pyramid network backbone and a new hybrid task cascade framework are designed; they share features and integrate the complementary features of different tasks at different stages, so that rich context information can be extracted. Second, a functional module is designed to learn the mask quality of the instances predicted by Mask R-CNN and to calibrate the inconsistency between mask quality and mask score, which helps handle changes in object scale. Finally, a new visual-saliency ranking module is designed to resolve mutual occlusion between prediction results and to strengthen robustness to illumination. Experimental results show that the method retains significant advantages even over state-of-the-art methods, and ablation experiments verify the effectiveness of the designed strategies.
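The last two steps can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no formulas, so the calibration below assumes the Mask Scoring R-CNN convention (classification score multiplied by a predicted mask IoU), and the learned visual-saliency ranking module is replaced by a plain sort on that calibrated score. The function names `calibrated_scores` and `merge_by_rank` are hypothetical.

```python
import numpy as np


def calibrated_scores(cls_scores, pred_mask_ious):
    """Assumed calibration: classification score times predicted mask IoU.

    This follows the Mask Scoring R-CNN convention; the paper's own module
    may combine the two quantities differently.
    """
    cls_scores = np.asarray(cls_scores, dtype=float)
    pred_mask_ious = np.asarray(pred_mask_ious, dtype=float)
    return cls_scores * pred_mask_ious


def merge_by_rank(masks, rank_scores):
    """Resolve overlaps by pasting instance masks in descending rank order.

    masks:       (N, H, W) boolean array, one binary mask per instance.
    rank_scores: (N,) ranking values; here a stand-in for the output of a
                 learned ranking module.
    Returns an (H, W) integer map where 0 means unassigned and i + 1 marks
    pixels owned by instance i.
    """
    masks = np.asarray(masks, dtype=bool)
    canvas = np.zeros(masks.shape[1:], dtype=np.int32)
    # Highest-ranked instance first; stable sort keeps ties in input order.
    order = np.argsort(-np.asarray(rank_scores, dtype=float), kind="stable")
    for idx in order:
        # A lower-ranked instance only receives pixels nobody has claimed yet.
        free = masks[idx] & (canvas == 0)
        canvas[free] = idx + 1
    return canvas


# Two overlapping instances on a 1 x 4 image.
masks = np.array([[[1, 1, 1, 0]],
                  [[0, 1, 1, 1]]], dtype=bool)
scores = calibrated_scores([0.9, 0.8], [0.5, 0.9])
panoptic_ids = merge_by_rank(masks, scores)
```

In this toy case the second instance has the lower classification score but the higher predicted mask quality, so after calibration it outranks the first and keeps the contested pixels. This is the kind of inconsistency between mask quality and mask score that the abstract says the calibration module targets.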
