Paper Information - Improved Mask R-CNN for Aircraft Detection in Remote Sensing Images

Improved Mask R-CNN for Aircraft Detection in Remote Sensing Images

In recent years, remote sensing image analysis has become one of the most popular directions in image processing. Because only a small feature gap exists between satellite and natural images, deep learning algorithms can be applied to recognize objects in remote sensing images. We propose an improved Mask R-CNN model, called SCMask R-CNN, to improve detection performance on high-resolution remote sensing images that contain dense targets and complex backgrounds. The model performs object recognition and segmentation in parallel. It uses a modified SC-conv based on the ResNet101 backbone network to obtain more discriminative feature information, and it adds a set of dilated convolutions with a specific size to improve instance segmentation. Because remote sensing mask datasets are scarce, we construct WFA-1400 based on the DOTA dataset. We compare the improved algorithm with other state-of-the-art algorithms. The object detection AP50 and AP increased by 1–2% and 1%, respectively, demonstrating the effectiveness and feasibility of the improved model.
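The abstract mentions dilated convolutions but does not state their kernel sizes or dilation rates. As a rough illustration of why dilation helps segmentation, the following minimal sketch shows how a dilated kernel covers a wider span of the input without adding weights. The function names, the 3-tap kernel, and the rates 1, 2, and 4 are assumptions chosen for illustration only; they are not the paper's configuration.

```python
def effective_kernel_size(kernel_size: int, dilation: int) -> int:
    """Input span covered by a dilated kernel: k + (k - 1) * (d - 1)."""
    return kernel_size + (kernel_size - 1) * (dilation - 1)


def dilated_conv1d(signal, kernel, dilation=1):
    """'Valid' 1-D dilated cross-correlation written out in plain Python."""
    span = effective_kernel_size(len(kernel), dilation)
    out = []
    for start in range(len(signal) - span + 1):
        # Kernel taps are spaced `dilation` samples apart in the input.
        out.append(sum(w * signal[start + i * dilation]
                       for i, w in enumerate(kernel)))
    return out


if __name__ == "__main__":
    # Hypothetical dilation rates, not taken from the paper.
    for rate in (1, 2, 4):
        print(f"dilation={rate}: 3-tap kernel spans "
              f"{effective_kernel_size(3, rate)} input samples")
```

The number of weights stays at three in every case; only the spacing between taps changes, which is what lets a stack of such layers see more context around each target at the same parameter cost.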
