Improved Mask R-CNN for Aircraft Detection in Remote Sensing Images

In recent years, remote sensing image analysis has become one of the most popular directions in image processing. Only a small feature gap exists between satellite and natural images, so deep learning algorithms can be applied to recognize remote sensing images. We propose an improved Mask R-CNN model, called SCMask R-CNN, to improve detection in high-resolution remote sensing images that contain dense targets and complex backgrounds. The model performs object recognition and segmentation in parallel. It uses a modified SC-conv built on the ResNet101 backbone network to obtain more discriminative feature information, and adds a set of dilated convolutions of a specific size to improve instance segmentation. Because remote sensing mask datasets are scarce, we construct WFA-1400 based on the DOTA dataset. We compare the improved algorithm with other state-of-the-art algorithms: the object detection AP50 and AP increase by 1–2% and 1%, respectively, demonstrating the effectiveness and feasibility of the improved model.
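The role of the dilated convolutions can be illustrated with a short calculation. The sketch below is not the paper's configuration: the kernel size of 3, the dilation rates (1, 2, 4), and both function names are assumptions chosen only for illustration, and stride 1 is assumed throughout. It shows how dilation enlarges the area a convolution covers without adding weights.

```python
from typing import Sequence


def effective_kernel_size(kernel_size: int, dilation: int) -> int:
    """Span (in pixels) covered by one dilated kernel along one axis."""
    # A dilation of d inserts (d - 1) gaps between adjacent kernel taps.
    return kernel_size + (kernel_size - 1) * (dilation - 1)


def stacked_receptive_field(kernel_size: int, dilations: Sequence[int]) -> int:
    """Receptive field of stride-1 dilated convolutions applied in sequence."""
    receptive_field = 1
    for dilation in dilations:
        # Each stride-1 layer widens the field by (effective kernel - 1).
        receptive_field += effective_kernel_size(kernel_size, dilation) - 1
    return receptive_field


if __name__ == "__main__":
    # Hypothetical dilation rates; the abstract does not state the ones used.
    for d in (1, 2, 4):
        print(f"3x3 kernel, dilation {d}: covers {effective_kernel_size(3, d)} px")
    print("stacked receptive field:", stacked_receptive_field(3, (1, 2, 4)))
```

With these assumed rates, three 3x3 layers cover a 15-pixel span while keeping the parameter count of ordinary 3x3 convolutions, which is the usual motivation for adding dilation to a segmentation branch.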
