Object Detection of Optical Remote Sensing Image Based on Improved Faster RCNN

Object detection of optical remote sensing image is an important and challenging problem. And it is widely used in the field of aerial and satellite image analysis. With the rapid increase of optical remote sensing image data and popularity of convolutional neural network, the problem has attracted lots of attention recently. However, the detection result of images with complex background is unsatisfactory, so as images with dense and small objects. Aiming at these problems, we propose a method that combined Feature Pyramid Network(FPN) and Deformable Convolution Network(DCN) to improve the Faster RCNN framework, which helps to improve the detection result. The improved network combines the low-level structural information and the high-level semantic information together to enhance the feature representation. The shared convolutional layer makes end-to-end training come true. Additionally, deformable convolution network makes feature extraction better. We adopt the proposed framework to implement experiments on DOTA dataset, attaining mean average precision(mAP)value of 0.834 on the testing dataset, which is an increase of 23% than the classic Faster RCNN.

[1]  Jian Sun,et al.  Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Kaiming He,et al.  Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Ali Farhadi,et al.  YOLOv3: An Incremental Improvement , 2018, ArXiv.

[4]  Yi Li,et al.  Deformable Convolutional Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[5]  Wei Liu,et al.  SSD: Single Shot MultiBox Detector , 2015, ECCV.

[6]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[7]  Jun Wang,et al.  Knowledge-based detection method for SAR targets , 2014 .

[8]  Yi Li,et al.  R-FCN: Object Detection via Region-based Fully Convolutional Networks , 2016, NIPS.

[9]  Jiebo Luo,et al.  DOTA: A Large-Scale Dataset for Object Detection in Aerial Images , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[10]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[11]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  P. Mishra,et al.  Robust template matching based obstacle tracking for autonomous rovers , 2013, 2013 IEEE International Conference on Electronics, Computing and Communication Technologies.

[13]  Jin-Hee Lee,et al.  Co-Occurrence Matrix Analysis-Based Semi-Supervised Training for Object Detection , 2018, 2018 25th IEEE International Conference on Image Processing (ICIP).

[14]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Lei Guo,et al.  Object Detection in Optical Remote Sensing Images Based on Weakly Supervised Learning and High-Level Feature Learning , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[16]  Р Ю Чуйков,et al.  Обнаружение транспортных средств на изображениях загородных шоссе на основе метода Single shot multibox Detector , 2017 .

[17]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Junwei Han,et al.  A Survey on Object Detection in Optical Remote Sensing Images , 2016, ArXiv.

[19]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..