Boosting Ship Detection in SAR Images With Complementary Pretraining Techniques

Deep learning methods have made significant progress in ship detection in synthetic aperture radar (SAR) images. The pretraining technique is usually adopted to support deep neural networks-based SAR ship detectors due to the scarce labeled SAR images. However, directly leveraging ImageNet pretraining is hard to obtain a good ship detector because of different imaging perspectives and geometry. In this article, to resolve the problem of inconsistent imaging perspectives between ImageNet and earth observations, we propose an optical ship detector (OSD) pretraining technique to transfer the characteristics of ships in earth observations to SAR images from a large-scale aerial image dataset. On the other hand, to handle the problem of different imaging geometry between optical and SAR images, we propose an optical-SAR matching (OSM) pretraining technique, which transfers plentiful texture features from optical images to SAR images by common representation learning on the OSM task. Finally, observing that the OSD pretraining-based SSD has a better recall on sea area while the OSM pretraining-based SSD can reduce false alarms on land area, we combine the predictions of the two detectors through weighted boxes fusion to further improve detection results. Extensive experiments on four SAR ship detection datasets and three representative convolutional neural network-based detection benchmarks are conducted to show the effectiveness and complementarity of the two proposed detectors, and the state-of-the-art performance of the combination of the two detectors. The proposed method won the sixth place of ship detection in SAR images in the 2020 Gaofen challenge.

[1]  Zongxu Pan,et al.  Projection Shape Template-Based Ship Target Recognition in TerraSAR-X Images , 2017, IEEE Geoscience and Remote Sensing Letters.

[2]  Meiyu Huang,et al.  Task-Driven Common Representation Learning via Bridge Neural Network , 2019, AAAI.

[3]  R. Solovyev,et al.  Weighted Boxes Fusion: ensembling boxes for object detection models , 2019, ArXiv.

[4]  Ross B. Girshick,et al.  Mask R-CNN , 2017, 1703.06870.

[5]  Kaiming He,et al.  Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Hao Su,et al.  HRSID: A High-Resolution SAR Images Dataset for Ship Detection and Instance Segmentation , 2020, IEEE Access.

[7]  Nuno Vasconcelos,et al.  Cascade R-CNN: Delving Into High Quality Object Detection , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[8]  Aviv Eisenschtat,et al.  Linking Image and Text with 2-Way Nets , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Ross B. Girshick,et al.  Focal Loss for Dense Object Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Jiao Jiao,et al.  A Densely Connected End-to-End Neural Network for Multiscale and Multiscene SAR Ship Detection , 2018, IEEE Access.

[11]  Qi Li,et al.  Dense Attention Pyramid Networks for Multi-Scale Ship Detection in SAR Images , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[12]  Wenhui,et al.  AIR-SARShip-1.0: High-resolution SAR Ship Detection Dataset , 2020 .

[13]  Chen Wang,et al.  Precise and Robust Ship Detection for High-Resolution SAR Imagery Based on HR-SDNet , 2020, Remote. Sens..

[14]  Menglong Yan,et al.  Automatic Ship Detection in Remote Sensing Images from Google Earth of Complex Scenes Based on Multiscale Rotation Dense Feature Pyramid Networks , 2018, Remote. Sens..

[15]  Jianwei Li,et al.  Ship detection in SAR images based on an improved faster R-CNN , 2017, 2017 SAR in Big Data Era: Models, Methods and Applications (BIGSARDATA).

[16]  Shilin Zhou,et al.  Scheme of Parameter Estimation for Generalized Gamma Distribution and Its Application to Ship Detection in SAR Images , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[17]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Gangyao Kuang,et al.  Squeeze and Excitation Rank Faster R-CNN for Ship Detection in SAR Images , 2019, IEEE Geoscience and Remote Sensing Letters.

[19]  Boli Xiong,et al.  Attention Receptive Pyramid Network for Ship Detection in SAR Images , 2020, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[20]  Wei Liu,et al.  SSD: Single Shot MultiBox Detector , 2015, ECCV.

[21]  Ronghui Zhan,et al.  R2FA-Det: Delving into High-Quality Rotatable Boxes for Ship Detection in SAR Images , 2020, Remote. Sens..

[22]  Jiebo Luo,et al.  DOTA: A Large-Scale Dataset for Object Detection in Aerial Images , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[23]  Hong-Yuan Mark Liao,et al.  YOLOv4: Optimal Speed and Accuracy of Object Detection , 2020, ArXiv.

[24]  Yang Long,et al.  Learning RoI Transformer for Oriented Object Detection in Aerial Images , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Lu Li,et al.  Saliency-Guided Single Shot Multibox Detector for Target Detection in SAR Images , 2020, IEEE Transactions on Geoscience and Remote Sensing.

[26]  Xiao Xiang Zhu,et al.  Identifying Corresponding Patches in SAR and Optical Images With a Pseudo-Siamese CNN , 2018, IEEE Geoscience and Remote Sensing Letters.

[27]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[28]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  Zongxu Pan,et al.  DRBox-v2: An Improved Detector With Rotatable Boxes for Target Detection in SAR Images , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[30]  Ali Farhadi,et al.  YOLOv3: An Incremental Improvement , 2018, ArXiv.

[31]  Delphine Cerutti-Maori,et al.  Wide-Area Traffic Monitoring With the SAR/GMTI System PAMIR , 2008, IEEE Transactions on Geoscience and Remote Sensing.

[32]  Larry S. Davis,et al.  Soft-NMS — Improving Object Detection with One Line of Code , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[33]  Xiaoling Zhang,et al.  Depthwise Separable Convolution Neural Network for High-Speed SAR Ship Detection , 2019, Remote. Sens..

[34]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[35]  Kun Fu,et al.  An Anchor-Free Method Based on Feature Balancing and Refinement Network for Multiscale Ship Detection in SAR Images , 2021, IEEE Transactions on Geoscience and Remote Sensing.

[36]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[37]  Tara N. Sainath,et al.  Deep convolutional neural networks for LVCSR , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[38]  Biao Hou,et al.  Multilayer CFAR Detection of Ship Targets in Very High Resolution SAR Images , 2015, IEEE Geoscience and Remote Sensing Letters.

[39]  Xiaoling Zhang,et al.  High-Speed Ship Detection in SAR Images Based on a Grid Convolutional Neural Network , 2019, Remote. Sens..

[40]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.