Learning Slimming SAR Ship Object Detector Through Network Pruning and Knowledge Distillation

The deployment of deep convolutional neural networks (CNNs) in synthetic aperture radar (SAR) ship real-time detection is largely hindered by huge computational cost. In this article, we propose a novel learning scheme for training a lightweight ship detector called Tiny YOLO-Lite, which simultaneously 1) reduces the model storage size; 2) decreases the floating point operations (FLOPs) calculation; and 3) guarantees the high accuracy with faster speed. This is achieved by self-designed backbone structure and network pruning, which enforces channel-level sparsity in the backbone network and yields a compact model. In addition, knowledge distillation is also applied to make up for the performance decline caused by network pruning. Hereinto, we propose to let small student model mimic cumbersome teacher's output to achieve improved generalization. Rather than applying vanilla full feature imitation, we redefine the distilled knowledge as the inter-relationship between different levels of feature maps and then transfer it from the large network to a smaller one. On account that the detectors should focus more on the salient regions containing ships while background interference is overwhelming, a novel attention mechanism is designed and then attached to the distilled feature for enhanced representation. Finally, extensive experiments are conducted on SSDD, HRSID, and two large-scene SAR images to verify the effectiveness of the thinner SAR ship object detector in comparison of with other CNN-based algorithms. The detection results demonstrate that the proposed detector can achieve lighter architecture with 2.8-M model size, more efficient inference ($>$200 fps) with low computation cost, and more accurate prediction with knowledge transfer strategy.

[1]  Tianqi Chen,et al.  Net2Net: Accelerating Learning via Knowledge Transfer , 2015, ICLR.

[2]  Ching-Te Chiu,et al.  Real-time Object Detection via Pruning and a Concatenated Multi-feature Assisted Region Proposal Network , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[3]  Lena Chang,et al.  Ship Detection Based on YOLOv2 for SAR Imagery , 2019, Remote. Sens..

[4]  Song Han,et al.  Learning both Weights and Connections for Efficient Neural Network , 2015, NIPS.

[5]  Xiaoling Zhang,et al.  Depthwise Separable Convolution Neural Network for High-Speed SAR Ship Detection , 2019, Remote. Sens..

[6]  Junjie Yan,et al.  Mimicking Very Efficient Network for Object Detection , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Tony X. Han,et al.  Learning Efficient Object Detection Models with Knowledge Distillation , 2017, NIPS.

[9]  Jianyu Yang,et al.  Ship Detection in Large-Scale SAR Images Via Spatial Shuffle-Group Enhance Attention , 2021, IEEE Trans. Geosci. Remote. Sens..

[10]  Xiaoling Zhang,et al.  HyperLi-Net: A hyper-light deep learning network for high-accurate and high-speed ship detection from synthetic aperture radar imagery , 2020 .

[11]  Wenhui,et al.  AIR-SARShip-1.0: High-resolution SAR Ship Detection Dataset , 2020 .

[12]  Fei Gao,et al.  Enhanced Feature Extraction for Ship Detection from Multi-Resolution and Multi-Scene Synthetic Aperture Radar (SAR) Images , 2019, Remote. Sens..

[13]  Enhua Wu,et al.  Squeeze-and-Excitation Networks , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Jian Sun,et al.  Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Wei Pan,et al.  Towards Accurate Binary Convolutional Neural Network , 2017, NIPS.

[17]  Adam Van Etten,et al.  You Only Look Twice: Rapid Multi-Scale Object Detection In Satellite Imagery , 2018, ArXiv.

[18]  Ross B. Girshick,et al.  Mask R-CNN , 2017, 1703.06870.

[19]  Junmo Kim,et al.  A Gift from Knowledge Distillation: Fast Optimization, Network Minimization and Transfer Learning , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Pengyi Zhang,et al.  SlimYOLOv3: Narrower, Faster and Better for Real-Time UAV Applications , 2019, 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW).

[21]  Kaiming He,et al.  Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[22]  Geoffrey E. Hinton,et al.  Distilling the Knowledge in a Neural Network , 2015, ArXiv.

[23]  Gholamreza Akbarizadeh,et al.  Change detection in SAR images using deep belief network: a new training approach based on morphological images , 2019, IET Image Process..

[24]  Nuno Vasconcelos,et al.  Cascade R-CNN: Delving Into High Quality Object Detection , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[25]  Lei Xie,et al.  Learning Slimming SSD through Pruning and Knowledge Distillation , 2019, 2019 Chinese Automation Congress (CAC).

[26]  Chen Wang,et al.  Precise and Robust Ship Detection for High-Resolution SAR Imagery Based on HR-SDNet , 2020, Remote. Sens..

[27]  Zhiqiang Shen,et al.  Learning Efficient Convolutional Networks through Network Slimming , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[28]  Wei Liu,et al.  SSD: Single Shot MultiBox Detector , 2015, ECCV.

[29]  Nikos Komodakis,et al.  Paying More Attention to Attention: Improving the Performance of Convolutional Neural Networks via Attention Transfer , 2016, ICLR.

[30]  Jiashi Feng,et al.  Strip Pooling: Rethinking Spatial Pooling for Scene Parsing , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[31]  Gholamreza Akbarizadeh,et al.  Ship Classification in SAR Images Using a New Hybrid CNN–MLP Classifier , 2018, Journal of the Indian Society of Remote Sensing.

[32]  Gholamreza Akbarizadeh,et al.  PolSAR image segmentation based on feature extraction and data compression using Weighted Neighborhood Filter Bank and Hidden Markov random field-expectation maximization , 2020 .

[33]  Yoshua Bengio,et al.  FitNets: Hints for Thin Deep Nets , 2014, ICLR.

[34]  Zhicheng Zhao,et al.  Efficient Yolo: A Lightweight Model For Embedded Deep Learning Object Detection , 2020, 2020 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).

[35]  Timo Aila,et al.  Pruning Convolutional Neural Networks for Resource Efficient Inference , 2016, ICLR.

[36]  Ross B. Girshick,et al.  Focal Loss for Dense Object Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[37]  Qi Li,et al.  Dense Attention Pyramid Networks for Multi-Scale Ship Detection in SAR Images , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[38]  Song Han,et al.  Deep Compression: Compressing Deep Neural Network with Pruning, Trained Quantization and Huffman Coding , 2015, ICLR.

[39]  James Zijun Wang,et al.  Rethinking the Smaller-Norm-Less-Informative Assumption in Channel Pruning of Convolution Layers , 2018, ICLR.

[40]  Yuan Xie,et al.  Model Compression and Hardware Acceleration for Neural Networks: A Comprehensive Survey , 2020, Proceedings of the IEEE.

[41]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[42]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[43]  Naiyan Wang,et al.  Data-Driven Sparse Structure Selection for Deep Neural Networks , 2017, ECCV.

[44]  Hao Su,et al.  HRSID: A High-Resolution SAR Images Dataset for Ship Detection and Instance Segmentation , 2020, IEEE Access.

[45]  Zhidong Deng,et al.  Accelerating Convolutional Neural Networks with Dominant Convolutional Kernel and Knowledge Pre-regression , 2016, ECCV.

[46]  Xiangyu Zhang,et al.  Channel Pruning for Accelerating Very Deep Neural Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[47]  Ali Farhadi,et al.  YOLOv3: An Incremental Improvement , 2018, ArXiv.

[48]  Ali Farhadi,et al.  YOLO9000: Better, Faster, Stronger , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[49]  Hanan Samet,et al.  Pruning Filters for Efficient ConvNets , 2016, ICLR.

[50]  Gholamreza Akbarizadeh,et al.  A new approach for oil tank detection using deep learning features with control false alarm rate in high-resolution satellite imagery , 2020, International Journal of Remote Sensing.

[51]  Chuan He,et al.  A Deep Neural Network Based on an Attention Mechanism for SAR Ship Detection in Multiscale and Complex Scenarios , 2019, IEEE Access.

[52]  Jianwei Li,et al.  Ship detection in SAR images based on an improved faster R-CNN , 2017, 2017 SAR in Big Data Era: Models, Methods and Applications (BIGSARDATA).