The Aeroplane and Undercarriage Detection Based on Attention Mechanism and Multi-Scale Features Processing

Undercarriage device is one of the essential parts of an aeroplane, and accurate detection of whether the aeroplane undercarriage is operating normally can effectively avoid aeroplane accidents. To address the problems of low automation and low accuracy of small target detection in existing aeroplane undercarriage detection methods, an improved algorithm for aeroplane undercarriage detection YOLO V4 is proposed. Firstly, the convolutional network structure of Inception-ResNet is integrated into the CSPDarkNet53 framework to improve the algorithm’s ability to extract semantic information of target features; then an attention mechanism is added to the path aggregation network algorithm structure to improve the importance and relevance of different features after conceptual operations. In addition, aeroplane and undercarriage datasets were constructed, and finally, the generated partitioned test sets were tested to evaluate the test performance of Faster R-CNN, YOLO V3, and YOLO V4 target detection algorithms. The experimental results show that the improved algorithm has significantly improved the recall rate and the mean accuracy of detection for small targets in our dataset compared with the YOLO V4 algorithm. The reasonableness and advancedness of the improved algorithm in this paper are effectively verified.

[1]  Zeyu Hou,et al.  Path Planning Optimization for Driverless Vehicle in Parallel Parking Integrating Radial Basis Function Neural Network , 2021, Applied Sciences.

[2]  Hongbin Zha,et al.  RDCFace: Radial Distortion Correction for Face Recognition , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Hong-Yuan Mark Liao,et al.  YOLOv4: Optimal Speed and Accuracy of Object Detection , 2020, ArXiv.

[4]  Qingjie Liu,et al.  From W-Net to CDGAN: Bitemporal Change Detection via Deep Learning Techniques , 2020, IEEE Transactions on Geoscience and Remote Sensing.

[5]  Guangyu Gao,et al.  IPG-Net: Image Pyramid Guidance Network for Small Object Detection , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[6]  Jun-Wei Hsieh,et al.  CSPNet: A New Backbone that can Enhance Learning Capability of CNN , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[7]  Enhua Wu,et al.  Squeeze-and-Excitation Networks , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Dongsheng Li,et al.  Geometry Guided Feature Aggregation in Video Face Recognition , 2019, 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW).

[9]  Diganta Misra,et al.  Mish: A Self Regularized Non-Monotonic Neural Activation Function , 2019, ArXiv.

[10]  Thiago Santini,et al.  Improving real-time CNN-based pupil detection through domain-specific data augmentation , 2019, ETRA.

[11]  Xiangyu Zhu,et al.  AdaptiveFace: Adaptive Margin and Sampling for Face Recognition , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Fang Liu,et al.  Transferred Deep Learning-Based Change Detection in Remote Sensing Images , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[13]  Maoguo Gong,et al.  A Conditional Adversarial Network for Change Detection in Heterogeneous Images , 2019, IEEE Geoscience and Remote Sensing Letters.

[14]  Mun-Cheon Kang,et al.  Parallel Feature Pyramid Network for Object Detection , 2018, ECCV.

[15]  Ali Farhadi,et al.  YOLOv3: An Incremental Improvement , 2018, ArXiv.

[16]  Shu Liu,et al.  Path Aggregation Network for Instance Segmentation , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[17]  Youping Chen,et al.  Classification of surface defects on steel sheet using convolutional neural networks , 2017 .

[18]  Ali Farhadi,et al.  YOLO9000: Better, Faster, Stronger , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[19]  Kaiming He,et al.  Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Sergey Ioffe,et al.  Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning , 2016, AAAI.

[21]  Jianxiong Xiao,et al.  R-CNN for Small Object Detection , 2016, ACCV.

[22]  Raquel Urtasun,et al.  Efficient Deep Learning for Stereo Matching , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Zsolt Kira,et al.  Fusing LIDAR and images for pedestrian detection using convolutional neural networks , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[24]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[28]  Jian Sun,et al.  Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29]  Reinhold Huber-Mörk,et al.  Convolutional Neural Networks for Steel Surface Defect Detection from Photometric Stereo Images , 2014, ISVC.

[30]  Jürgen Schmidhuber,et al.  Multi-scale pyramidal pooling network for generic steel defect classification , 2013, The 2013 International Joint Conference on Neural Networks (IJCNN).