Performance Comparison of Small Object Detection Algorithms of UAV based Aerial Images

Traffic controls in modern society are part of urban management. With the assistance of unmanned aerial vehicles (UAVs) equipped with mounted cameras, researchers could capture aerial (bird-view) images from appropriate altitude. The perspective in aerial images makes appearances of objects squat, although aerial images can supply more contextual information about the environment by a broader view angle, the object instances may be detected by mistake. This fact diminishes the aerial images that can be fed to a network with higher dimensions that increases the computational cost to prevent the diminishing of pixels belonging to small objects. To compare model performance on small objects with aerial images, this study trains and tests two object detectors, i.e. YOLOv4 and YOLOv3, on the AU-AIR dataset, and exploited the parameterization of YOLO based models for small object detection. Finally, the key numerical results and observations are presented.

[1]  Ali Farhadi,et al.  YOLOv3: An Incremental Improvement , 2018, ArXiv.

[2]  A. Puri A Survey of Unmanned Aerial Vehicles ( UAV ) for Traffic Surveillance , 2005 .

[3]  Larry S. Davis,et al.  An Analysis of Scale Invariance in Object Detection - SNIP , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[4]  Hong-Yuan Mark Liao,et al.  YOLOv4: Optimal Speed and Accuracy of Object Detection , 2020, ArXiv.

[5]  Kaiming He,et al.  Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Kaiming He,et al.  Designing Network Design Spaces , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Xiangyu Zhang,et al.  DetNet: A Backbone network for Object Detection , 2018, ArXiv.

[8]  Razvan Pascanu,et al.  On the Number of Linear Regions of Deep Neural Networks , 2014, NIPS.

[9]  Ilker Bozcan,et al.  AU-AIR: A Multi-modal Unmanned Aerial Vehicle Dataset for Low Altitude Traffic Surveillance , 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[10]  Nuno Vasconcelos,et al.  Cascade R-CNN: Delving Into High Quality Object Detection , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.