YOLOv3-Based Human Activity Recognition as Viewed from a Moving High-Altitude Aerial Camera

This paper presents a method to classify human activities as normal or suspicious using YOLOv3 to automatically process video footages taken from a high altitude moving aerial camera, such as the one attached to a drone. We consider four human activities namely, jogging, walking, fighting, and chasing. Objects generally appear much smaller, with less visible features, when viewed from high altitudes. The reduced visible features make automatic human activity detection from ground surveillance cameras not applicable to the high altitude case. Through transfer learning, we modified existing pre-trained YOLOv3 convolutional neural networks (CNN‘s) and retrained with our own high aerial human action dataset. By so doing, we were able to customize YOLOv3 to detect, localize, and recognize aerial human activities in real-time as normal or suspicious. The proposed approach achieves a promising average precision accuracy of 82.30%, and average F1 score of 88.10% on classifying high aerial human activities. We demonstrated that YOLOv3 is a powerful approach and relatively fast for the recognition and localization of human subjects as seen from above.

[1]  K. N. Kumar,et al.  Object Detection Method Based on YOLOv3 using Deep Learning Networks , 2019, International Journal of Innovative Technology and Exploring Engineering.

[2]  Dmitry B. Goldgof,et al.  How effective is human video surveillance performance? , 2008, 2008 19th International Conference on Pattern Recognition.

[3]  Nan Wang,et al.  Real-time behavior detection and judgment of egg breeders based on YOLO v3 , 2019, Neural Computing and Applications.

[4]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Miran Pobar,et al.  Adapting YOLO Network for Ball and Player Detection , 2019, ICPRAM.

[6]  Wei Liu,et al.  SSD: Single Shot MultiBox Detector , 2015, ECCV.

[7]  Juan Du,et al.  Understanding of Object Detection Based on CNN Family and YOLO , 2018 .

[8]  Sutrisno Ibrahim,et al.  A comprehensive review on intelligent surveillance systems , 2016 .

[9]  Yuchuan Du,et al.  Pavement distress detection and classification based on YOLO network , 2020, International Journal of Pavement Engineering.

[10]  Ali Farhadi,et al.  YOLOv3: An Incremental Improvement , 2018, ArXiv.

[11]  Antonio Fernández-Caballero,et al.  Identification of Loitering Human Behaviour in Video Surveillance Environments , 2015, IWINAC.

[12]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Lin Wang,et al.  Tinier-YOLO: A Real-Time Object Detection Method for Constrained Environments , 2020, IEEE Access.