Human abnormal behavior detection method based on T-TINY-YOLO

Aiming at the problem of human abnormal behavior detection in video surveillance, an abnormal target detection method based on T-TINY-YOLO network model is proposed. First, the type of abnormal behavior is defined according to the requirements of the monitoring scenario. Then the calibrated abnormal behavior data is trained through the YOLO network model to achieve end-to-end abnormal behavior classification, thereby achieving abnormal target detection for specific application scenarios. For the characteristics of a large number of zero-valued weight parameters in YOLO's network weights, a convolutional neural network tailoring scheme is proposed to improve the network model, accelerate the algorithm, and improve the real-time performance of the system. Finally, the method is implemented on the embedded platform NVIDIA JetsonTX2. The experimental results are good, the detection speed reaches 12 frames/s, and the recall rate is 80.87%.

[1]  Matti Pietikäinen,et al.  Deep Learning for Generic Object Detection: A Survey , 2018, International Journal of Computer Vision.

[2]  Ali Farhadi,et al.  YOLO9000: Better, Faster, Stronger , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Ali Farhadi,et al.  YOLOv3: An Incremental Improvement , 2018, ArXiv.

[4]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Yi Li,et al.  R-FCN: Object Detection via Region-based Fully Convolutional Networks , 2016, NIPS.

[6]  Dumitru Erhan,et al.  Deep Neural Networks for Object Detection , 2013, NIPS.

[7]  Christopher Hunt,et al.  Notes on the OpenSURF Library , 2009 .

[8]  John F. Canny,et al.  Deep Traffic Light Detection for Self-driving Cars from a Large-scale Dataset , 2018, 2018 21st International Conference on Intelligent Transportation Systems (ITSC).

[9]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[10]  Jitendra Malik,et al.  Deformable part models are convolutional neural networks , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Jian Sun,et al.  Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2015, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  冯小雨,et al.  基于改进Faster R-CNN的空中目标检测 , 2018 .

[13]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[14]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[15]  Yang Wang,et al.  Optimizing Intersection-Over-Union in Deep Neural Networks for Image Segmentation , 2016, ISVC.

[16]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[17]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.