Violent Crowd Flow Detection Using Deep Learning

This research aims in detecting violent crowd flows in the context of Bangladesh. For this purpose, we have collected a dataset which includes both violent and non-violent crowd flows. Different deep learning algorithms and approaches have been applied on this dataset to detect scenarios which contain violence. Convolutional neural networks (CNN) and long short-term memory network (LSTM) based architectures have been experimented separately on this dataset and in combination as well. Moreover, a model that was already pre-trained on violent movie scenes has been used to leverage transfer learning which outperformed all other experimented approaches with an accuracy of 95.67%. Surprisingly, the sequence model alone or in combination with CNN has not performed well on this particular dataset. The proposed model is lightweight hence it can be deployed easily in any security systems consisting of CCTV cameras or unmanned aerial vehicles (UAVs).

[1]  Xiangjian He,et al.  MoWLD: a robust motion image descriptor for violence detection , 2015, Multimedia Tools and Applications.

[2]  Tal Hassner,et al.  Violent flows: Real-time detection of violent crowd behavior , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[3]  Rahul Sukthankar,et al.  Violence Detection in Video Using Computer Vision Techniques , 2011, CAIP.

[4]  Yong Xu,et al.  Detecting Robbery and Violent Scenarios , 2013, 2013 Second International Conference on Robot, Vision and Signal Processing.

[5]  Hélio Pedrini,et al.  Detection of Violent Events in Video Sequences Based on Census Transform Histogram , 2017, 2017 30th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI).

[6]  Oswald Lanz,et al.  Learning to detect violent videos using convolutional long short-term memory , 2017, 2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[7]  Ziqiang Shi,et al.  Fudan at MediaEval 2013: Violent Scenes Detection Using Motion Features and Part-Level Attributes , 2013, MediaEval.

[8]  Ming Zhu,et al.  Violence Detection in Video by Using 3D Convolutional Neural Networks , 2014, ISVC.

[9]  Alessandro Perina,et al.  Violence detection in crowded scenes using substantial derivative , 2015, 2015 12th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[10]  Mohammad Soleymani,et al.  VSD, a public dataset for the detection of violent scenes in movies: design, annotation, analysis and evaluation , 2014, Multimedia Tools and Applications.

[11]  Yingyun Yang,et al.  Violence Detection Algorithm Based on Local Spatio-temporal Features and Optical Flow , 2015, 2015 International Conference on Industrial Informatics - Computing Technology, Intelligent Technology, Industrial Information Integration.