Violent activity detection with transfer learning method

Although action recognition is a widely studied field in computer vision, the recognition of aggressive activities and crowd violence is comparatively less studied. Many surveillance cameras are now installed in the streets, and there is a demand for intelligent crowd activity detection systems. A method for violence detection in videos is proposed. The primary contribution is a novel transfer learning-based violence detector that yields promising results compared with existing detectors. First, the optical flows of the input videos are computed via the Lucas–Kanade method. Then, several 2D templates are constructed by overlapping optical flow magnitudes and orientations. These templates are supplied as input to a pre-trained convolutional neural network, and deep features are extracted from different layers. Cubic kernel support vector machine and subspace k-nearest neighbours classifiers are trained for prediction, and the proposed method is tested on three different datasets that are commonly used in violence detection studies.
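As an illustration of the first step only, the following is a minimal sketch of a single-window Lucas–Kanade estimate, followed by the magnitude and orientation of the resulting flow vector. It is not the authors' implementation: the abstract does not state the window size, whether the flow is dense or sparse, or how the templates are assembled, so the function names (`lucas_kanade_window`, `flow_magnitude_orientation`, `make_frame`) and the synthetic test frames are assumptions made for this example.

```python
import math


def lucas_kanade_window(prev, curr):
    """Estimate one (u, v) flow vector for a small grayscale window.

    prev and curr are 2D lists of floats with the same shape. The flow is
    the least-squares solution of Ix*u + Iy*v + It = 0 over interior pixels.
    """
    h, w = len(prev), len(prev[0])
    sxx = sxy = syy = sxt = syt = 0.0
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            # Central differences for the spatial gradients of the first frame.
            ix = (prev[y][x + 1] - prev[y][x - 1]) / 2.0
            iy = (prev[y + 1][x] - prev[y - 1][x]) / 2.0
            # Temporal gradient between the two frames.
            it = curr[y][x] - prev[y][x]
            sxx += ix * ix
            sxy += ix * iy
            syy += iy * iy
            sxt += ix * it
            syt += iy * it
    det = sxx * syy - sxy * sxy
    if abs(det) < 1e-12:
        # Structure matrix is singular (flat or edge-only window): no estimate.
        return 0.0, 0.0
    # Solve [[sxx, sxy], [sxy, syy]] @ [u, v] = [-sxt, -syt] in closed form.
    u = (-syy * sxt + sxy * syt) / det
    v = (sxy * sxt - sxx * syt) / det
    return u, v


def flow_magnitude_orientation(u, v):
    """Return the magnitude and orientation (radians) of a flow vector."""
    return math.hypot(u, v), math.atan2(v, u)


def make_frame(dx, dy, size=9):
    """Hypothetical test image: a paraboloid centred in the window, shifted by (dx, dy)."""
    c = size // 2
    return [[(x - c - dx) ** 2 + (y - c - dy) ** 2 for x in range(size)]
            for y in range(size)]


# Two synthetic frames that differ by a known sub-pixel shift.
prev_frame = make_frame(0.0, 0.0)
curr_frame = make_frame(0.2, 0.1)
u, v = lucas_kanade_window(prev_frame, curr_frame)
mag, ang = flow_magnitude_orientation(u, v)
print(f"u={u:.3f}, v={v:.3f}, magnitude={mag:.3f}, orientation={ang:.3f} rad")
```

In a full pipeline, such magnitude and orientation values would be computed over many windows and frames before being arranged into 2D templates; how that arrangement is done is not specified in the abstract and is left out here.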
