Enhanced Convolutional Neural Network for Abnormal Event Detection in Video Streams

Abnormal event detection is a complex task in computer vision. It is one among the most difficult challenges in crowded scene analysis, given the huge variety of events to analyze and the frequent occlusions that can occur. Ideally, to meet this challenge, it would be necessary to have a large number of normal and abnormal training examples. However, the high variability between events and their complexity make it almost impossible to provide enough abnormal training samples. To overcome these constraints, the solution may reside in a robust one-class learning algorithm. In this paper, we propose a novel one-class neural network technique suitable for outliers detection. This network takes advantage of the deep learning ability in image processing, while meeting the constraint of using only training samples from the target class. Our network is composed of a Convolutional Auto Encoder (CAE) with two loss functions. The first one is a standard mean squared error (MSE) and the second is a proposed compactness loss function particularly adapted for one class learning. This network allows us to not only extract robust spatiotemporal features but also ensures that all the target class representations are sufficiently compact to be effectively isolated from outliers. The proposed algorithm is evaluated on the challenging USCD Ped2 dataset for abnormal video event detection and localization. Both qualitative and quantitative experiments show competitive performances compared to the state-of-the-art methods.

[1]  Ross B. Girshick,et al.  Mask R-CNN , 2017, 1703.06870.

[2]  Yong Haur Tay,et al.  Abnormal Event Detection in Videos using Spatiotemporal Autoencoder , 2017, ISNN.

[3]  Simone Calderara,et al.  Detecting anomalies in people's trajectories using spectral graph analysis , 2011, Comput. Vis. Image Underst..

[4]  Hongbin Zha,et al.  Learning to Detect Anomalies in Surveillance Video , 2015, IEEE Signal Processing Letters.

[5]  Aggelos K. Katsaggelos,et al.  Abnormal Event Detection from Surveillance Video by Dynamic Hierarchical Clustering , 2007, 2007 IEEE International Conference on Image Processing.

[6]  Jonghyun Choi,et al.  Learning Temporal Regularity in Video Sequences , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Yong Man Ro,et al.  Stan: Spatio- Temporal Adversarial Networks for Abnormal Event Detection , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[8]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[9]  Jason Weston,et al.  A unified architecture for natural language processing: deep neural networks with multitask learning , 2008, ICML '08.

[10]  Martin D. Levine,et al.  An on-line, real-time learning method for detecting anomalies in videos using spatio-temporal compositions , 2013, Comput. Vis. Image Underst..

[11]  Mahmood Fathy,et al.  Deep-Cascade: Cascading 3D Deep Neural Networks for Fast Anomaly Detection and Localization in Crowded Scenes , 2017, IEEE Transactions on Image Processing.

[12]  K. Grauman,et al.  Observe locally, infer globally: A space-time MRF for detecting abnormal activities with incremental updates , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Wei Shen,et al.  Spatial-temporal convolutional neural networks for anomaly detection and localization in crowded scenes , 2016, Signal Process. Image Commun..

[14]  Nicu Sebe,et al.  Abnormal event detection in videos using generative adversarial nets , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[15]  Vishal M. Patel,et al.  One-Class Convolutional Neural Network , 2019, IEEE Signal Processing Letters.

[16]  Lorenzo Torresani,et al.  Learning Spatiotemporal Features with 3D Convolutional Networks , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[17]  Ali Farhadi,et al.  YOLOv3: An Incremental Improvement , 2018, ArXiv.

[18]  Chandan Srivastava,et al.  Support Vector Data Description , 2011 .

[19]  Jie Shao,et al.  Abnormal event detection for video surveillance using deep one-class learning , 2017, Multimedia Tools and Applications.

[20]  Nuno Vasconcelos,et al.  Anomaly detection in crowded scenes , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[21]  Kristen Grauman,et al.  Observe locally, infer globally: A space-time MRF for detecting abnormal activities with incremental updates , 2009, CVPR.

[22]  Vania Bogorny,et al.  Toward Abnormal Trajectory and Event Detection in Video Surveillance , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[23]  Martin D. Levine,et al.  Online Dominant and Anomalous Behavior Detection in Videos , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[24]  Hichem Snoussi,et al.  Detection of Abnormal Visual Events via Global Optical Flow Orientation Histogram , 2014, IEEE Transactions on Information Forensics and Security.

[25]  Andrei Zaharescu,et al.  Anomalous Behaviour Detection Using Spatiotemporal Oriented Energies, Subset Inclusion Histogram Comparison and Event-Driven Processing , 2010, ECCV.

[26]  Tieniu Tan,et al.  A system for learning statistical motion patterns , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  Fei-Fei Li,et al.  Online detection of unusual events in videos via dynamic sparse coding , 2011, CVPR 2011.

[28]  Mahmood Fathy,et al.  Adversarially Learned One-Class Classifier for Novelty Detection , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[29]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[30]  Brian C. Lovell,et al.  Improved anomaly detection in crowded scenes via cell-based analysis of foreground speed, size and texture , 2011, CVPR 2011 WORKSHOPS.

[31]  Bowen Zhang,et al.  Real-Time Action Recognition with Enhanced Motion Vector CNNs , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[32]  Hichem Snoussi,et al.  Abnormal event detection via covariance matrix for optical flow based feature , 2018, Multimedia Tools and Applications.

[33]  Alberto Del Bimbo,et al.  Multi-scale and real-time non-parametric approach for anomaly detection and localization , 2012, Comput. Vis. Image Underst..

[34]  Tara N. Sainath,et al.  Deep Neural Networks for Acoustic Modeling in Speech Recognition , 2012 .

[35]  Alexander Binder,et al.  Deep One-Class Classification , 2018, ICML.

[36]  Lin Wang,et al.  Abnormal Event Detection in Videos Using Hybrid Spatio-Temporal Autoencoder , 2018, 2018 25th IEEE International Conference on Image Processing (ICIP).

[37]  Hichem Snoussi,et al.  Generative Neural Networks for Anomaly Detection in Crowded Scenes , 2019, IEEE Transactions on Information Forensics and Security.

[38]  Svetha Venkatesh,et al.  Activity recognition and abnormality detection with the switching hidden semi-Markov model , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[39]  Nuno Vasconcelos,et al.  Anomaly Detection and Localization in Crowded Scenes , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.