One-Class Classification in Images and Videos Using a Convolutional Autoencoder With Compact Embedding

In One-Class Classification (OCC) problems, the classifier is trained with samples of a class considered normal, such that exceptional patterns can be identified as anomalies. Indeed, for real-world problems, the representation of the normal class in the feature space is an important issue, considering that one or more clusters can describe different aspects of the normality. For classification purposes, it is important that these clusters be as compact (dense) as possible, for better discriminating anomalous patterns, which is a recurrent problem in OCC tasks. This work introduces a hybrid approach using deep learning and One-Class Support Vector Machine (OC-SVM) methods, named Convolutional Autoencoder with Compact Embedding (CAE-CE), for enhancing the compactness of clusters in the feature space. Such an approach is still underexplored in the literature, being restricted to models within the context of metric learning. Additionally, the absence of anomalous samples during training makes it difficult to determine when to interrupt the learning process, so as to avoid over-compression of the normal examples, thus resulting in overfitting of the model. In this work, we propose a novel sensitivity-based stop criterion, and its suitability for OCC problems was assessed. Using an OC-SVM for the classification task, several experiments were done using publicly available image and video datasets. We also introduce other two new benchmarks, specifically designed for video anomaly detection in highways. The final performance of the proposed method was compared with a baseline Convolutional Autoencoder (CAE). Overall results suggest that the enhanced compactness introduced by the CAE-CE improved the classification performance for most datasets. Also, the qualitative analysis of frames at the visual level indicated that features learned by CAE-CE are closely correlated to the anomalous events.

[1]  David M. J. Tax,et al.  One-class classification , 2001 .

[2]  Yasushi Makihara,et al.  Identifying motion pathways in highly crowded scenes: A non-parametric tracklet clustering approach , 2020, Comput. Vis. Image Underst..

[3]  Hong Chang,et al.  A Scalable Kernel-Based Algorithm for Semi-Supervised Metric Learning , 2007, IJCAI.

[4]  Patrick Marques Ciarelli,et al.  Novelty Detection in Social Media by Fusing Text and Image Into a Single Structure , 2019, IEEE Access.

[5]  Misha Pavel,et al.  Adjustment Learning and Relevant Component Analysis , 2002, ECCV.

[6]  Yoram Singer,et al.  Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res..

[7]  Nuno Vasconcelos,et al.  Anomaly detection in crowded scenes , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[8]  Minho Lee,et al.  Deep learning with support vector data description , 2015, Neurocomputing.

[9]  Jonghyun Choi,et al.  Learning Temporal Regularity in Video Sequences , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Yong Liu,et al.  AnomalyNet: An Anomaly Detection Network for Video Surveillance , 2019, IEEE Transactions on Information Forensics and Security.

[11]  Matthijs Douze,et al.  Deep Clustering for Unsupervised Learning of Visual Features , 2018, ECCV.

[12]  Dhruv Batra,et al.  Joint Unsupervised Learning of Deep Representations and Image Clusters , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Jiliu Zhou,et al.  Dual-channel CNN for efficient abnormal behavior identification through crowd feature engineering , 2018, Machine Vision and Applications.

[14]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[15]  Xiaoqiang Lu,et al.  Learning deep event models for crowd anomaly detection , 2017, Neurocomputing.

[16]  Honglak Lee,et al.  An Analysis of Single-Layer Networks in Unsupervised Feature Learning , 2011, AISTATS.

[17]  Elvan Duman,et al.  Anomaly Detection in Videos Using Optical Flow and Convolutional Autoencoder , 2019, IEEE Access.

[18]  Matheus Gutoski,et al.  A clustering-based deep autoencoder for one-class image classification , 2017, 2017 IEEE Latin American Conference on Computational Intelligence (LA-CCI).

[19]  Tehreem Qasim,et al.  A low dimensional descriptor for detection of anomalies in crowd videos , 2019, Math. Comput. Simul..

[20]  Subutai Ahmad,et al.  Unsupervised real-time anomaly detection for streaming data , 2017, Neurocomputing.

[21]  Mubarak Shah,et al.  Abnormal crowd behavior detection using social force model , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[22]  Tomer Hertz,et al.  Learning a Mahalanobis Metric from Equivalence Constraints , 2005, J. Mach. Learn. Res..

[23]  Colin Raffel,et al.  Lasagne: First release. , 2015 .

[24]  Alexander Binder,et al.  Deep One-Class Classification , 2018, ICML.

[25]  Venkatesh Saligrama,et al.  Video anomaly detection based on local statistical aggregates , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[26]  Mahmood Fathy,et al.  Deep-anomaly: Fully convolutional neural network for fast anomaly detection in crowded scenes , 2016, Comput. Vis. Image Underst..

[27]  Myungjin Kim,et al.  Anomaly Detection for HTTP Using Convolutional Autoencoders , 2018, IEEE Access.

[28]  Bernhard Schölkopf,et al.  New Support Vector Algorithms , 2000, Neural Computation.

[29]  Raúl Monroy,et al.  Bagging-RandomMiner: a one-class classifier for file access-based masquerade detection , 2018, Machine Vision and Applications.

[30]  Muhammad Munwar Iqbal,et al.  Enhanced Network Anomaly Detection Based on Deep Neural Networks , 2018, IEEE Access.

[31]  Heitor Silvério Lopes,et al.  A study of deep convolutional auto-encoders for anomaly detection in videos , 2018, Pattern Recognit. Lett..

[32]  Jürgen Schmidhuber,et al.  Stacked Convolutional Auto-Encoders for Hierarchical Feature Extraction , 2011, ICANN.

[33]  John Salvatier,et al.  Theano: A Python framework for fast computation of mathematical expressions , 2016, ArXiv.

[34]  David M. J. Tax,et al.  An Adaptive Radial Basis Function Kernel for Support Vector Data Description , 2015, SIMBAD.

[35]  Geoffrey E. Hinton,et al.  Using very deep autoencoders for content-based image retrieval , 2011, ESANN.

[36]  Eduardo A. B. da Silva,et al.  Anomaly detection with a moving camera using multiscale video analysis , 2019, Multidimens. Syst. Signal Process..

[37]  Wei Xiong,et al.  Stacked Convolutional Denoising Auto-Encoders for Feature Representation , 2017, IEEE Transactions on Cybernetics.

[38]  Vikas Gupta,et al.  Abnormality detection in crowd videos by tracking sparse components , 2017, Machine Vision and Applications.

[39]  Joachim Denzler,et al.  Local Novelty Detection in Multi-class Recognition Problems , 2015, 2015 IEEE Winter Conference on Applications of Computer Vision.

[40]  Shaogang Gong,et al.  Learning a Discriminative Null Space for Person Re-identification , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[42]  Anthony Widjaja,et al.  Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2003, IEEE Transactions on Neural Networks.

[43]  Shaogang Gong,et al.  Image and Video Understanding in Big Data , 2017, Comput. Vis. Image Underst..

[44]  R. Venkatesh Babu,et al.  Anomaly detection via short local trajectories , 2017, Neurocomputing.

[45]  Christopher Leckie,et al.  High-dimensional and large-scale anomaly detection using a linear one-class SVM with deep learning , 2016, Pattern Recognit..

[46]  Ganesh Ramakrishnan,et al.  Anomaly Detection in Surveillance Videos , 2019, 2019 26th International Conference on High Performance Computing, Data and Analytics Workshop (HiPCW).

[47]  H. Bourlard,et al.  Auto-association by multilayer perceptrons and singular value decomposition , 1988, Biological Cybernetics.

[48]  Nicu Sebe,et al.  Detecting anomalous events in videos by learning deep representations of appearance and motion , 2017, Comput. Vis. Image Underst..

[49]  Cewu Lu,et al.  Abnormal Event Detection at 150 FPS in MATLAB , 2013, 2013 IEEE International Conference on Computer Vision.

[50]  Laurens van der Maaten,et al.  Learning a Parametric Embedding by Preserving Local Structure , 2009, AISTATS.

[51]  Nannan Li,et al.  Video anomaly detection based on a hierarchical activity discovery within spatio-temporal contexts , 2014, Neurocomputing.

[52]  Anderson Rocha,et al.  Toward Open Set Recognition , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[53]  Heitor Silvério Lopes,et al.  Extracting human attributes using a convolutional neural network approach , 2015, Pattern Recognit. Lett..

[54]  Xuan Li,et al.  Probabilistic framework of visual anomaly detection for unbalanced data , 2016, Neurocomputing.

[55]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[56]  Manassés Ribeiro Deep learning methods for detecting anomalies in videos: theoretical and methodological contributions , 2018 .

[57]  Ali Farhadi,et al.  Unsupervised Deep Embedding for Clustering Analysis , 2015, ICML.

[58]  Joachim Denzler,et al.  Kernel Null Space Methods for Novelty Detection , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.