Evaluation of Anomaly Detection Algorithms for the Real-World Applications

Anomaly detection in complex data structures is one of the most challenging problems in computer vision. In many real-world problems, for example in the quality control in modern manufacturing, the anomalous samples are usually rare, resulting in (highly) imbalanced datasets. However, in current research practice, these scenarios are rarely modeled, and as a consequence, evaluation of anomaly detection algorithms often do not reproduce results that are useful for practical applications. First, even in case of highly unbalanced input data, anomaly detection algorithms are expected to significantly reduce the proportion of anomalous samples, detecting “almost all” anomalous samples (with exact specifications depending on the target customer). This places high importance on only the small part of the ROC curve, possibly rendering the standard metrics such as AUC (Area Under Curve) and AP (Average Precision) useless. Second, the target of automatic anomaly detection in practical applications is significant reduction in manual work required, and standard metrics are poor predictor of this feature. Finally, the evaluation may produce erratic results for different randomly initialized training runs of the neural network, producing evaluation results that may not reproduce well in practice. In this paper, we present an evaluation methodology that avoids these pitfalls.

[1]  Guy Lapalme,et al.  A systematic analysis of performance measures for classification tasks , 2009, Inf. Process. Manag..

[2]  Janez Pers,et al.  Building Visual Anomaly Dataset from Satellite Data using ADS-B , 2019, OpenSky.

[3]  Katerina Bicova,et al.  Use of the ppm and its function in the production process , 2017 .

[4]  Georg Langs,et al.  Unsupervised Anomaly Detection with Generative Adversarial Networks to Guide Marker Discovery , 2017, IPMI.

[5]  Dimitris Kanellopoulos,et al.  Handling imbalanced datasets: A review , 2006 .

[6]  Emanuele Ghelfi,et al.  A Survey on GANs for Anomaly Detection , 2019, ArXiv.

[7]  Erik Learned-Miller,et al.  FDDB: A benchmark for face detection in unconstrained settings , 2010 .

[8]  Toby P. Breckon,et al.  GANomaly: Semi-Supervised Anomaly Detection via Adversarial Training , 2018, ACCV.

[9]  Ron Kohavi,et al.  The Case against Accuracy Estimation for Comparing Induction Algorithms , 1998, ICML.

[10]  Jure Skvarč,et al.  Segmentation-based deep-learning approach for surface-defect detection , 2019, Journal of Intelligent Manufacturing.

[11]  Georg Langs,et al.  f‐AnoGAN: Fast unsupervised anomaly detection with generative adversarial networks , 2019, Medical Image Anal..

[12]  Roger Davies,et al.  A Machine Vision Quality Control System for Industrial Acrylic Fibre Production , 2002, EURASIP J. Adv. Signal Process..

[13]  Nikolaos M. Avouris,et al.  EVALUATION OF CLASSIFIERS FOR AN UNEVEN CLASS DISTRIBUTION PROBLEM , 2006, Appl. Artif. Intell..

[14]  Hayit Greenspan,et al.  GAN-based Synthetic Medical Image Augmentation for increased CNN Performance in Liver Lesion Classification , 2018, Neurocomputing.

[15]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[16]  Nitesh V. Chawla,et al.  Data Mining for Imbalanced Datasets: An Overview , 2005, The Data Mining and Knowledge Discovery Handbook.

[17]  Farzaneh Khoshnevisan,et al.  RSM-GAN: A Convolutional Recurrent GAN for Anomaly Detection in Contaminated Seasonal Multivariate Time Series , 2019, ArXiv.

[18]  Soumith Chintala,et al.  Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[19]  Paul F. Whelan,et al.  Machine vision systems: proverbs, principles, prejudices, and priorities , 1994, Other Conferences.

[20]  Jason Yosinski,et al.  Deep neural networks are easily fooled: High confidence predictions for unrecognizable images , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Alex Krizhevsky,et al.  Learning Multiple Layers of Features from Tiny Images , 2009 .

[22]  Trevor Darrell,et al.  Adversarial Feature Learning , 2016, ICLR.

[23]  Chuan Sheng Foo,et al.  Adversarially Learned Anomaly Detection , 2018, 2018 IEEE International Conference on Data Mining (ICDM).

[24]  Atsuto Maki,et al.  A systematic study of the class imbalance problem in convolutional neural networks , 2017, Neural Networks.

[25]  Shenghua Gao,et al.  A Revisit of Sparse Coding Based Anomaly Detection in Stacked RNN Framework , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[26]  Toby P. Breckon,et al.  Skip-GANomaly: Skip Connected and Adversarially Trained Encoder-Decoder Anomaly Detection , 2019, 2019 International Joint Conference on Neural Networks (IJCNN).

[27]  Jan Brabec,et al.  On Model Evaluation Under Non-constant Class Imbalance , 2020, ICCS.

[28]  Roland Vollgraf,et al.  Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms , 2017, ArXiv.

[29]  Ramesh Nallapati,et al.  OCGAN: One-Class Novelty Detection Using GANs With Constrained Latent Representations , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  拓海 杉山,et al.  “Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks”の学習報告 , 2017 .

[31]  Yuval Elovici,et al.  DOPING: Generative Data Augmentation for Unsupervised Anomaly Detection with GAN , 2018, 2018 IEEE International Conference on Data Mining (ICDM).

[32]  Danijel Skocaj,et al.  End-to-end training of a two-stage neural network for defect detection , 2020, 2020 25th International Conference on Pattern Recognition (ICPR).

[33]  Bernd Scholz-Reiter,et al.  Design of deep convolutional neural network architectures for automated feature extraction in industrial inspection , 2016 .