Anomaly Detection in Predictive Maintenance: A New Evaluation Framework for Temporal Unsupervised Anomaly Detection Algorithms

The research in anomaly detection lacks a unified definition of what represents an anomalous instance. Discrepancies in the nature itself of an anomaly lead to multiple paradigms of algorithms design and experimentation. Predictive maintenance is a special case, where the anomaly represents a failure that must be prevented. Related time series research as outlier and novelty detection or time series classification does not apply to the concept of an anomaly in this field, because they are not single points which have not been seen previously and may not be precisely annotated. Moreover, due to the lack of annotated anomalous data, many benchmarks are adapted from supervised scenarios. To address these issues, we generalise the concept of positive and negative instances to intervals to be able to evaluate unsupervised anomaly detection algorithms. We also preserve the imbalance scheme for evaluation through the proposal of the Preceding Window ROC, a generalisation for the calculation of ROC curves for time series scenarios. We also adapt the mechanism from a established time series anomaly detection benchmark to the proposed generalisations to reward early detection. Therefore, the proposal represents a flexible evaluation framework for the different scenarios. To show the usefulness of this definition, we include a case study of Big Data algorithms with a real-world time series problem provided by the company ArcelorMittal, and compare the proposal with an evaluation method.

[1]  Subutai Ahmad,et al.  Evaluating Real-Time Anomaly Detection Algorithms -- The Numenta Anomaly Benchmark , 2015, 2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA).

[2]  Nidhi Singh,et al.  Demystifying Numenta anomaly benchmark , 2017, 2017 International Joint Conference on Neural Networks (IJCNN).

[3]  Tomás Pevný,et al.  Loda: Lightweight on-line detector of anomalies , 2016, Machine Learning.

[4]  Mohak Shah,et al.  Deep learning on symbolic representations for large-scale heterogeneous time-series event prediction , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[5]  Huawen Liu,et al.  Recent Progress of Anomaly Detection , 2019, Complex..

[6]  Weiming Shen,et al.  An inquiry into the predictability of failure events in chillers and boilers , 2019, 2019 IEEE 15th International Conference on Automation Science and Engineering (CASE).

[7]  Yue Zhao,et al.  XGBOD: Improving Supervised Outlier Detection with Unsupervised Representation Learning , 2018, 2018 International Joint Conference on Neural Networks (IJCNN).

[8]  VARUN CHANDOLA,et al.  Anomaly detection: A survey , 2009, CSUR.

[9]  Miss A.O. Penney (b) , 1974, The New Yale Book of Quotations.

[10]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[11]  Stanley B. Zdonik,et al.  Precision and Recall for Range-Based Anomaly Detection , 2018, ArXiv.

[12]  Stanley B. Zdonik,et al.  Precision and Recall for Time Series , 2018, NeurIPS.

[13]  Rita P. Ribeiro,et al.  Sequential anomalies: a study in the Railway Industry , 2016, Machine Learning.

[14]  Uwe Glässer,et al.  Dynamic Attack Scoring Using Distributed Local Detectors , 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[15]  Takuya Akiba,et al.  Optuna: A Next-generation Hyperparameter Optimization Framework , 2019, KDD.

[16]  T. Lumley,et al.  Time‐Dependent ROC Curves for Censored Survival Data and a Diagnostic Marker , 2000, Biometrics.

[17]  J. Carrasco,et al.  Recent Trends in the Use of Statistical Tests for Comparing Swarm and Evolutionary Computing Algorithms: Practical Guidelines and a Critical Review , 2020, Swarm Evol. Comput..

[18]  Charu C. Aggarwal,et al.  Outlier Analysis , 2013, Springer New York.

[19]  Zidong Wang,et al.  Inferring nonlinear lateral flow immunoassay state-space models via an unscented Kalman filter , 2016, Science China Information Sciences.

[20]  Adriano Lorena Inácio de Oliveira,et al.  Novelty detection with constructive probabilistic neural networks , 2008, Neurocomputing.

[21]  Zhizhong Mao,et al.  Outlier detection based on a dynamic ensemble model: Applied to process monitoring , 2019, Inf. Fusion.

[22]  Seiichi Uchida,et al.  A Comparative Evaluation of Unsupervised Anomaly Detection Algorithms for Multivariate Data , 2016, PloS one.

[23]  Jie Cao,et al.  Time Series Modeling of Nano-Gold Immunochromatographic Assay via Expectation Maximization Algorithm , 2013, IEEE Transactions on Biomedical Engineering.

[24]  Amparo Alonso-Betanzos,et al.  One-Class Convex Hull-Based Algorithm for Classification in Distributed Environments , 2020, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[25]  Nick Craswell,et al.  Precision at n , 2009, Encyclopedia of Database Systems.

[26]  Jose A. Lozano,et al.  Analyzing rare event, anomaly, novelty and outlier detection terms under the supervised classification framework , 2019, Artificial Intelligence Review.

[27]  Francisco Herrera,et al.  otsad: A package for online time-series anomaly detectors , 2020, Neurocomputing.

[28]  Andreas Dengel,et al.  Histogram-based Outlier Score (HBOS): A fast Unsupervised Anomaly Detection Algorithm , 2012 .

[29]  George Yannis,et al.  Predicting road accidents: a rare-events modeling approach , 2016 .

[30]  Rassoul Noorossana,et al.  An overview of dynamic anomaly detection in social networks via control charts , 2018, Qual. Reliab. Eng. Int..

[31]  Francisco Herrera,et al.  Learning from Imbalanced Data Sets , 2018, Springer International Publishing.