An anomaly detection framework for dynamic systems using a Bayesian hierarchical framework

Complex systems are susceptible to many types of anomalies, faults, and abnormal behavior caused by a variety of off-nominal conditions that may ultimately result in major failures or catastrophic events. Early and accurate detection of these anomalies using system inputs and outputs collected from sensors and smart devices has become a challenging problem and an active area of research in many application domains. In this article, we present a new Bayesian hierarchical framework that is able to model the relationship between system inputs (sensor measurements) and outputs (response variables) without imposing strong distributional/parametric assumptions while using only a subset of training samples and sensor attributes. Then, an optimal cost-sensitive anomaly detection framework is proposed to determine whether a sample is an anomalous one taking into consideration the trade-off between misclassification errors and detection rates. The model can be used for both supervised and unsupervised settings depending on the availability of data regarding the behavior of the system under anomaly conditions. The unsupervised model is particularly useful when it is prohibitive to identify in advance the anomalies that a system may present and where no data are available regarding the behavior of the system under anomaly conditions. A Bayesian hierarchical setting is used to structure the proposed framework and help with accommodating uncertainty, imposing interpretability, and controlling the sparsity and complexity of the proposed anomaly detection framework. A Markov chain Monte Carlo algorithm is also developed for model training using past data. The numerical experiments conducted using a simulated data set and a wind turbine data set demonstrate the successful application of the proposed work for system response modeling and anomaly detection.

[1]  R. Stine Bootstrap Prediction Intervals for Regression , 1985 .

[2]  E. Nadaraya On Estimating Regression , 1964 .

[3]  David Draper,et al.  GPU-accelerated Gibbs sampling: a case study of the Horseshoe Probit model , 2016, Stat. Comput..

[4]  M. Aizerman,et al.  Theoretical Foundations of the Potential Function Method in Pattern Recognition Learning , 1964 .

[5]  Laetitia Chapel,et al.  Anomaly Detection with Score Functions Based on the Reconstruction Error of the Kernel PCA , 2014, ECML/PKDD.

[6]  Zhihua Ma,et al.  Bayesian methods for dealing with missing data problems , 2018, Journal of the Korean Statistical Society.

[7]  Roy E. Welsch,et al.  Anomaly detection via a Gaussian Mixture Model for flight operation and safety monitoring , 2016 .

[8]  Miguel A. Sanz-Bobi,et al.  Failure Risk Indicators for a Maintenance Model Based on Observable Life of Industrial Components With an Application to Wind Turbines , 2013, IEEE Transactions on Reliability.

[9]  Alice M. Agogino,et al.  Design of machine learning models with domain experts for automated sensor selection for energy fault detection , 2019, Applied Energy.

[10]  Hongzhi Wang,et al.  Anomaly Detection in Gas Turbine Fuel Systems Using a Sequential Symbolic Method , 2017 .

[11]  Eunshin Byon,et al.  Wind turbine operations and maintenance: a tractable approximation of dynamic decision making , 2013 .

[12]  Xiao Lei,et al.  A generalized model for wind turbine anomaly identification based on SCADA data , 2016 .

[13]  Peter D. Turney Cost-Sensitive Classification: Empirical Evaluation of a Hybrid Genetic Decision Tree Induction Algorithm , 1994, J. Artif. Intell. Res..

[14]  Jing Lin,et al.  An angle-based subspace anomaly detection approach to high-dimensional data: With an application to industrial fault detection , 2015, Reliab. Eng. Syst. Saf..

[15]  Qiang Chen,et al.  Attack–norm separation for detecting attack‐induced quality problems on computers and networks , 2007, Qual. Reliab. Eng. Int..

[16]  Roderick J. A. Little,et al.  Statistical Analysis with Missing Data , 1988 .

[17]  Oliver Niggemann,et al.  Self-Organizing Maps for Anomaly Localization and Predictive Maintenance in Cyber-Physical Production Systems , 2018 .

[18]  P. Kavitha,et al.  Anomaly based intrusion detection for 802.11 networks with optimal features using SVM classifier , 2016, Wirel. Networks.

[19]  Sergio Martín-Martínez,et al.  Wind turbine reliability: A comprehensive review towards effective condition monitoring development , 2018, Applied Energy.

[20]  Marc G. Genton,et al.  Power Curve Estimation With Multivariate Environmental Factors for Inland and Offshore Wind Farms , 2015 .

[21]  Christos Faloutsos,et al.  Fraud Detection in Comparison-Shopping Services: Patterns and Anomalies in User Click Behaviors , 2017, IEICE Trans. Inf. Syst..

[22]  Deni Torres Román,et al.  Using Generalized Entropies and OC-SVM with Mahalanobis Kernel for Detection and Classification of Anomalies in Network Traffic , 2015, Entropy.

[23]  Peyman Mazidi,et al.  A health condition model for wind turbine monitoring through neural networks and proportional hazard models , 2017 .

[24]  Miguel A. Sanz-Bobi,et al.  Behavior Anomaly Indicators Based on Reference Patterns—Application to the Gearbox and Electrical Generator of a Wind Turbine , 2018 .

[25]  Xuan Li,et al.  Probabilistic framework of visual anomaly detection for unbalanced data , 2016, Neurocomputing.

[26]  Raymond T. Ng,et al.  Distance-based outliers: algorithms and applications , 2000, The VLDB Journal.

[27]  António Ramos Andrade,et al.  Statistical modelling of railway track geometry degradation using Hierarchical Bayesian models , 2015, Reliab. Eng. Syst. Saf..

[28]  Fu Xiao,et al.  Analytical investigation of autoencoder-based methods for unsupervised anomaly detection in building energy data , 2018 .

[29]  Eunshin Byon,et al.  Condition Monitoring of Wind Power System With Nonparametric Regression Analysis , 2014, IEEE Transactions on Energy Conversion.

[30]  Tommy W. S. Chow,et al.  Anomaly Detection and Fault Prognosis for Bearings , 2016, IEEE Transactions on Instrumentation and Measurement.

[31]  Yu Peng,et al.  Anomaly detection based on uncertainty fusion for univariate monitoring series , 2017 .

[32]  Rassoul Noorossana,et al.  An overview of dynamic anomaly detection in social networks via control charts , 2018, Qual. Reliab. Eng. Int..

[33]  Chiman Kwan,et al.  A Novel Cluster Kernel RX Algorithm for Anomaly and Change Detection Using Hyperspectral Images , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[34]  Esmaeil S. Nadimi,et al.  Bayesian state prediction of wind turbine bearing failure , 2018 .

[35]  Hao Yan,et al.  Anomaly Detection in Images With Smooth Background via Smooth-Sparse Decomposition , 2017, Technometrics.

[36]  Clayton D. Scott,et al.  Robust kernel density estimation , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[37]  David Wenzhong Gao,et al.  Condition Parameter Modeling for Anomaly Detection in Wind Turbines , 2014 .

[38]  Kai Goebel,et al.  Bayesian hierarchical model-based prognostics for lithium-ion batteries , 2018, Reliab. Eng. Syst. Saf..

[39]  Peyman Mazidi,et al.  Wind turbine prognostics and maintenance management based on a hybrid approach of neural networks and a proportional hazards model , 2017 .

[40]  Chein-I. Chang,et al.  Anomaly Detection Outperforms Logistic Regression in Predicting Outcomes in Trauma Patients , 2017, Prehospital emergency care : official journal of the National Association of EMS Physicians and the National Association of State EMS Directors.

[41]  Enrico Zio,et al.  A support vector machine integrated system for the classification of operation anomalies in nuclear components and systems , 2007, Reliab. Eng. Syst. Saf..

[42]  Danny Hendler,et al.  Metric Anomaly Detection via Asymmetric Risk Minimization , 2011, SIMBAD.

[43]  Chengliang Liu,et al.  Wind turbines abnormality detection through analysis of wind farm power curves , 2016 .

[44]  Fei Tony Liu,et al.  Isolation-Based Anomaly Detection , 2012, TKDD.

[45]  Hamid R. Zarandi,et al.  Context-Aware Anomaly Detection in Embedded Systems , 2017, DepCoS-RELCOMEX.

[46]  Sofiane Achiche,et al.  Wind turbine condition monitoring based on SCADA data using normal behavior models. Part 1: System description , 2013, Appl. Soft Comput..

[47]  Marc G. Genton,et al.  A kernel plus method for quantifying wind turbine performance upgrades , 2015 .

[48]  VARUN CHANDOLA,et al.  Anomaly detection: A survey , 2009, CSUR.

[49]  Chao Liu,et al.  An unsupervised spatiotemporal graphical modeling approach for wind turbine condition monitoring , 2018, Renewable Energy.

[50]  Sandro Bologna,et al.  Safeguarding information intensive critical infrastructures against novel types of emerging failures , 2007, Reliab. Eng. Syst. Saf..

[51]  R. Schmoyer Asymptotically valid prediction intervals for linear models , 1992 .