Anomaly detection based on uncertainty fusion for univariate monitoring series

Abstract Detecting the anomalies timely in the condition monitoring data, which are highly relevant to the potential system faults, has become a research focus in many domains. Among the various detection methods available, the prediction-based algorithms are popular without using prior knowledge and expert labels. Additionally, these methods can take the time-ordered specialty into account which is highly significant for time-series-based anomaly detection. However, the detected feedback is binary, especially, due to the influence of inaccurate confidence interval (CI), the false alarm phenomena occur frequently based on predicted models. Thus this paper proposes an Uncertainty Fusion method to realize anomaly detection. Firstly, in order to estimate the data uncertainty, the Gaussian Process Regression (GPR) is applied to perform the prediction with uncertainty presentation. Then, based on the GPR model, the improved k-fold cross-validation is combined to represent the model uncertainty. Moreover, the quantitative error index is designed to provide more detecting information for decision-making. Eventually, the effectiveness of the proposed method are verified by different simulated and open-source data sets, as well as the real application in mobile traffic data detecting. The quantitative results on simulated data experiments show the proposed method can largely eliminate the false alarms without sacrificing much detection rate compared with the basic GPR model. Especially, the experiments on periodic data sets with high Signal Noise Ratio have the better performance. And the mobile traffic data detecting proves the Uncertainty Fusion method can expand the basic GPR model to meet the real industrial requirements.

[1]  Yaguo Lei,et al.  Condition monitoring and fault diagnosis of planetary gearboxes: A review , 2014 .

[2]  Mooi Choo Chuah,et al.  ECG Anomaly Detection via Time Series Analysis , 2007, ISPA Workshops.

[3]  Jianmin Wang,et al.  An anomaly detection approach for multiple monitoring data series based on latent correlation probabilistic model , 2015, Applied Intelligence.

[4]  Maxime Gariel,et al.  Anomaly detection in onboard-recorded flight data using cluster analysis , 2011, 2011 IEEE/AIAA 30th Digital Avionics Systems Conference.

[5]  Byeng D. Youn,et al.  A generic probabilistic framework for structural health prognostics and uncertainty management , 2012 .

[6]  A. Messac,et al.  Predictive quantification of surrogate model fidelity based on modal variations with sample density , 2015 .

[7]  Shingo Mabu,et al.  An Intrusion-Detection Model Based on Fuzzy Class-Association-Rule Mining Using Genetic Network Programming , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[8]  Dingwen Dong,et al.  Mine Gas Emission Prediction based on Gaussian Process Model , 2012 .

[9]  Eamonn J. Keogh,et al.  Finding surprising patterns in a time series database in linear time and space , 2002, KDD.

[10]  R.J.M. Pijpers,et al.  A new approach for structural health monitoring by applying anomaly detection on strain sensor data , 2014, Smart Structures.

[11]  Yu Peng,et al.  Anomaly detection based on data stream monitoring and prediction with improved Gaussian process regression algorithm , 2014, 2014 International Conference on Prognostics and Health Management.

[12]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[13]  Jing Zhou,et al.  A Flexible Framework for Assessing the Quality of Information in Wireless Sensor Networks , 2015, Int. J. Distributed Sens. Networks.

[14]  David J. Hill,et al.  Anomaly detection in streaming environmental sensor data: A data-driven modeling approach , 2010, Environ. Model. Softw..

[15]  Bin Zhang,et al.  Machine Condition Prediction Based on Adaptive Neuro–Fuzzy and High-Order Particle Filtering , 2011, IEEE Transactions on Industrial Electronics.

[16]  Yu Peng,et al.  Multiple optimized online support vector regression for adaptive time series prediction , 2013 .

[17]  Marco Vieira,et al.  Applying Data Mining for Detecting Anomalies in Satellites , 2012, 2012 Ninth European Dependable Computing Conference.

[18]  Y. Kawahara,et al.  Telemetry-mining: a machine learning approach to anomaly detection and fault diagnosis for space systems , 2006, 2nd IEEE International Conference on Space Mission Challenges for Information Technology (SMC-IT'06).

[19]  B. Saha,et al.  Uncertainty Management for Diagnostics and Prognostics of Batteries using Bayesian Techniques , 2008, 2008 IEEE Aerospace Conference.

[20]  Xiao-yun Chen,et al.  Multi-scale anomaly detection algorithm based on infrequent pattern of time series , 2008 .

[21]  Jung-Min Park,et al.  An overview of anomaly detection techniques: Existing solutions and latest technological trends , 2007, Comput. Networks.

[22]  Chao Hu,et al.  Ensemble of data-driven prognostic algorithms for robust prediction of remaining useful life , 2011, 2011 IEEE Conference on Prognostics and Health Management.

[23]  Peter W. Tse,et al.  Anomaly Detection Through a Bayesian Support Vector Machine , 2010, IEEE Transactions on Reliability.

[24]  VARUN CHANDOLA,et al.  Anomaly detection: A survey , 2009, CSUR.

[25]  Markus Bohlin,et al.  Statistical Anomaly Detection for Train Fleets , 2012, AI Mag..

[26]  Witold Pedrycz,et al.  Anomaly detection in time series data using a fuzzy c-means clustering , 2013, 2013 Joint IFSA World Congress and NAFIPS Annual Meeting (IFSA/NAFIPS).

[27]  Yongqian Liu,et al.  Wind power grouping forecasts and its uncertainty analysis using optimized relevance vector machine , 2013 .

[28]  Dieter Fox,et al.  Learning GP-BayesFilters via Gaussian process latent variable models , 2009, Auton. Robots.

[29]  Ying Chen,et al.  A framework with nonlinear system model and nonparametric noise for gas turbine degradation state estimation , 2015 .

[30]  Gabriel Maciá-Fernández,et al.  Anomaly-based network intrusion detection: Techniques, systems and challenges , 2009, Comput. Secur..

[31]  Clement T. Yu,et al.  Haar Wavelets for Efficient Similarity Search of Time-Series: With and Without Time Warping , 2003, IEEE Trans. Knowl. Data Eng..

[32]  Thomas J. Veasey,et al.  Anomaly Detection in Application Performance Monitoring Data , 2014 .

[33]  Ratko Grbic,et al.  Stream water temperature prediction based on Gaussian process regression , 2013, Expert Syst. Appl..

[34]  Fenxiong Chen,et al.  An Anomaly Detection Based on Data Fusion Algorithm in Wireless Sensor Networks , 2015, Int. J. Distributed Sens. Networks.

[35]  Mansour Sheikhan,et al.  Flow-based anomaly detection in high-speed links using modified GSA-optimized neural network , 2012, Neural Computing and Applications.

[36]  Eyal Amir,et al.  Real‐time Bayesian anomaly detection in streaming environmental data , 2007 .