Application of Sliding Nest Window Control Chart in Data Stream Anomaly Detection

Since data stream anomaly detection algorithms based on sliding windows are sensitive to the abnormal deviation of individual interference data, this paper presents a sliding nest window chart anomaly detection based on the data stream (SNWCAD-DS) by employing the concept of the sliding window and control chart. By nesting a small sliding window in a large sliding window and analyzing the deviation distance between the small window and the large sliding window, the algorithm increases the out-of-bounds detection ratio and classifies the conceptual drift data stream online. The designed algorithm is simulated on the industrial data stream of drilling engineering. The proposed algorithm SNWCAD is compared with Automatic Outlier Detection for Data Streams (A-ODDS) and Distance-Based Outline Detection for Data Stream (DBOD-DS). The experimental results show that the new algorithm can obtain higher detection accuracy than the compared algorithms. Furthermore, it can shield the influence of individual interference data and satisfy actual engineering needs.

[1]  Francisco Herrera,et al.  A survey on data preprocessing for data stream mining: Current status and future directions , 2017, Neurocomputing.

[2]  Enzo Baccarelli,et al.  Energy-efficient dynamic traffic offloading and reconfiguration of networked data centers for big data stream mobile computing: review, challenges, and a case study , 2016, IEEE Network.

[3]  Sudipto Guha,et al.  Robust Random Cut Forest Based Anomaly Detection on Streams , 2016, ICML.

[4]  Syed Hassan Ahmed,et al.  Multimedia streaming in information-centric networking: A survey and future perspectives , 2017, Comput. Networks.

[5]  Yiyang Pei,et al.  A Cloud-Based Stream Processing Platform for Traffic Monitoring Using Large-Scale Probe Vehicle Data , 2017, 2017 IEEE Wireless Communications and Networking Conference (WCNC).

[6]  Woongsup Kim,et al.  Toward Bulk Synchronous Parallel-Based Machine Learning Techniques for Anomaly Detection in High-Speed Big Data Networks , 2017, Symmetry.

[7]  Rodrigo Fernandes de Mello,et al.  Multidimensional surrogate stability to detect data stream concept drift , 2017, Expert Syst. Appl..

[8]  Meenakshi A.Thalor,et al.  Learning on High Frequency Stock Market Data Using Misclassified Instances in Ensemble , 2016 .

[9]  Michael Lang,et al.  A Low-Complexity Model-Free Approach for Real-Time Cardiac Anomaly Detection Based on Singular Spectrum Analysis and Nonparametric Control Charts , 2018 .

[10]  S. Muthukrishnan,et al.  One-Pass Wavelet Decompositions of Data Streams , 2003, IEEE Trans. Knowl. Data Eng..

[11]  Xu Minqiang,et al.  An Anomaly Detection Algorithm of Satellite Power System Based on CUSUM Control Chart , 2016, 2016 3rd International Conference on Information Science and Control Engineering (ICISCE).

[12]  Michael Hahsler,et al.  Introduction to stream: An Extensible Framework for Data Stream Clustering Research with R , 2017 .

[13]  Mohammad Shojafar,et al.  FR trust: a fuzzy reputation-based model for trust management in semantic P2P grids , 2014, Int. J. Grid Util. Comput..

[14]  Jie Yang,et al.  Mapping Temporal Variables Into the NeuCube for Improved Pattern Recognition, Predictive Modeling, and Understanding of Stream Data , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[15]  Jean Paul Barddal,et al.  A Survey on Ensemble Learning for Data Stream Classification , 2017, ACM Comput. Surv..

[16]  Jin-Fu Lin,et al.  The research of data stream mining and application in fault diagnosis of equipment , 2017 .

[17]  Enzo Baccarelli,et al.  FLAPS: bandwidth and delay-efficient distributed data searching in Fog-supported P2P content delivery networks , 2017, The Journal of Supercomputing.

[18]  Jie Wang,et al.  Fault Diagnosis of Rolling Bearings Based on EWT and KDEC , 2017, Entropy.

[19]  Ke Tang,et al.  A Selective Transfer Learning Method for Concept Drift Adaptation , 2017, ISNN.

[20]  Claudia Canali,et al.  Joint Minimization of the Energy Costs From Computing, Data Transmission, and Migrations in Cloud Data Centers , 2018, IEEE Transactions on Green Communications and Networking.

[21]  Athanasios V. Vasilakos,et al.  Self-Adaptive Pre-Processing Methodology for Big Data Stream Mining in Internet of Things Environmental Sensor Monitoring , 2017, Symmetry.

[22]  Erik Elmroth,et al.  Adaptive Anomaly Detection in Performance Metric Streams , 2018, IEEE Transactions on Network and Service Management.

[23]  Ahmed Patel,et al.  Design and evaluation of a hybrid system for detection and prediction of faults in electrical transformers , 2015 .

[24]  Antonio Krüger,et al.  Inferring Landmarks for Pedestrian Navigation from Mobile Eye-Tracking Data and Google Street View , 2017, CHI Extended Abstracts.

[25]  Le Gruenwald,et al.  DBOD-DS: Distance Based Outlier Detection for Data Streams , 2010, DEXA.

[26]  Agostino Forestiero Self-organizing anomaly detection in data streams , 2016, Inf. Sci..

[27]  Mateu Sbert,et al.  Trajectory Shape Analysis and Anomaly Detection Utilizing Information Theory Tools , 2017, Entropy.

[28]  Ssu-Han Chen,et al.  The gamma CUSUM chart method for online customer churn prediction , 2016, Electron. Commer. Res. Appl..

[29]  Chris Jermaine,et al.  Real-time High Performance Anomaly Detection over Data Streams: Grand Challenge , 2017, DEBS.

[30]  Muhammad Riaz,et al.  Improving the performance of CUSUM charts , 2011, Qual. Reliab. Eng. Int..

[31]  Le Gruenwald,et al.  An Adaptive Outlier Detection Technique for Data Streams , 2011, SSDBM.

[32]  Jeff Calder,et al.  Anomaly detection and classification for streaming data using partial differential equations , 2016, ArXiv.

[33]  Yunhao Liu,et al.  Indexable PLA for Efficient Similarity Search , 2007, VLDB.

[34]  Jing Lin,et al.  Sliding Window-Based Fault Detection From High-Dimensional Data Streams , 2017, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[35]  Olivier Gibaru,et al.  Unsupervised Gesture Segmentation by Motion Detection of a Real-Time Data Stream , 2017, IEEE Transactions on Industrial Informatics.