Data Representation Based on Interval-Sets for Anomaly Detection in Time Series

Anomaly detection in time series is a popular topic focusing on a variety of applications, which achieves a wealth of results. However, there are many cases of missing anomaly and increased false alarm in most of the existing works. Inspired by the concept of interval-sets, this paper proposes an anomaly detection algorithm called a fuzzy interval-set and tries to detect the potential value of the time series from a new perspective. In the proposed algorithm, a time series will be divided into several subsequences. Each subsequence is regarded as an interval-set depending on its value space and boundary of the subsequence. The similarity measurements between interval sets adopt interval operations and point probability distributions of the interval bounds. In addition, an anomaly score is defined based on similarity results. The experimental results on synthetic and real data sets indicate that the proposed algorithm has better discriminative performance than the piecewise aggregate approximation method and reduces the false alarm rate significantly.

[1]  Clement T. Yu,et al.  Haar Wavelets for Efficient Similarity Search of Time-Series: With and Without Time Warping , 2003, IEEE Trans. Knowl. Data Eng..

[2]  Witold Pedrycz,et al.  The Principle of Justifiable Granularity and an Optimization of Information Granularity Allocation as Fundamentals of Granular Computing , 2011, J. Inf. Process. Syst..

[3]  Charu C. Aggarwal,et al.  Outlier Detection for Temporal Data: A Survey , 2014, IEEE Transactions on Knowledge and Data Engineering.

[4]  Witold Pedrycz,et al.  Granular Data Description: Designing Ellipsoidal Information Granules , 2017, IEEE Transactions on Cybernetics.

[5]  A. Hill,et al.  The North American Animal Disease Spread Model: a simulation model to assist decision making in evaluating animal disease incursions. , 2007, Preventive veterinary medicine.

[6]  VARUN CHANDOLA,et al.  Anomaly detection: A survey , 2009, CSUR.

[7]  Raymond T. Ng,et al.  Distance-based outliers: algorithms and applications , 2000, The VLDB Journal.

[8]  Tim Oates,et al.  Time series anomaly discovery with grammar-based compression , 2015, EDBT.

[9]  V. G. Bondur,et al.  Modern approaches to processing large hyperspectral and multispectral aerospace data flows , 2014, Izvestiya, Atmospheric and Oceanic Physics.

[10]  ZhiWu Li,et al.  Anomaly detection based on a dynamic Markov model , 2017, Information Sciences.

[11]  Rob J Hyndman,et al.  Phenological change detection while accounting for abrupt and gradual trends in satellite image time series , 2010 .

[12]  Miklos A. Vasarhelyi,et al.  Cluster Analysis for Anomaly Detection in Accounting Data: An Audit Approach 1 , 2011 .

[13]  Optimization of interval formulas for approximate integration of set-valued functions monotone with respect to inclusion , 2012 .

[14]  Eamonn J. Keogh,et al.  HOT SAX: efficiently finding the most unusual time series subsequence , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[15]  Eamonn J. Keogh,et al.  Dimensionality Reduction for Fast Similarity Search in Large Time Series Databases , 2001, Knowledge and Information Systems.

[16]  Christos Faloutsos,et al.  Fast Time Sequence Indexing for Arbitrary Lp Norms , 2000, VLDB.

[17]  Christian Callegari,et al.  Combining sketches and wavelet analysis for multi time-scale network anomaly detection , 2011, Comput. Secur..

[18]  Witold Pedrycz,et al.  Building the fundamentals of granular computing: A principle of justifiable granularity , 2013, Appl. Soft Comput..

[19]  Eric Ruggieri,et al.  An exact approach to Bayesian sequential change point detection , 2016, Comput. Stat. Data Anal..

[20]  Fei Tony Liu,et al.  Isolation-Based Anomaly Detection , 2012, TKDD.

[21]  Dan Wang,et al.  Design of granular interval-valued information granules with the use of the principle of justifiable granularity and their applications to system modeling of higher type , 2016, Soft Comput..

[22]  Witold Pedrycz,et al.  Anomaly detection in time series data using a fuzzy c-means clustering , 2013, 2013 Joint IFSA World Congress and NAFIPS Annual Meeting (IFSA/NAFIPS).

[23]  Tetsuya Higuchi,et al.  Anomaly Detection using Multi-channel FLAC for Supporting Diagnosis of ECG , 2012 .

[24]  Jiayin Wang,et al.  Interval-Valued Fuzzy Control , 2010 .

[25]  Yiyu Yao,et al.  Interval-set algebra for qualitative knowledge representation , 1993, Proceedings of ICCI'93: 5th International Conference on Computing and Information.