FCAD: Feature-based Clipped Representation for Time Series Anomaly Detection

Since time series are a ubiquitous type of data, more and more researchers are investing in time series data mining in recent years. It is generally known that time series have the characteristics of large volume, high dimensionality and ever-increasing. Thus, dimensionality reduction is usually the first step in time series data mining. Over the years, amount of high-level representation approaches have been proposed. In this paper, we propose a novel bit level approximation of time series data, called Feature-based Clipped Representation (FCR), and a similarity measure for FCR which lower bounds the Euclidean distance is introduced. Finally, we propose an anomaly detection approach for time series, called FCAD, which is a novel anomaly detection approach based on FCR and the corresponding similarity measure. Extensive experiments have been conducted to demonstrate the advantages of FCAD in time series anomaly detection.

[1]  Qi Zhang,et al.  Feature-Based Dividing Symbolic Time Series Representation for Streaming Data Processing , 2018, 2018 9th International Conference on Information Technology in Medicine and Education (ITME).

[2]  Eamonn J. Keogh,et al.  Finding Unusual Medical Time-Series Subsequences: Algorithms and Applications , 2006, IEEE Transactions on Information Technology in Biomedicine.

[3]  Jugal K. Kalita,et al.  A Survey of Outlier Detection Methods in Network Anomaly Identification , 2011, Comput. J..

[4]  Eamonn J. Keogh,et al.  HOT SAX: efficiently finding the most unusual time series subsequence , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[5]  Yu Peng,et al.  Intelligent pattern analysis and anomaly detection of satellite telemetry series with improved time series representation , 2018, J. Intell. Fuzzy Syst..

[6]  Wei Luo,et al.  Multi-resolution representation with recurrent neural networks application for streaming time series in IoT , 2019, Comput. Networks.

[7]  Daniel Nikovski,et al.  Exemplar learning for extremely efficient anomaly detection in real-valued time series , 2015, Data Mining and Knowledge Discovery.

[8]  Hans-Peter Kriegel,et al.  LOF: identifying density-based local outliers , 2000, SIGMOD '00.

[9]  Yupeng Hu,et al.  Feature-Based Online Representation Algorithm for Streaming Time Series Similarity Search , 2020, Int. J. Pattern Recognit. Artif. Intell..

[10]  Duong Tuan Anh,et al.  An Improvement of PIP for Time Series Dimensionality Reduction and Its Index Structure , 2010, 2010 Second International Conference on Knowledge and Systems Engineering.

[11]  Tomasz Andrysiak Machine Learning Techniques Applied to Data Analysis and Anomaly Detection in ECG Signals , 2016, Appl. Artif. Intell..

[12]  Tak-chung Fu,et al.  Flexible time series pattern matching based on perceptually important points , 2001 .

[13]  VARUN CHANDOLA,et al.  Anomaly detection: A survey , 2009, CSUR.

[14]  Eamonn J. Keogh,et al.  Dimensionality Reduction for Fast Similarity Search in Large Time Series Databases , 2001, Knowledge and Information Systems.

[15]  Tuan Anh Duong,et al.  Time series similarity search based on Middle points and Clipping , 2011, 2011 3rd Conference on Data Mining and Optimization (DMO).

[16]  Eamonn J. Keogh,et al.  A symbolic representation of time series, with implications for streaming algorithms , 2003, DMKD '03.

[17]  Eamonn J. Keogh,et al.  A Novel Bit Level Time Series Representation with Implication of Similarity Search and Clustering , 2005, PAKDD.