Survey of Methods for Time Series Symbolic Aggregate Approximation

Time series analysis is widely used in finance, medicine, and climate monitoring. However, the high dimensionality of time series makes them inconvenient to work with in practice. Symbolic representation, a feature representation method for time series, was proposed to address this dimensionality problem, and it plays an important role in time series classification, clustering, pattern matching, anomaly detection, and related tasks. In this paper, existing symbolic representation methods for time series were reviewed and compared. First, the principle of classical symbolic aggregate approximation (SAX) and its deficiencies were analyzed. Then, several improved SAX methods, including aSAX, SMSAX, ESAX, and others, were introduced and classified, and an experimental evaluation of the existing SAX methods was given. Finally, some unresolved issues of existing SAX methods were summarized as directions for future work.
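For orientation, the classical SAX principle mentioned above can be sketched as three steps: z-normalize the series, average it over equal-width segments (piecewise aggregate approximation), and map each segment mean to a letter using breakpoints that split the standard normal distribution into equiprobable regions. The sketch below is illustrative only and is not taken from the surveyed paper; the function name `sax`, its parameters, and the requirement that the series length be divisible by the number of segments are assumptions made for brevity.

```python
from bisect import bisect_right
from statistics import NormalDist, mean, pstdev


def sax(series, n_segments, alphabet_size):
    """Illustrative classical SAX (hypothetical helper, not from the paper)."""
    if len(series) % n_segments != 0:
        # Simplifying assumption: equal-width segments only.
        raise ValueError("series length must be divisible by n_segments")

    # Step 1: z-normalize (a constant series maps to all zeros).
    mu, sigma = mean(series), pstdev(series)
    z = [0.0 if sigma == 0 else (x - mu) / sigma for x in series]

    # Step 2: piecewise aggregate approximation (mean of each segment).
    width = len(z) // n_segments
    paa = [mean(z[i * width:(i + 1) * width]) for i in range(n_segments)]

    # Step 3: breakpoints cutting N(0, 1) into equiprobable regions.
    normal = NormalDist()
    breakpoints = [normal.inv_cdf(k / alphabet_size)
                   for k in range(1, alphabet_size)]

    # Each segment mean becomes the letter of the region it falls in.
    return "".join(chr(ord("a") + bisect_right(breakpoints, v)) for v in paa)


print(sax([1, 2, 3, 4, 5, 6, 7, 8], n_segments=4, alphabet_size=4))  # abcd
```

Here an 8-point series is reduced to a 4-letter word. The improved methods named in the abstract modify one or more of these steps, for example the breakpoints or the per-segment features.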
