Frequent State Transition Patterns of Multivariate Time Series

Sequence pattern discovery is a key issue in multivariate time series analysis. Popular approaches first obtain the pattern of each single-variate time series and then obtain cross-variate associations. In this paper, we consider different variables at the same time during pattern construction, and propose a new type of pattern called State Transition pAttern with Periodic wildcard gaps (STAP). Compared to previous types, STAP reveals stronger cross associations among different variables and provides better interpretability for decision makers. We design an approach with two stages, namely frequent state discovery and pattern synthesis, to obtain frequent STAPs. We propose two pre-pruning and an Apriori-pruning techniques to speed up pattern discovery. We also propose two post-pruning techniques to simplify the output and a visualization way to support expert decision. Experimental results on four real-world datasets demonstrate 1) STAP captures the cross and temporal associations; 2) the five pruning and pattern synthesis techniques are quite effective; and 3) visualization technique greatly increases the readability of STAP.

[1]  Ruoxi Xiang,et al.  Complex network analysis of time series , 2014 .

[2]  Asok Ray,et al.  Symbolic time series analysis via wavelet-based partitioning , 2006, Signal Process..

[3]  Xindong Wu,et al.  The Apriori property of sequence pattern mining with wildcard gaps , 2010 .

[4]  Jiucheng Xu,et al.  An Adaptive Density Peaks Clustering Method With Fisher Linear Discriminant , 2019, IEEE Access.

[5]  Xindong Wu,et al.  Pattern Matching with Flexible Wildcards , 2014, Journal of Computer Science and Technology.

[6]  Kuniaki Uehara,et al.  Discovery of Time-Series Motif from Multi-Dimensional Data Based on MDL Principle , 2005, Machine Learning.

[7]  J. Rotton,et al.  Air pollution, weather, and violent crimes: concomitant time-series analysis of archival data. , 1985, Journal of personality and social psychology.

[8]  Li Wei,et al.  Experiencing SAX: a novel symbolic representation of time series , 2007, Data Mining and Knowledge Discovery.

[9]  Xindong Wu,et al.  NOSEP: Nonoverlapping Sequence Pattern Mining With Gap Constraints , 2018, IEEE Transactions on Cybernetics.

[10]  Kunhuang Huarng,et al.  Effective lengths of intervals to improve forecasting in fuzzy time series , 2001, Fuzzy Sets Syst..

[11]  Raj Bhatnagar,et al.  Discovery of Temporal Dependencies between Frequent Patterns in Multivariate Time Series , 2007, 2007 IEEE Symposium on Computational Intelligence and Data Mining.

[12]  B. J Hne,et al.  Spatio - temporal Image Processing: Theory and Scientific Applications , 1991 .

[13]  Jiadong Ren,et al.  Mining sequential patterns with periodic wildcard gaps , 2014, Applied Intelligence.

[14]  Gerhard Fohler,et al.  Handling aperiodic tasks in diverse real-time systems via plug-ins , 2002, Proceedings Fifth IEEE International Symposium on Object-Oriented Real-Time Distributed Computing. ISIRC 2002.

[15]  Bing Shi,et al.  Regression-based three-way recommendation , 2017, Inf. Sci..

[16]  Jiawei Han,et al.  Efficient mining of partial periodic patterns in time series database , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[17]  Fan Min,et al.  A two-stage discretization algorithm based on information entropy , 2017, Applied Intelligence.

[18]  M. Ghil,et al.  Interdecadal oscillations and the warming trend in global temperature time series , 1991, Nature.

[19]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[20]  P. Robinson Non-linear regression for multiple time-series , 1972, Journal of Applied Probability.

[21]  Eamonn J. Keogh,et al.  A symbolic representation of time series, with implications for streaming algorithms , 2003, DMKD '03.

[22]  Trevelyan J. McKinley,et al.  Model selection for time series of count data , 2018, Comput. Stat. Data Anal..

[23]  Kyoung-jae Kim,et al.  Financial time series forecasting using support vector machines , 2003, Neurocomputing.

[24]  Michael I. Jordan,et al.  Hierarchical Dirichlet Processes , 2006 .

[26]  Andrew K. C. Wong,et al.  Discovery of Temporal Associations in Multivariate Time Series , 2014, IEEE Transactions on Knowledge and Data Engineering.

[27]  John Langbein,et al.  Improved efficiency of maximum likelihood analysis of time series with temporally correlated errors , 2017, Journal of Geodesy.

[28]  R. Palmer,et al.  Time series properties of an artificial stock market , 1999 .

[29]  Craig J. Forsyth,et al.  economic fluctuation and crime: a time-series analysis of the effects of oil development in the coastal regions of Louisiana , 2007 .

[30]  S. Chatterjee,et al.  Regression Analysis by Example , 1979 .

[31]  William Brendel,et al.  Activities as Time Series of Human Postures , 2010, ECCV.

[32]  Moncef Gabbouj,et al.  Epileptic Seizure Classification of EEG Time-Series Using Rational Discrete Short-Time Fourier Transform , 2015, IEEE Transactions on Biomedical Engineering.

[33]  Fan Min,et al.  State Transition Pattern with Periodic Wildcard Gaps , 2018, 2018 IEEE International Conference on Big Knowledge (ICBK).

[34]  Min Wang,et al.  Discovering Patterns With Weak-Wildcard Gaps , 2016, IEEE Access.

[35]  Ming Lei,et al.  A high efficient AprioriTid algorithm for mining association rule , 2005, 2005 International Conference on Machine Learning and Cybernetics.

[36]  Allan R. Willms,et al.  An algorithm for continuous piecewise linear bounding of discrete time series data , 2014, BIT Numerical Mathematics.

[37]  George Karabatis,et al.  Discrete wavelet transform-based time series analysis and mining , 2011, CSUR.

[38]  Giancarlo Valente,et al.  Multivariate analysis of fMRI time series: classification and regression of brain responses using machine learning. , 2008, Magnetic resonance imaging.

[39]  Bent Nielsen,et al.  Asymptotic Theory of Outlier Detection Algorithms for Linear Time Series Regression Models , 2016 .

[40]  Karl J. Friston,et al.  Analysis of fMRI Time-Series Revisited , 1995, NeuroImage.

[41]  T. Higuchi Approach to an irregular time series on the basis of the fractal theory , 1988 .

[42]  Gordon Reikard Predicting solar radiation at high resolutions: A comparison of time series forecasts , 2009 .

[43]  B. Chissom,et al.  Fuzzy time series and its models , 1993 .

[44]  Tak-Chung Fu,et al.  A review on time series data mining , 2011, Eng. Appl. Artif. Intell..

[45]  Boon Leng Cheong,et al.  A Time Series Weather Radar Simulator Based on High-Resolution Atmospheric Models , 2008 .

[46]  M. Obersteiner,et al.  Forecasting electricity spot-prices using linear univariate time-series models , 2004 .

[47]  Henrik André-Jönsson,et al.  Using Signature Files for Querying Time-Series Data , 1997, PKDD.

[48]  Arild Saasen,et al.  Effects of time and shear energy on the rheological behaviour of oilwell cement slurries , 2000 .

[49]  Korbinian Strimmer,et al.  Identifying periodically expressed transcripts in microarray time series data , 2008, Bioinform..

[50]  Xindong Wu,et al.  Pattern matching with wildcards and length constraints using maximum network flow , 2015, J. Discrete Algorithms.

[51]  Philip Hans Franses,et al.  Periodic Time Series Models , 1996 .