Decomposing time series into deterministic and stochastic influences: A survey

Abstract Temporal data produced by industrial, human, and natural phenomena typically contain deterministic and stochastic influences, being the first ideally modelled using Dynamical Systems while the second is appropriately addressed using Statistical tools. Although such influences have been widely studied as individual components, specific tools are required to support their decomposition for a proper modeling and analysis. This article addresses a comprehensive survey of the main time-series decomposition strategies and their relative performances in different application domains. The following strategies are discussed: i) Fourier Transform, ii) Wavelet transforms, iii) Moving Average, iv) Singular Spectrum Analysis, v) Lazy, vi) GHKSS, and vii) other approaches based on the Empirical Mode Decomposition method. In order to assess these strategies, we employ diverse and complementary performance measures: i) Mean Absolute Error, Mean Squared and Root Mean Squared Errors; ii) Minkowski Distances; iii) Complexity-Invariant Distance; iv) Pearson correlation; v) Mean Distance from the Diagonal Line; and vi) Mean Distance from Attractors. Each decomposition strategy is better devoted to particular scenarios, however, without any previous knowledge on data, GHKSS confirmed to work as a fair and general baseline besides its time complexity.

[1]  Schreiber,et al.  Nonlinear noise reduction: A case study on experimental data. , 1993, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[2]  R. Vautard,et al.  Singular spectrum analysis in nonlinear dynamics, with applications to paleoclimatic time series , 1989 .

[3]  Igor Vajda,et al.  Estimation of the Information by an Adaptive Partitioning of the Observation Space , 1999, IEEE Trans. Inf. Theory.

[4]  A. Haar Zur Theorie der orthogonalen Funktionensysteme , 1910 .

[5]  R. Palaniappan,et al.  Utilizing Gamma Band to Improve Mental Task Based Brain-Computer Interface Design , 2006, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[6]  Toni Giorgino,et al.  Matching incomplete time series with dynamic time warping: an algorithm and an application to post-stroke rehabilitation , 2009, Artif. Intell. Medicine.

[7]  Eamonn J. Keogh,et al.  CID: an efficient complexity-invariant distance for time series , 2013, Data Mining and Knowledge Discovery.

[8]  C L Webber,et al.  Dynamical assessment of physiological systems and states using recurrence plot strategies. , 1994, Journal of applied physiology.

[9]  Andreas Ziehe,et al.  An approach to blind source separation based on temporal structure of speech signals , 2001, Neurocomputing.

[10]  Ouarda Raaf,et al.  Pattern recognition filtering and bidimensional FFT-based detection of storms in meteorological radar images , 2012, Digit. Signal Process..

[11]  Michele Marchesi,et al.  Fast wavelet transform assisted predictors of streaming time series , 2017, Digit. Signal Process..

[12]  P. Grassberger,et al.  On noise reduction methods for chaotic data. , 1993, Chaos.

[13]  Christopher M. Bishop,et al.  Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[14]  Manohar Das,et al.  An efficient technique for modeling and synthesis of automotive engine sounds , 2001, IEEE Trans. Ind. Electron..

[15]  Maria Papadopouli,et al.  Singular spectrum analysis of traffic workload in a large-scale wireless lan , 2007, MSWiM '07.

[16]  José Luis Pons Rovira,et al.  Empirical mode decomposition: a novel technique for the study of tremor time series , 2006, Medical and Biological Engineering and Computing.

[17]  Ran Tao,et al.  Are there significant linkages between two series of housing prices, money supply and short-term international capital? - Evidence from China , 2018, Digit. Signal Process..

[18]  VARUN CHANDOLA,et al.  Anomaly detection: A survey , 2009, CSUR.

[19]  Rodrigo Fernandes de Mello,et al.  A Systematic Literature Review on Decomposition Approaches to Estimate Time Series Components , 2012 .

[20]  Rodrigo Fernandes de Mello,et al.  Applying Empirical Mode Decomposition and mutual information to separate stochastic and deterministic influences embedded in signals , 2016, Signal Process..

[21]  Johnny S. K. Wong,et al.  A task migration algorithm for heterogeneous distributed computing systems , 1998, J. Syst. Softw..

[22]  Holger Kantz,et al.  Practical implementation of nonlinear time series methods: The TISEAN package. , 1998, Chaos.

[23]  Rodrigo Fernandes de Mello,et al.  Applying a kernel function on time-dependent data to provide supervised-learning guarantees , 2017, Expert Syst. Appl..

[24]  Thomas Schreiber,et al.  FAST NONLINEAR PROJECTIVE FILTERING IN A DATA STREAM , 1998 .

[25]  Meinard Müller,et al.  Information retrieval for music and motion , 2007 .

[26]  Rodrigo Fernandes de Mello,et al.  Improving time series modeling by decomposing and analyzing stochastic and deterministic influences , 2013, Signal Process..

[27]  Holger Lange,et al.  Estimating determinism rates to detect patterns in geospatial datasets , 2015 .

[28]  Jürgen Kurths,et al.  Recurrence plots for the analysis of complex systems , 2009 .

[29]  J. L. Holloway Smoothing and Filtering of Time Series and Space Fields , 1958 .

[30]  J. Zbilut,et al.  Embeddings and delays as derived from quantification of recurrence plots , 1992 .

[31]  F. Takens Detecting strange attractors in turbulence , 1981 .

[32]  J. Elsner Analysis of Time Series Structure: SSA and Related Techniques , 2002 .

[33]  D. Gabor,et al.  Theory of communication. Part 1: The analysis of information , 1946 .

[34]  Ka-Ming Lau,et al.  Climate Signal Detection Using Wavelet Transform: How to Make a Time Series Sing , 1995 .

[35]  João Gama,et al.  A survey on concept drift adaptation , 2014, ACM Comput. Surv..

[36]  Leonardo A. Mozelli,et al.  Improving reconstruction of time-series based in Singular Spectrum Analysis: A segmentation approach , 2017, Digit. Signal Process..

[37]  Schreiber,et al.  Extremely simple nonlinear noise-reduction method. , 1993, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[38]  Rodrigo Fernandes de Mello,et al.  Machine Learning: A Practical Approach on the Statistical Learning Theory , 2018 .

[39]  V. Pan Structured Matrices and Polynomials: Unified Superfast Algorithms , 2001 .

[40]  D. Dickey,et al.  Testing for unit roots in autoregressive-moving average models of unknown order , 1984 .

[41]  A. Walden,et al.  Wavelet Methods for Time Series Analysis , 2000 .

[42]  Merlin Suarez,et al.  Predicting Academic Emotions Based on Brainwaves, Mouse Behaviour and Personality Profile , 2012, PRICAI.

[43]  Jarek Gryz,et al.  Algorithms and analyses for maximal vector computation , 2007, The VLDB Journal.

[44]  A. Savitzky,et al.  Smoothing and Differentiation of Data by Simplified Least Squares Procedures. , 1964 .

[45]  Yunong Zhang,et al.  Colder-winter monthly-temperature forecasting in general trend of global warming via multiple sine functions decomposition , 2017, 2017 36th Chinese Control Conference (CCC).

[46]  Beverly Rubik,et al.  Neurofeedback-enhanced gamma brainwaves from the prefrontal cortical region of meditators and non-meditators and associated subjective experiences. , 2011, Journal of alternative and complementary medicine.

[47]  Michael J. Black,et al.  A Framework for Robust Subspace Learning , 2003, International Journal of Computer Vision.

[48]  N. Huang,et al.  The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis , 1998, Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[49]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[50]  Y. Amihud,et al.  Illiquidity and Stock Returns II: Cross-Section and Time-Series Effects , 2018, The Review of Financial Studies.

[51]  Abdulhamit Subasi,et al.  EEG signal classification using wavelet feature extraction and a mixture of expert model , 2007, Expert Syst. Appl..

[52]  Rodrigo Fernandes de Mello,et al.  On learning guarantees to unsupervised concept drift detection on data streams , 2019, Expert Syst. Appl..

[53]  Rodrigo Fernandes de Mello,et al.  Time Series Decomposition Using Spring System Applied on Phase Spaces , 2018, BRACIS.

[54]  Jacob Benesty,et al.  Pearson Correlation Coefficient , 2009 .

[55]  Amara Lynn Graps,et al.  An introduction to wavelets , 1995 .

[56]  Rodrigo Fernandes de Mello,et al.  Multidimensional surrogate stability to detect data stream concept drift , 2017, Expert Syst. Appl..

[57]  Schreiber,et al.  Noise reduction in chaotic time-series data: A survey of common methods. , 1993, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[58]  Cagatay Candan,et al.  The discrete fractional Fourier transform , 2000, IEEE Trans. Signal Process..

[59]  H. Kantz,et al.  Nonlinear time series analysis , 1997 .

[60]  A. J. Jerri The Shannon sampling theorem—Its various extensions and applications: A tutorial review , 1977, Proceedings of the IEEE.

[61]  C. Nelson,et al.  A new approach to decomposition of economic time series into permanent and transitory components with particular attention to measurement of the ‘business cycle’☆ , 1981 .

[62]  Gabriel Rilling,et al.  Empirical mode decomposition as a filter bank , 2004, IEEE Signal Processing Letters.

[63]  Jing Hu,et al.  Denoising Nonlinear Time Series by Adaptive Filtering and Wavelet Shrinkage: A Comparison , 2010, IEEE Signal Processing Letters.

[64]  D. Ruelle,et al.  Recurrence Plots of Dynamical Systems , 1987 .

[65]  George E. P. Box,et al.  Time Series Analysis: Forecasting and Control , 1977 .

[66]  J. Richman,et al.  Physiological time-series analysis using approximate entropy and sample entropy. , 2000, American journal of physiology. Heart and circulatory physiology.

[67]  Peter Bloomfield,et al.  Fourier Analysis of Time Series: An Introduction , 1977 .

[68]  G Pulina,et al.  Time series autoregressive integrated moving average modeling of test-day milk yields of dairy ewes. , 2000, Journal of dairy science.