Feature-based time-series analysis

This work presents an introduction to feature-based time-series analysis. The time series as a data type is first described, along with an overview of the interdisciplinary time-series analysis literature. I then summarize the range of feature-based representations for time series that have been developed to aid interpretable insights into time-series structure. Particular emphasis is given to emerging research that facilitates wide comparison of feature-based representations that allow us to understand the properties of a time-series dataset that make it suited to a particular feature-based representation or analysis algorithm. The future of time-series analysis is likely to embrace approaches that exploit machine learning methods to partially automate human learning to aid understanding of the complex dynamical patterns in the time series we measure from the world.

[1]  Eamonn J. Keogh,et al.  CID: an efficient complexity-invariant distance for time series , 2013, Data Mining and Knowledge Discovery.

[2]  Eamonn J. Keogh,et al.  On the Need for Time Series Data Mining Benchmarks: A Survey and Empirical Demonstration , 2002, Data Mining and Knowledge Discovery.

[3]  Eamonn J. Keogh,et al.  Experimental comparison of representation methods and distance measures for time series data , 2010, Data Mining and Knowledge Discovery.

[4]  Philip H. W. Leong,et al.  Grammar-Based Feature Generation for Time-Series Prediction , 2015 .

[5]  Joseph T. Lizier,et al.  JIDT: An Information-Theoretic Toolkit for Studying the Dynamics of Complex Systems , 2014, Front. Robot. AI.

[6]  Kazunori Matsumoto,et al.  Classification system for time series data based on feature pattern extraction , 2011, 2011 IEEE International Conference on Systems, Man, and Cybernetics.

[7]  Jason Lines,et al.  A shapelet transform for time series classification , 2012, KDD.

[8]  Fred Collopy,et al.  Automatic Identification of Time Series Features for Rule-Based Forecasting , 2001 .

[9]  Craig B. Borkowf,et al.  Time-Series Forecasting , 2002, Technometrics.

[10]  G G Haddad,et al.  Heart rate control in normal and aborted-SIDS infants. , 1993, The American journal of physiology.

[11]  J. Gore,et al.  Mutual information analysis of the EEG in patients with Alzheimer's disease , 2001, Clinical Neurophysiology.

[12]  Nikos E. Mastorakis,et al.  Information processing and technology , 2001 .

[13]  Christos Faloutsos,et al.  Fast subsequence matching in time-series databases , 1994, SIGMOD '94.

[14]  S. Havlin,et al.  Detecting long-range correlations with detrended fluctuation analysis , 2001, cond-mat/0102214.

[15]  Gary M. Weiss Mining with rarity: a unifying framework , 2004, SKDD.

[16]  Eamonn J. Keogh,et al.  Clustering of time-series subsequences is meaningless: implications for previous and future research , 2004, Knowledge and Information Systems.

[17]  David H. Wolpert,et al.  No free lunch theorems for optimization , 1997, IEEE Trans. Evol. Comput..

[18]  Jason Lines,et al.  Time-Series Classification with COTE: The Collective of Transformation-Based Ensembles , 2015, IEEE Transactions on Knowledge and Data Engineering.

[19]  Patrick Schäfer The BOSS is concerned with time series classification in the presence of noise , 2014, Data Mining and Knowledge Discovery.

[20]  Eamonn J. Keogh,et al.  The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances , 2016, Data Mining and Knowledge Discovery.

[21]  Rohit J. Kate Using dynamic time warping distances as features for improved time series classification , 2016, Data Mining and Knowledge Discovery.

[22]  Tim Oates,et al.  RPM: Representative Pattern Mining for Efficient Time Series Classification , 2016, EDBT.

[23]  Xiaozhe Wang,et al.  Characteristic-Based Clustering for Time Series Data , 2006, Data Mining and Knowledge Discovery.

[24]  J. Richman,et al.  Physiological time-series analysis using approximate entropy and sample entropy. , 2000, American journal of physiology. Heart and circulatory physiology.

[25]  George Athanasopoulos,et al.  Forecasting: principles and practice , 2013 .

[26]  Yuan Li,et al.  Rotation-invariant similarity in time series using bag-of-patterns representation , 2012, Journal of Intelligent Information Systems.

[27]  J. A. Stewart,et al.  Nonlinear Time Series Analysis , 2015 .

[28]  Amara Lynn Graps,et al.  An introduction to wavelets , 1995 .

[29]  Fred Collopy,et al.  Rule-Based Forecasting: Development and Validation of an Expert Systems Approach to Combining Time Series Extrapolations , 1992 .

[30]  Bhavik R. Bakshi,et al.  Representation of process trends—IV. Induction of real-time patterns from operating data for diagnosis and supervisory control , 1994 .

[31]  Kemal Leblebicioglu,et al.  Time series classification with feature covariance matrices , 2018, Knowledge and Information Systems.

[32]  P. Young,et al.  Time series analysis, forecasting and control , 1972, IEEE Transactions on Automatic Control.

[33]  Amy McGovern,et al.  Identifying predictive multi-dimensional time series motifs: an application to severe weather prediction , 2010, Data Mining and Knowledge Discovery.

[34]  Eamonn J. Keogh,et al.  Probabilistic discovery of time series motifs , 2003, KDD '03.

[35]  F. Mörchen Time series feature extraction for data mining using DWT and DFT , 2003 .

[36]  Laura J. Grundy,et al.  A dictionary of behavioral motifs reveals clusters of genes affecting Caenorhabditis elegans locomotion , 2012, Proceedings of the National Academy of Sciences.

[37]  K Lehnertz,et al.  Indications of nonlinear deterministic and finite-dimensional structures in time series of brain electrical activity: dependence on recording region and brain state. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[38]  Eamonn J. Keogh,et al.  Mining motifs in massive time series databases , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[39]  Pierre Geurts,et al.  Pattern Extraction for Time Series Classification , 2001, PKDD.

[40]  Eamonn J. Keogh,et al.  LB_Keogh supports exact indexing of shapes under rotation invariance with arbitrary representations and distance measures , 2006, VLDB.

[41]  George C. Runger,et al.  A Bag-of-Features Framework to Classify Time Series , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[42]  H E Stanley,et al.  Statistical properties of DNA sequences. , 1995, Physica A.

[43]  Li Wei,et al.  Experiencing SAX: a novel symbolic representation of time series , 2007, Data Mining and Knowledge Discovery.

[44]  Eamonn J. Keogh,et al.  Towards parameter-free data mining , 2004, KDD.

[45]  T. Warren Liao,et al.  Clustering of time series data - a survey , 2005, Pattern Recognit..

[46]  Max A. Little,et al.  Highly comparative time-series analysis: the empirical structure of time series and their methods , 2013, Journal of The Royal Society Interface.

[47]  Elif Derya Übeyli,et al.  Recurrent neural networks employing Lyapunov exponents for EEG signals classification , 2005, Expert Syst. Appl..

[48]  Henrik Boström,et al.  Boosting interval based literals , 2001, Intell. Data Anal..

[49]  B. Pompe,et al.  Permutation entropy: a natural complexity measure for time series. , 2002, Physical review letters.

[50]  Jens Timmer,et al.  Characteristics of hand tremor time series , 1993, Biological Cybernetics.

[51]  George C. Runger,et al.  A time series forest for classification and feature extraction , 2013, Inf. Sci..

[52]  Abdulhamit Subasi,et al.  EEG signal classification using PCA, ICA, LDA and support vector machines , 2010, Expert Syst. Appl..

[53]  Eamonn J. Keogh,et al.  Making Time-Series Classification More Accurate Using Learned Constraints , 2004, SDM.

[54]  Wolfgang Lehner,et al.  Feature-driven Time Series Generation , 2017, Grundlagen von Datenbanken.

[55]  Fred Collopy,et al.  Rule-Based Forecasting: Using Judgment in Time-Series Extrapolation , 2001 .

[56]  Jason Lines,et al.  Transformation Based Ensembles for Time Series Classification , 2012, SDM.

[57]  Jason Lines,et al.  HIVE-COTE: The Hierarchical Vote Collective of Transformation-Based Ensembles for Time Series Classification , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[58]  Michael Y. Hu,et al.  Forecasting with artificial neural networks: The state of the art , 1997 .

[59]  Kate Smith-Miles,et al.  Visualising forecasting algorithm performance using time series instance spaces , 2017 .

[60]  Eamonn J. Keogh,et al.  Time series shapelets: a new primitive for data mining , 2009, KDD.

[61]  Ben D. Fulcher,et al.  Structural connectome topology relates to regional BOLD signal dynamics in the mouse brain , 2016, bioRxiv.

[62]  Nick S. Jones,et al.  Highly Comparative Feature-Based Time-Series Classification , 2014, IEEE Transactions on Knowledge and Data Engineering.

[63]  Andreas W. Kempa-Liehr,et al.  Distributed and parallel time series feature extraction for industrial big data applications , 2016, ArXiv.

[64]  S M Pincus,et al.  Approximate entropy as a measure of system complexity. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[65]  F. Takens Detecting strange attractors in turbulence , 1981 .

[66]  James Large,et al.  Simulated Data Experiments for Time Series Classification Part 1: Accuracy Comparison with Default Settings , 2017, ArXiv.

[67]  Alice S. French,et al.  Ethoscopes: An open platform for high-throughput ethomics , 2017, bioRxiv.

[68]  Liang Wang,et al.  Structure-Based Statistical Features and Multivariate Time Series Clustering , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[69]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[70]  Tomaso Aste,et al.  Measures of Causality in Complex Datasets with Application to Financial Data , 2014, Entropy.

[71]  Jason Lines,et al.  Time series classification with ensembles of elastic distance measures , 2015, Data Mining and Knowledge Discovery.

[72]  San Cristóbal Mateo,et al.  The Lack of A Priori Distinctions Between Learning Algorithms , 1996 .

[73]  Nuno Constantino Castro,et al.  Time Series Data Mining , 2009, Encyclopedia of Database Systems.

[74]  Donald J. Berndt,et al.  Using Dynamic Time Warping to Find Patterns in Time Series , 1994, KDD Workshop.

[75]  E. S. Gardner EXPONENTIAL SMOOTHING: THE STATE OF THE ART, PART II , 2006 .

[76]  Avinash Kumar,et al.  A Feature Based Neural Network Model for Weather Forecasting , 2007 .

[77]  Michael D. Todd,et al.  Automated Feature Design for Numeric Sequence Classification by Genetic Programming , 2015, IEEE Transactions on Evolutionary Computation.

[78]  Amine Bermak,et al.  Gaussian process for nonstationary time series prediction , 2004, Comput. Stat. Data Anal..

[79]  Nick S. Jones,et al.  Highly comparative fetal heart rate analysis , 2014, 2012 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[80]  Philip S. Yu,et al.  Extracting Interpretable Features for Early Classification on Time Series , 2011, SDM.