Representational primitives using trend based global features for time series classification

Abstract Feature based learning of time series sequences contains a systematic step of preprocessing, representing and analyzing the properties of time series elements. Representational features include the mapping of time series properties namely trend, seasonality and stationarity. Usually, the segmented generation of representational structures does not contain the global features of a time series sequence which can influence the learning algorithms. Global information of each time series sequence reinforces the respective segmental properties present in it. Identifying, extracting and processing of global features which are common to all time series sequences are challenging tasks in time series feature learning. Hence, we propose a novel set of global features which provides an additional representational leverage to feature based time series learning scenarios. The feature enriched primitives can provide an additional information on the global trend pattern in each of the time series sequences. This enables the learning algorithms to process the time series sequences with the awareness of trend information. We formed a minimum number of most influential trend features which describe the behavior of time series sequences. Thus the dimensionality of the features are preserved which influence the performance of various learning algorithms. The experiments on this novel representational structures are performed on UCR-2018 time series archive which contains 128 datasets. We also represented the trend sequences in a pictorial form named positional size diagram (PSD) and aggregated all the instances of the datasets into an auxiliary data representation named positional dataset (PD). We compared six traditional classification algorithms namely k-nearest neighbor (k-NN), logistic regression (LR), support vector (SV), decision tree (DC), gaussian naive bayes (GNB) and random forest (RF) with trendlets. The additional set of global features enrich the trendlets with supplementary information about the trend of time series sequences. The classification accuracy of the aforementioned algorithms shows a significant improvement with this additional set of global features.

[1]  Pierpaolo D'Urso,et al.  Fuzzy clustering with spatial–temporal information , 2019, Spatial Statistics.

[2]  G. R. Gangadharan,et al.  Trendlets: A novel probabilistic representational structures for clustering the time series data , 2020, Expert Syst. Appl..

[3]  Marc Boullé,et al.  Feature Extraction over Multiple Representations for Time Series Classification , 2013, NFMCP.

[4]  Marimuthu Palaniswami,et al.  Fuzzy c-Shape: A new algorithm for clustering finite time series waveforms , 2016, 2017 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[5]  Qin Zhang,et al.  Time series feature learning with labeled and unlabeled data , 2019, Pattern Recognit..

[6]  Bonny Banerjee,et al.  Unsupervised Feature Learning From Time-Series Data Using Linear Models , 2018, IEEE Internet of Things Journal.

[7]  Hamdi Yahyaoui,et al.  A novel trend based SAX reduction technique for time series , 2019, Expert Syst. Appl..

[8]  Wolfgang Lehner,et al.  Feature-based comparison and generation of time series , 2018, SSDBM.

[9]  Pedro Antonio Gutiérrez,et al.  Time-Series Clustering Based on the Characterization of Segment Typologies , 2018, IEEE Transactions on Cybernetics.

[10]  Weining Zhang,et al.  An iterative end point fitting based trend segmentation representation of time series and its distance measure , 2020, Multimedia Tools and Applications.

[11]  Andreas W. Kempa-Liehr,et al.  Time Series FeatuRe Extraction on basis of Scalable Hypothesis tests (tsfresh - A Python package) , 2018, Neurocomputing.

[12]  Max A. Little,et al.  Highly comparative time-series analysis: the empirical structure of time series and their methods , 2013, Journal of The Royal Society Interface.

[13]  Ying Zhang,et al.  Discriminative extraction of features from time series , 2018, Neurocomputing.

[14]  Nuo Li,et al.  Detecting and locating patterns in time series using machine learning , 2019 .

[15]  Seiichi Uchida,et al.  Time series classification using local distance-based features in multi-modal fusion networks , 2020, Pattern Recognit..

[16]  Li Wei,et al.  Experiencing SAX: a novel symbolic representation of time series , 2007, Data Mining and Knowledge Discovery.

[17]  Klaus Moessner,et al.  Univariate and Multivariate Time Series Manifold Learning , 2017, Knowl. Based Syst..

[18]  Yannis Manolopoulos,et al.  Continuous Trend-Based Classification of Streaming Time Series , 2005, ADBIS.

[19]  Ben D. Fulcher,et al.  Feature-based time-series analysis , 2017, ArXiv.

[20]  George C. Runger,et al.  A Bag-of-Features Framework to Classify Time Series , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Jacques Klein,et al.  Extracting Statistical Graph Features for Accurate and Efficient Time Series Classification , 2018, EDBT.