Functional echo state network for time series classification

Abstract Echo state networks (ESNs) are a new approach to recurrent neural networks (RNNs) that have been successfully applied in many domains. Nevertheless, an ESN is a predictive model rather than a classifier, and methods to employ ESNs in time series classification (TSC) tasks have not yet been fully explored. In this paper, we propose a novel ESN approach named functional echo state network (FESN) for time series classification. The basic idea behind FESN is to replace the numeric variable output weights of an ESN with time-varying output-weight functions and introduce a temporal aggregation operator to the output layer that can project temporal signals into discrete class labels, thereby transforming the ESN from a predictive model into a true classifier. Subsequently, to learn the output-weight functions, a spatio-temporal aggregation learning algorithm is proposed based on orthogonal function basis expansion. By leveraging the nonlinear mapping capacity of a reservoir and the accumulation of temporal information in the time domain, FESN can not only enhance the separability of different classes in a high-dimensional functional space but can also consider the relative importance of temporal data at different time steps according to dynamic output-weight functions. Theoretical analyses and experiments on an extensive set of UCR data were conducted on FESN. The results show that FESN yields better performance than single-algorithm methods, has comparable accuracy with ensemble-based methods and exhibits acceptable computational complexity. Interestingly, for some time series datasets, we visualized some interpretable features extracted by FESN via specific patterns within the output-weight functions.

[1]  Robert A. Legenstein,et al.  2007 Special Issue: Edge of chaos and prediction of computational performance for neural circuit models , 2007 .

[2]  Henry Leung,et al.  Data-driven based model for flow prediction of steam system in steel industry , 2012, Inf. Sci..

[3]  Shan Liu,et al.  An effective multivariate time series classification approach using echo state network and adaptive differential evolution algorithm , 2016, Expert Syst. Appl..

[4]  Jason Lines,et al.  Time-Series Classification with COTE: The Collective of Transformation-Based Ensembles , 2015, IEEE Trans. Knowl. Data Eng..

[5]  Eamonn J. Keogh,et al.  Experimental comparison of representation methods and distance measures for time series data , 2010, Data Mining and Knowledge Discovery.

[6]  George C. Runger,et al.  A time series forest for classification and feature extraction , 2013, Inf. Sci..

[7]  Edgar S. García-Treviño,et al.  Structural Generative Descriptions for Time Series Classification , 2014, IEEE Transactions on Cybernetics.

[8]  Harald Haas,et al.  Harnessing Nonlinearity: Predicting Chaotic Systems and Saving Energy in Wireless Communication , 2004, Science.

[9]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[10]  Ya-Ju Fan,et al.  Pattern- and Network-Based Classification Techniques for Multichannel Medical Data Signals to Improve Brain Diagnosis , 2011, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[11]  Shu-Xian Lun,et al.  A new echo state network with variable memory length , 2016, Inf. Sci..

[12]  Yi Zheng,et al.  Convolutional Nonlinear Neighbourhood Components Analysis for Time Series Classification , 2015, PAKDD.

[13]  Shie Mannor,et al.  Time Series Analysis Using Geometric Template Matching , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Ali Deihimi,et al.  Application of echo state networks in short-term electric load forecasting , 2012 .

[15]  Ada Wai-Chee Fu,et al.  Efficient time series matching by wavelets , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[16]  Yiannis Demiris,et al.  Echo State Gaussian Process , 2011, IEEE Transactions on Neural Networks.

[17]  James Large,et al.  The Great Time Series Classification Bake Off: An Experimental Evaluation of Recently Proposed Algorithms. Extended Version , 2016, ArXiv.

[18]  Enrico Zio,et al.  Quantifying the reliability of fault classifiers , 2014, Inf. Sci..

[19]  Lei Chen,et al.  Robust and fast similarity search for moving object trajectories , 2005, SIGMOD '05.

[20]  Dimitrios Gunopulos,et al.  Discovering similar multidimensional trajectories , 2002, Proceedings 18th International Conference on Data Engineering.

[21]  Peter Tiño,et al.  Simple Deterministically Constructed Cycle Reservoirs with Regular Jumps , 2012, Neural Computation.

[22]  Nick S. Jones,et al.  Highly Comparative Feature-Based Time-Series Classification , 2014, IEEE Transactions on Knowledge and Data Engineering.

[23]  Qian-Li Ma,et al.  Modular state space of echo state network , 2013, Neurocomputing.

[24]  Anthony K. H. Tung,et al.  SpADe: On Shape-based Pattern Detection in Streaming Time Series , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[25]  Herbert Jaeger,et al.  Reservoir computing approaches to recurrent neural network training , 2009, Comput. Sci. Rev..

[26]  Benjamin Schrauwen,et al.  Connectivity, Dynamics, and Memory in Reservoir Computing with Binary and Analog Neurons , 2010, Neural Computation.

[27]  Qinghua Hu,et al.  Kernel sparse representation for time series classification , 2015, Inf. Sci..

[28]  Hui Ding,et al.  Querying and mining of time series data: experimental comparison of representations and distance measures , 2008, Proc. VLDB Endow..

[29]  Jun Wang,et al.  Chaotic Time Series Prediction Based on a Novel Robust Echo State Network , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[30]  Hüseyin Kaya,et al.  A distance based time series classification framework , 2015, Inf. Syst..

[31]  Alexander Schliep,et al.  Constrained mixture estimation for analysis and robust classification of clinical time series , 2009, Bioinform..

[32]  Jason Lines,et al.  Transformation Based Ensembles for Time Series Classification , 2012, SDM.

[33]  Jignesh M. Patel,et al.  An efficient and accurate method for evaluating time series similarity , 2007, SIGMOD '07.

[34]  Pierre Geurts,et al.  Pattern Extraction for Time Series Classification , 2001, PKDD.

[35]  George C. Runger,et al.  A Bag-of-Features Framework to Classify Time Series , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36]  Donald J. Berndt,et al.  Using Dynamic Time Warping to Find Patterns in Time Series , 1994, KDD Workshop.

[37]  Lior Rokach,et al.  Fast and space-efficient shapelets-based time-series classification , 2015, Intell. Data Anal..

[38]  Lei Chen,et al.  On The Marriage of Lp-norms and Edit Distance , 2004, VLDB.

[39]  Yannis Theodoridis,et al.  Index-based Most Similar Trajectory Search , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[40]  Herbert Jaeger,et al.  Controlling Recurrent Neural Networks by Conceptors , 2014, ArXiv.

[41]  Jukka Kortelainen,et al.  Invariant trajectory classification of dynamical systems with a case study on ECG , 2009, Pattern Recognit..

[42]  Xingui He,et al.  Process Neural Networks: Theory and Applications , 2010 .

[43]  John G. Harris,et al.  Automatic speech recognition using a predictive echo state network classifier , 2007, Neural Networks.

[44]  Yaochu Jin,et al.  Modeling neural plasticity in echo state networks for classification and regression , 2016, Inf. Sci..

[45]  Olufemi A. Omitaomu,et al.  Weighted dynamic time warping for time series classification , 2011, Pattern Recognit..

[46]  Rohit J. Kate Using dynamic time warping distances as features for improved time series classification , 2016, Data Mining and Knowledge Discovery.

[47]  Tomasz Górecki,et al.  Using derivatives in a longest common subsequence dissimilarity measure for time series classification , 2014, Pattern Recognit. Lett..

[48]  Huanhuan Chen,et al.  Model Metric Co-Learning for Time Series Classification , 2015, IJCAI.

[49]  John G. Harris,et al.  Minimum mean squared error time series classification using an echo state network prediction model , 2006, 2006 IEEE International Symposium on Circuits and Systems.

[50]  Qinghua Hu,et al.  Dynamic time warping constraint learning for large margin nearest neighbor classification , 2011, Inf. Sci..

[51]  Eamonn J. Keogh,et al.  Time series shapelets: a new primitive for data mining , 2009, KDD.

[52]  Carlo Vercellis,et al.  Combining discrete SVM and fixed cardinality warping distances for multivariate time series classification , 2010, Pattern Recognit..

[53]  Jason Lines,et al.  Time series classification with ensembles of elastic distance measures , 2015, Data Mining and Knowledge Discovery.

[54]  Jason Lines,et al.  A shapelet transform for time series classification , 2012, KDD.

[55]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[56]  Cécile Amblard,et al.  Classification trees for time series , 2012, Pattern Recognit..

[57]  Eamonn J. Keogh,et al.  A Complexity-Invariant Distance Measure for Time Series , 2011, SDM.

[58]  Christos Faloutsos,et al.  Fast subsequence matching in time-series databases , 1994, SIGMOD '94.

[59]  Benjamin Schrauwen,et al.  An experimental unification of reservoir computing methods , 2007, Neural Networks.

[60]  Tomasz Górecki,et al.  Non-isometric transforms in time series classification using DTW , 2014, Knowl. Based Syst..

[61]  Chao Lu,et al.  Time Series Shapelet Classification Based Online Short-Term Voltage Stability Assessment , 2016, IEEE Transactions on Power Systems.