A self-organising eigenspace map for time series clustering

This paper presents a novel time series clustering method, the self-organising eigenspace map (SOEM), based on a generalisation of the well-known self-organising feature map (SOFM). The SOEM operates on the eigenspaces of the embedded covariance structures of time series which are related directly to modes in those time series. Approximate joint diagonalisation acts as a pseudo-metric across these spaces allowing us to generalise the SOFM to a neural network with matrix input. The technique is empirically validated against three sets of experiments; univariate and multivariate time series clustering, and application to (clustered) multi-variate time series forecasting. Results indicate that the technique performs a valid topologically ordered clustering of the time series. The clustering is superior in comparison to standard benchmarks when the data is non-aligned, gives the best clustering stage for when used in forecasting, and can be used with partial/non-overlapping time series, multivariate clustering and produces a topological representation of the time series objects.

[1]  José G. Dias,et al.  Clustering financial time series: New insights from an extended hidden Markov model , 2015, Eur. J. Oper. Res..

[2]  M. Bronstein,et al.  Matrix commutators: their asymptotic metric properties and relation to approximate joint diagonalization , 2013 .

[3]  Jun Yu Li,et al.  Two Sample Tests for High Dimensional Covariance Matrices , 2012, 1206.0917.

[4]  Christos Faloutsos,et al.  Efficient Similarity Search In Sequence Databases , 1993, FODO.

[5]  Kerry Patterson,et al.  A Comprehensive Causality Test Based on the Singular Spectrum Analysis , 2011 .

[6]  Inderjit S. Dhillon,et al.  Clustering to forecast sparse time-series data , 2015, 2015 IEEE 31st International Conference on Data Engineering.

[7]  Tao Wang,et al.  Modeling daily realized futures volatility with singular spectrum analysis , 2002 .

[8]  S. Sanei,et al.  Tensor Based Singular Spectrum Analysis for Automatic Scoring of Sleep EEG , 2015, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[9]  Peng Xu,et al.  Dynamic-Time-Warping-Based Measurement Data Alignment Model for Condition-Based Railroad Track Maintenance , 2015, IEEE Transactions on Intelligent Transportation Systems.

[10]  M. P. S Bhatia,et al.  Wireless sensor networks for monitoring the environmental activities , 2010, 2010 IEEE International Conference on Computational Intelligence and Computing Research.

[11]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[12]  Lutz Hamel,et al.  Self-Organizing Map Convergence , 2018, Int. J. Serv. Sci. Manag. Eng. Technol..

[13]  Antoine Souloumiac,et al.  Jacobi Angles for Simultaneous Diagonalization , 1996, SIAM J. Matrix Anal. Appl..

[14]  Klaus Obermayer,et al.  Self-organizing maps and clustering methods for matrix data , 2004, Neural Networks.

[15]  Panayotis G. Michaelides,et al.  Business cycles synchronization and clustering in Europe (1960–2009) , 2010 .

[16]  Raja Jayaraman,et al.  Support vector-based algorithms with weighted dynamic time warping kernel function for time series classification , 2015, Knowl. Based Syst..

[17]  Ziv Bar-Joseph,et al.  Clustering short time series gene expression data , 2005, ISMB.

[18]  JeongYoung-Seon,et al.  Support vector-based algorithms with weighted dynamic time warping kernel function for time series classification , 2015 .

[19]  Geeta Sikka,et al.  Recent Techniques of Clustering of Time Series Data: A Survey , 2012 .

[20]  Wen-Liang Hung,et al.  Data Analysis on the Extra-solar Planets Using Robust Clustering , 2006 .

[21]  N. Golyandina,et al.  SSA-based approaches to analysis and forecast of multidimensional time series , 2012 .

[22]  Ying Wah Teh,et al.  Time-series clustering - A decade review , 2015, Inf. Syst..

[23]  Rahim Mahmoudvand,et al.  The benefits of multivariate singular spectrum analysis over the univariate version , 2018, J. Frankl. Inst..

[24]  Éric Gaussier,et al.  Generalized k-means-based clustering for temporal data under weighted and kernel time warp , 2016, Pattern Recognit. Lett..

[25]  Konstantin Usevich,et al.  2D-extension of Singular Spectrum Analysis: algorithm and elements of theory , 2010 .

[26]  Teuvo Kohonen,et al.  Self-organized formation of topologically correct feature maps , 2004, Biological Cybernetics.

[27]  Li Wei,et al.  Experiencing SAX: a novel symbolic representation of time series , 2007, Data Mining and Knowledge Discovery.

[28]  T. Warren Liao,et al.  Clustering of time series data - a survey , 2005, Pattern Recognit..

[29]  Artur Dubrawski,et al.  Classification of Time Sequences using Graphs of Temporal Constraints , 2017, J. Mach. Learn. Res..

[30]  Andrew L. Rukhin,et al.  Analysis of Time Series Structure SSA and Related Techniques , 2002, Technometrics.

[31]  Eamonn J. Keogh,et al.  On the Need for Time Series Data Mining Benchmarks: A Survey and Empirical Demonstration , 2002, Data Mining and Knowledge Discovery.

[32]  Meinard Müller,et al.  Information retrieval for music and motion , 2007 .

[33]  Hamid Reza Mohseni,et al.  Localizing Heart Sounds in Respiratory Signals Using Singular Spectrum Analysis , 2011, IEEE Transactions on Biomedical Engineering.

[34]  Bernd Fritzke,et al.  Kohonen Feature Maps and Growing Cell Structures - a Performance Comparison , 1992, NIPS.

[35]  Mahmut Ozer,et al.  EEG signals classification using the K-means clustering and a multilayer perceptron neural network model , 2011, Expert Syst. Appl..

[36]  Donya Rahmani,et al.  Bayesian singular spectrum analysis with state dependent models , 2017 .

[37]  Michael Flynn,et al.  The UEA multivariate time series classification archive, 2018 , 2018, ArXiv.

[38]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[39]  Alan Bundy,et al.  Dynamic Time Warping , 1984 .

[40]  Donald W. Bouldin,et al.  A Cluster Separation Measure , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[41]  Ada Wai-Chee Fu,et al.  Efficient time series matching by wavelets , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).