Correlation based dynamic time warping of multivariate time series

In recent years, dynamic time warping (DTW) has begun to become the most widely used technique for comparison of time series data where extensive a priori knowledge is not available. However, it is often expected a multivariate comparison method to consider the correlation between the variables as this correlation carries the real information in many cases. Thus, principal component analysis (PCA) based similarity measures, such as PCA similarity factor (SPCA), are used in many industrial applications. In this paper, we present a novel algorithm called correlation based dynamic time warping (CBDTW) which combines DTW and PCA based similarity measures. To preserve correlation, multivariate time series are segmented and the local dissimilarity function of DTW originated from SPCA. The segments are obtained by bottom-up segmentation using special, PCA related costs. Our novel technique qualified on two databases, the database of signature verification competition 2004 and the commonly used AUSLAN dataset. We show that CBDTW outperforms the standard SPCA and the most commonly used, Euclidean distance based multivariate DTW in case of datasets with complex correlation structure.

[1]  Eamonn J. Keogh,et al.  Making Time-Series Classification More Accurate Using Learned Constraints , 2004, SDM.

[2]  A. Asuncion,et al.  UCI Machine Learning Repository, University of California, Irvine, School of Information and Computer Sciences , 2007 .

[3]  Amy McGovern,et al.  Identifying predictive multi-dimensional time series motifs: an application to severe weather prediction , 2010, Data Mining and Knowledge Discovery.

[4]  Berrin A. Yanikoglu,et al.  Identity authentication using improved online signature verification method , 2005, Pattern Recognit. Lett..

[5]  Eamonn J. Keogh,et al.  An online algorithm for segmenting time series , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[6]  Eamonn J. Keogh,et al.  Three Myths about Dynamic Time Warping Data Mining , 2005, SDM.

[7]  Hannu Toivonen,et al.  Estimating the number of segments in time series data using permutation tests , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[8]  D. Seborg,et al.  Clustering multivariate time‐series data , 2005 .

[9]  Mohammed Waleed Kadous,et al.  Temporal classification: extending the classification paradigm to multivariate time series , 2002 .

[10]  Cyrus Shahabi,et al.  A PCA-based similarity measure for multivariate time series , 2004, MMDB '04.

[11]  Heikki Mannila,et al.  Time series segmentation for context recognition in mobile devices , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[12]  János Abonyi,et al.  Modified Gath-Geva clustering for fuzzy segmentation of multivariate time-series , 2005, Fuzzy Sets Syst..

[13]  Dale E Seborg,et al.  Fault Detection and Diagnosis in an Industrial Fed‐Batch Cell Culture Process , 2007, Biotechnology progress.

[14]  W. Krzanowski Between-Groups Comparison of Principal Components , 1979 .

[15]  Sauli Kivikunnas,et al.  Overview of Process Trend Analysis Methods and Applications , 1998 .

[16]  Eamonn J. Keogh,et al.  Scaling up Dynamic Time Warping to Massive Dataset , 1999, PKDD.

[17]  Toni Giorgino,et al.  Computing and Visualizing Dynamic Time Warping Alignments in R: The dtw Package , 2009 .

[18]  D. Seborg,et al.  Pattern Matching in Historical Data , 2002 .

[19]  George M. Church,et al.  Aligning gene expression time series with time warping algorithms , 2001, Bioinform..

[20]  Eamonn J. Keogh,et al.  On the Need for Time Series Data Mining Benchmarks: A Survey and Empirical Demonstration , 2002, Data Mining and Knowledge Discovery.

[21]  Toni Giorgino,et al.  Matching incomplete time series with dynamic time warping: an algorithm and an application to post-stroke rehabilitation , 2009, Artif. Intell. Medicine.

[22]  Cyrus Shahabi,et al.  An efficient k nearest neighbor search for multivariate time series , 2007, Inf. Comput..

[23]  Hong Chang,et al.  SVC2004: First International Signature Verification Competition , 2004, ICBA.

[24]  Eamonn J. Keogh,et al.  Derivative Dynamic Time Warping , 2001, SDM.

[25]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[26]  Theodora Kourti,et al.  Application of latent variable methods to process control and multivariate statistical process control in industry , 2005 .

[27]  L. R. Rasmussen,et al.  In information retrieval: data structures and algorithms , 1992 .

[28]  Paul S. Heckbert,et al.  Survey of Polygonal Surface Simplification Algorithms , 1997 .

[29]  Sándor Németh,et al.  Principal Component Analysis based Time Series Segmentation { A New Sensor Fusion Algorithm , 2004 .

[30]  Hiroaki Sakoe,et al.  A Dynamic Programming Approach to Continuous Speech Recognition , 1971 .

[31]  Roy Rada,et al.  Expert systems and evolutionary computing for financial investing: A review , 2008, Expert Syst. Appl..