A Feature Extraction Method for Multivariate Time Series Classification Using Temporal Patterns

Multiple variables and high dimensions are two main challenges for classification of Multivariate Time Series (MTS) data. In order to overcome these challenges, feature extraction should be performed before performing classification. However, the existing feature extraction methods lose the important correlations among the variables while reducing high dimensions of MTS. Hence, in this paper, we propose a new feature extraction method combined with different classifiers to provide a general classification strategy for MTS data which can be applied for different area problems of MTS data. The proposed algorithm can handle data of high feature dimensions efficiently with unequal length and discover the relationship within the same and between different component univariate time series for MTS data. Hence, the proposed feature extraction method is application-independent and therefore does not depend on domain knowledge of relevant features or assumption about underling data models. We evaluate the algorithm on one synthetic dataset and two real-world datasets. The comparison experimental result shows that the proposed algorithm can achieve higher classification accuracy and F-measure value.

[1]  Keith C. C. Chan,et al.  Clustering and re-clustering for Pattern Discovery in Gene Expression Data , 2005, J. Bioinform. Comput. Biol..

[2]  Pei-Gee Peter Ho Multivariate Time Series Support Vector Machine for Multispectral Remote Sensing Image Classifications , 2009 .

[3]  Xiaoqing Weng Classification of multivariate time series using supervised neighborhood preserving embedding , 2013, 2013 25th Chinese Control and Decision Conference (CCDC).

[4]  Akira Hayashi,et al.  Embedding Time Series Data for Classification , 2005, MLDM.

[5]  T. Warren Liao,et al.  Clustering of time series data - a survey , 2005, Pattern Recognit..

[6]  Cyrus Shahabi,et al.  A PCA-based similarity measure for multivariate time series , 2004, MMDB '04.

[7]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[8]  Chinatsu Aone,et al.  Fast and effective text mining using linear-time document clustering , 1999, KDD '99.

[9]  Hui Zhang,et al.  Blind Feature Extraction for Time-Series Classification Using Haar Wavelet Transform , 2005, ISNN.

[10]  Nathan Intrator,et al.  A Study of Ensemble of Hybrid Networks with Strong Regularization , 2003, Multiple Classifier Systems.

[11]  Latifur Khan,et al.  Feature Selection for Classification of Variable Length Multiattribute Motions , 2007 .

[12]  Hong Zhu,et al.  A survey on feature extraction for pattern recognition , 2011, Artificial Intelligence Review.

[13]  R. Laubenbacher,et al.  Discretization of Time Course Data , 2006 .

[14]  J. Nazuno Haykin, Simon. Neural networks: A comprehensive foundation, Prentice Hall, Inc. Segunda Edición, 1999 , 2000 .

[15]  Juan José Rodríguez Diez,et al.  Support vector machines of interval-based features for time series classification , 2005, Knowl. Based Syst..

[16]  Kenneth Koehler,et al.  Multivariate Statistical Analysis Textbook and Software Course Web Page Computational Resources Optional Sas Books Computational Resources Accommodations for Disabilities Academic Dishonesty Dead Week Harassment and Discrimination Religious Accommodation Outline Outline Introduction , .

[17]  R. Suganya,et al.  Data Mining Concepts and Techniques , 2010 .

[18]  Jian Pei,et al.  A brief survey on sequence classification , 2010, SKDD.

[19]  Junyi Shen,et al.  Classification of multivariate time series using locality preserving projections , 2008, Knowl. Based Syst..

[20]  Jiawei Han,et al.  Orthogonal Laplacianfaces for Face Recognition , 2006, IEEE Transactions on Image Processing.

[21]  Chi Hau Chen,et al.  Multivariate AR Model based Support Vector Machine for Multispectral Remote Sensing Image Classification , 2008, IGARSS 2008 - 2008 IEEE International Geoscience and Remote Sensing Symposium.

[22]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[23]  Sahin Albayrak,et al.  Pattern recognition and classification for multivariate time series , 2011, SensorKDD '11.

[24]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[25]  Andrew K. C. Wong,et al.  DECA: A Discrete-Valued Data Clustering Algorithm , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Juan José Rodríguez Diez,et al.  Support vector machines of interval-based features for time series classification , 2004, Knowl. Based Syst..

[27]  John McGee,et al.  Discretization of Time Series Data , 2005, J. Comput. Biol..