A PCA-based Kernel for Kernel PCA on Multivariate Time Series

Multivariate time series (MTS) data sets are common in various multimedia, medical and financial application domains. These applications perform several data-analysis operations on large number of MTS data sets such as similarity searches, feature-subset-selection, cluster ing and classification. Inherently, an MTS item has a large number of dimensions. Hence, before applying data mining techniques, some form of dimension reduction, e.g., feature extraction, should be performed. Principal Component Analysis (PCA) is one of the techniques that have been frequently utilized for dimension reduction. However, traditional PCA does not scale well in terms of dimensionality, and therefore may not be applied to MTS data sets. The Kernel PCA technique addresses this problem of scalability by utilizing the kernel trick. In this paper, we propose a PCA based kernel to be employed for the Kernel PCA technique on the MTS data sets, termed KEros, which is based on Eros, a PCA based similarity measure for MTS data sets. We evaluate the performance of KEros using Support Vector Machine (SVM), and compare the performance with Kernel PCA using linear kernel and Generalized Principal Component (GPCA). The experimental results show that KEros outperforms these other techniques in terms of classificati on

[1]  M. Aizerman,et al.  Theoretical Foundations of the Potential Function Method in Pattern Recognition Learning , 1964 .

[2]  H. Begleiter,et al.  Event related potentials during object recognition tasks , 1995, Brain Research Bulletin.

[3]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[4]  Bernhard Schölkopf,et al.  Nonlinear Component Analysis as a Kernel Eigenvalue Problem , 1998, Neural Computation.

[5]  L. K. Hansen,et al.  On Clustering fMRI Time Series , 1999, NeuroImage.

[6]  Z. Meral Özsoyoglu,et al.  Indexing large metric spaces for similarity search queries , 1999, TODS.

[7]  Gunnar Rätsch,et al.  An introduction to kernel-based learning algorithms , 2001, IEEE Trans. Neural Networks.

[8]  Xiaohui Liu,et al.  Variable grouping in multivariate time series via correlation , 2001, IEEE Trans. Syst. Man Cybern. Part B.

[9]  A. Corradini,et al.  Dynamic time warping for off-line recognition of a small gesture vocabulary , 2001, Proceedings IEEE ICCV Workshop on Recognition, Analysis, and Tracking of Faces and Gestures in Real-Time Systems.

[10]  Mohammed Waleed Kadous,et al.  Temporal classification: extending the classification paradigm to multivariate time series , 2002 .

[11]  Vladimir Pavlovic,et al.  Discovering clusters in motion time-series data , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[12]  Cyrus Shahabi,et al.  AIMS: An Immersidata Management System , 2003, CIDR.

[13]  Tanveer F. Syeda-Mahmood,et al.  View-invariant alignment and matching of video sequences , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[14]  Nello Cristianini,et al.  Learning the Kernel Matrix with Semidefinite Programming , 2002, J. Mach. Learn. Res..

[15]  Jieping Ye,et al.  Linear projection methods in face recognition under unconstrained illuminations: a comparative study , 2004, CVPR 2004.

[16]  Cyrus Shahabi,et al.  A PCA-based similarity measure for multivariate time series , 2004, MMDB '04.

[17]  Jieping Ye,et al.  GPCA: an efficient dimension reduction scheme for image compression and retrieval , 2004, KDD.

[18]  Bernhard Schölkopf,et al.  Methods Towards Invasive Human Brain Computer Interfaces , 2004, NIPS.

[19]  Bruno Sinopoli,et al.  A kernel-based learning approach to ad hoc sensor network localization , 2005, TOSN.

[20]  Cyrus Shahabi,et al.  On the stationarity of multivariate time series for correlation-based data analysis , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[21]  Cyrus Shahabi,et al.  Feature subset selection and feature ranking for multivariate time series , 2005, IEEE Transactions on Knowledge and Data Engineering.

[22]  Heikki Mannila,et al.  Principles of Data Mining , 2001, Undergraduate Topics in Computer Science.