Semi-supervised feature extraction with local temporal regularization for EEG classification

Extreme energy ratio (EER) is a recently proposed feature extractor to learn spatial filters for electroencephalogram (EEG) signal classification. It is theoretically equivalent and computationally superior to the common spatial patterns (CSP) method which is a widely used technique in brain-computer interfaces (BCIs). However, EER may seriously overfit on small training sets due to the presence of large noise. Moreover, it is a totally supervised method that cannot take advantage of unlabeled data. To overcome these limitations, we propose a regularization constraint utilizing local temporal information of unlabeled trails. It can encourage the temporal smoothness of source signals discovered, and thus alleviate their tendency to overfit. By combining this regularization trick with the EER method, we present a semi-supervised feature extractor termed semi-supervised extreme energy ratio (SEER). After solving two eigenvalue decomposition problems, SEER recovers latent source signals that not only have discriminative energy features but also preserve the local temporal structure of test trails. Compared to the features found by EER, the energy features of these source signals have a stronger generalization ability, as shown by the experimental results. As a nonlinear extension of SEER, we further present the kernel SEER and provide the derivation of its solutions.

[1]  Mikhail Belkin,et al.  Manifold Regularization : A Geometric Framework for Learning from Examples , 2004 .

[2]  Klaus-Robert Müller,et al.  Classifying Single Trial EEG: Towards Brain Computer Interfacing , 2001, NIPS.

[3]  Shiliang Sun,et al.  An optimal kernel feature extractor and its application to EEG signal classification , 2006, Neurocomputing.

[4]  Lucas C. Parra,et al.  Recipes for the linear analysis of EEG , 2005, NeuroImage.

[5]  Mikhail Belkin,et al.  Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples , 2006, J. Mach. Learn. Res..

[6]  Shiliang Sun,et al.  The Extreme Energy Ratio Criterion for EEG Feature Extraction , 2008, ICANN.

[7]  Bernhard Schölkopf,et al.  Classifying Event-Related Desynchronization in EEG, ECoG and MEG Signals , 2006, DAGM-Symposium.

[8]  Xiaofei He,et al.  Locality Preserving Projections , 2003, NIPS.

[9]  G. Pfurtscheller,et al.  Designing optimal spatial filters for single-trial EEG classification in a movement task , 1999, Clinical Neurophysiology.

[10]  D. Tucker,et al.  EEG coherency. I: Statistics, reference electrode, volume conduction, Laplacians, cortical imaging, and interpretation at multiple scales. , 1997, Electroencephalography and clinical neurophysiology.

[11]  Jdel.R. Millan,et al.  On the need for on-line learning in brain-computer interfaces , 2004, 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541).

[12]  Gunnar Rätsch,et al.  An introduction to kernel-based learning algorithms , 2001, IEEE Trans. Neural Networks.