Deep learning for EEG-based Motor Imagery classification: Accuracy-cost trade-off

Electroencephalography (EEG) datasets are often small and high dimensional, owing to cumbersome recording processes. In these conditions, powerful machine learning techniques are essential to deal with the large amount of information and overcome the curse of dimensionality. Artificial Neural Networks (ANNs) have achieved promising performance in EEG-based Brain-Computer Interface (BCI) applications, but they involve computationally intensive training algorithms and hyperparameter optimization methods. Thus, an awareness of the quality-cost trade-off, although usually overlooked, is highly beneficial. In this paper, we apply a hyperparameter optimization procedure based on Genetic Algorithms to Convolutional Neural Networks (CNNs), Feed-Forward Neural Networks (FFNNs), and Recurrent Neural Networks (RNNs), all of them purposely shallow. We compare their relative quality and energy-time cost, but we also analyze the variability in the structural complexity of networks of the same type with similar accuracies. The experimental results show that the optimization procedure improves accuracy in all models, and that CNN models with only one hidden convolutional layer can equal or slightly outperform a 6-layer Deep Belief Network. FFNN and RNN were not able to reach the same quality, although the cost was significantly lower. The results also highlight the fact that size within the same type of network is not necessarily correlated with accuracy, as smaller models can and do match, or even surpass, bigger ones in performance. In this regard, overfitting is likely a contributing factor since deep learning approaches struggle with limited training examples.

[1]  Marco Zaffalon,et al.  Time for a change: a tutorial for comparing multiple classifiers through Bayesian analysis , 2016, J. Mach. Learn. Res..

[2]  J. Q. Gan,et al.  Multiresolution analysis over simple graphs for brain computer interfaces , 2013, Journal of neural engineering.

[3]  A. E. Eiben,et al.  Introduction to Evolutionary Computing , 2003, Natural Computing Series.

[4]  Gordon Cheng,et al.  Validating Deep Neural Networks for Online Decoding of Motor Imagery Movements from EEG Signals , 2018, Sensors.

[5]  Shuicheng Yan,et al.  Learning Temporal Information for Brain-Computer Interface Using Convolutional Neural Networks , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[6]  Travis E. Oliphant,et al.  Guide to NumPy , 2015 .

[7]  Ingrid Daubechies,et al.  Ten Lectures on Wavelets , 1992 .

[8]  Aimin Jiang,et al.  LSTM-Based EEG Classification in Motor Imagery Tasks , 2018, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[9]  Kapil D. Katyal,et al.  Individual finger control of a modular prosthetic limb using high-density electrocorticography in a human subject , 2016, Journal of neural engineering.

[10]  U. Rajendra Acharya,et al.  Deep convolutional neural network for the automated detection and diagnosis of seizure using EEG signals , 2017, Comput. Biol. Medicine.

[11]  Jon A. Mukand,et al.  Neuronal ensemble control of prosthetic devices by a human with tetraplegia , 2006, Nature.

[12]  Mohammad Hassan Moradi,et al.  A new approach for EEG feature extraction in P300-based lie detection , 2009, Comput. Methods Programs Biomed..

[13]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[14]  S. Holm A Simple Sequentially Rejective Multiple Test Procedure , 1979 .

[15]  Eran Yahav,et al.  On the Practical Computational Power of Finite Precision RNNs for Language Recognition , 2018, ACL.

[16]  O. Woolpert Biological Sciences , 1980, Nature.

[17]  Ram Bilas Pachori,et al.  Classification of Seizure and Nonseizure EEG Signals Using Empirical Mode Decomposition , 2012, IEEE Transactions on Information Technology in Biomedicine.

[18]  Jürgen Schmidhuber,et al.  Framewise phoneme classification with bidirectional LSTM and other neural network architectures , 2005, Neural Networks.

[19]  Anil K. Jain,et al.  Small Sample Size Effects in Statistical Pattern Recognition: Recommendations for Practitioners , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  Antonio F. Díaz,et al.  Parallel high-dimensional multi-objective feature selection for EEG classification with dynamic workload balancing on CPU–GPU architectures , 2017, Cluster Computing.

[21]  Sung Chan Jun,et al.  EEG datasets for motor imagery brain–computer interface , 2017, GigaScience.

[22]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[23]  Matthew D. Lieberman,et al.  Does Rejection Hurt? An fMRI Study of Social Exclusion , 2003, Science.

[24]  Hui Wang,et al.  A multi-class EEG-based BCI classification using multivariate empirical mode decomposition based filtering and Riemannian geometry , 2018, Expert Syst. Appl..

[25]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[26]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[27]  Rizos Sakellariou,et al.  A Survey of Power and Energy Predictive Models in HPC Systems and Applications , 2017, ACM Comput. Surv..

[28]  Edoardo Amaldi,et al.  On the Approximability of Minimizing Nonzero Variables or Unsatisfied Relations in Linear Systems , 1998, Theor. Comput. Sci..

[29]  Andrés Ortiz,et al.  Classification of motor imagery tasks for BCI with multiresolution analysis and multiobjective feature selection , 2016, Biomedical engineering online.

[30]  David G Hendrickson,et al.  Differential analysis of gene regulation at transcript resolution with RNA-seq , 2012, Nature Biotechnology.

[31]  G. Comi,et al.  Long Term Natural History Data in Ambulant Boys with Duchenne Muscular Dystrophy: 36-Month Changes , 2014, PloS one.

[32]  Angelika Braun,et al.  Artificial vision with wirelessly powered subretinal electronic implant alpha-IMS , 2013, Proceedings of the Royal Society B: Biological Sciences.

[33]  Brent Lance,et al.  EEGNet: a compact convolutional neural network for EEG-based brain–computer interfaces , 2016, Journal of neural engineering.

[34]  M. Friedman The Use of Ranks to Avoid the Assumption of Normality Implicit in the Analysis of Variance , 1937 .

[35]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[36]  H. Yokoi,et al.  Real-time control of a prosthetic hand using human electrocorticography signals. , 2011, Journal of neurosurgery.

[37]  Borja Calvo,et al.  scmamp: Statistical Comparison of Multiple Algorithms in Multiple Problems , 2016, R J..

[38]  OrtegaJulio,et al.  Leveraging cooperation for parallel multi-objective feature selection in high-dimensional EEG data , 2015 .

[39]  John Q. Gan,et al.  A supervised filter method for multi-objective feature selection in EEG classification based on multi-resolution analysis for BCI , 2017, Neurocomputing.

[40]  Yoshua Bengio,et al.  Learning long-term dependencies with gradient descent is difficult , 1994, IEEE Trans. Neural Networks.

[41]  Borja Calvo,et al.  Statistical Comparison of Multiple Algorithms in MultipleProblems , 2015 .

[42]  Richard D. Jones,et al.  EEG-Based Lapse Detection With High Temporal Resolution , 2007, IEEE Transactions on Biomedical Engineering.

[43]  John Q. Gan,et al.  Deep Belief Networks and Multiobjective Feature Selection for BCI with Multiresolution Analysis , 2017, IWANN.

[44]  Francesca Mangili,et al.  Should We Really Use Post-Hoc Tests Based on Mean-Ranks? , 2015, J. Mach. Learn. Res..

[45]  M Congedo,et al.  A review of classification algorithms for EEG-based brain–computer interfaces: a 10 year update , 2018, Journal of neural engineering.

[46]  Sepp Hochreiter,et al.  Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs) , 2015, ICLR.

[47]  S. G. Ponnambalam,et al.  Multi-objective genetic algorithm as channel selection method for P300 and motor imagery data set , 2015, Neurocomputing.

[48]  Filippo Zappasodi,et al.  Deep learning for hybrid EEG-fNIRS brain–computer interface: application to motor imagery classification , 2018, Journal of neural engineering.

[49]  SchmidhuberJürgen,et al.  2005 Special Issue , 2005 .

[50]  M. Friedman A Comparison of Alternative Tests of Significance for the Problem of $m$ Rankings , 1940 .

[51]  Francisco Herrera,et al.  rNPBST: An R Package Covering Non-parametric and Bayesian Statistical Tests , 2017, HAIS.

[52]  Muhammad Ghulam,et al.  Deep Learning for EEG motor imagery classification based on multi-layer CNNs feature fusion , 2019, Future Gener. Comput. Syst..

[53]  Thomas Bäck,et al.  Selective Pressure in Evolutionary Algorithms: A Characterization of Selection Mechanisms , 1994, International Conference on Evolutionary Computation.

[54]  Sampsa Hautaniemi,et al.  Biclustering Methods: Biological Relevance and Application in Gene Expression Analysis , 2014, PloS one.

[55]  Alok Sharma,et al.  An improved discriminative filter bank selection approach for motor imagery EEG signal classification using mutual information , 2017, BMC Bioinformatics.

[56]  Francisco Herrera,et al.  A practical tutorial on the use of nonparametric statistical tests as a methodology for comparing evolutionary and swarm intelligence algorithms , 2011, Swarm Evol. Comput..

[57]  Rafael Ramírez,et al.  Detecting Emotion from EEG Signals Using the Emotive Epoc Device , 2012, Brain Informatics.

[58]  Yu Zhang,et al.  Sparse Group Representation Model for Motor Imagery EEG Classification , 2019, IEEE Journal of Biomedical and Health Informatics.

[59]  Julio Ortega Lopera,et al.  Leveraging cooperation for parallel multi‐objective feature selection in high‐dimensional EEG data , 2015, Concurr. Comput. Pract. Exp..

[60]  Mark D. Robinson,et al.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data , 2009, Bioinform..

[62]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[63]  Yoshua Bengio,et al.  Deep Sparse Rectifier Neural Networks , 2011, AISTATS.

[64]  B. Wilhelm,et al.  Subretinal Visual Implant Alpha IMS – Clinical trial interim report , 2015, Vision Research.

[65]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[66]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[67]  Alok Sharma,et al.  Brain wave classification using long short-term memory network based OPTICAL predictor , 2019, Scientific Reports.

[68]  Sara van de Geer,et al.  Statistics for High-Dimensional Data: Methods, Theory and Applications , 2011 .

[69]  Rebeca Corralejo,et al.  Feature selection using a genetic algorithm in a motor imagery-based Brain Computer Interface , 2011, 2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[70]  Jürgen Schmidhuber,et al.  Learning to Forget: Continual Prediction with LSTM , 2000, Neural Computation.

[71]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..