The Effects of Individual Differences, Non-Stationarity, and the Importance of Data Partitioning Decisions for Training and Testing of EEG Cross-Participant Models

EEG-based deep learning has trended toward models designed to perform classification on any individual (cross-participant models). However, because EEG varies across participants due to non-stationarity and individual differences, certain guidelines must be followed when partitioning data into training, validation, and testing sets so that the accuracy of cross-participant models is not overestimated. Despite this necessity, the majority of EEG-based cross-participant models have not adopted such guidelines. Furthermore, some data repositories may unwittingly contribute to the problem by providing pre-partitioned test and non-test datasets for reasons such as competition support. In this study, we demonstrate how improper dataset partitioning, and the resulting improper training, validation, and testing of a cross-participant model, lead to overestimated model accuracy. We demonstrate this both mathematically and empirically, using five publicly available datasets. To build the cross-participant models for these datasets, we replicate published results and demonstrate how the model accuracies are significantly reduced when proper EEG cross-participant model guidelines are followed. Our empirical results show that when these guidelines are not followed, error rates of cross-participant models can be underestimated by between 35% and 3900%. This misrepresentation of model performance for the general population potentially slows scientific progress toward truly high-performing classification models.
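As a rough illustration of the kind of partitioning at issue, the sketch below splits data by participant rather than by sample. It assumes the relevant guideline is that no participant contributes data to both the training and the test set; the abstract does not spell the guidelines out, so this is an assumption. The function name `split_by_participant`, its parameters, and the toy data are hypothetical and not taken from the paper.

```python
import random


def split_by_participant(participant_ids, test_fraction=0.2, seed=0):
    """Return (train_idx, test_idx) so that no participant appears in both.

    participant_ids: one entry per sample, naming the participant it came from.
    This is an illustrative sketch, not the authors' procedure.
    """
    unique_ids = sorted(set(participant_ids))
    rng = random.Random(seed)
    rng.shuffle(unique_ids)

    # Hold out whole participants, at least one.
    n_test = max(1, round(len(unique_ids) * test_fraction))
    test_ids = set(unique_ids[:n_test])

    train_idx = [i for i, p in enumerate(participant_ids) if p not in test_ids]
    test_idx = [i for i, p in enumerate(participant_ids) if p in test_ids]
    return train_idx, test_idx


# Hypothetical toy data: 5 participants, 4 EEG epochs each.
participant_ids = [p for p in range(5) for _ in range(4)]
train_idx, test_idx = split_by_participant(participant_ids)

train_participants = {participant_ids[i] for i in train_idx}
test_participants = {participant_ids[i] for i in test_idx}
```

By contrast, shuffling individual epochs before splitting would typically place epochs from the same participant on both sides of the split, which is the situation the abstract associates with overestimated accuracy.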
