Gaussian Process Regression for Predictive But Interpretable Machine Learning Models: An Example of Predicting Mental Workload across Tasks

There is increasing interest in real-time brain-computer interfaces (BCIs) for the passive monitoring of human cognitive state, including cognitive workload. Too often, however, effective BCIs based on machine learning techniques function as “black boxes” that are difficult to analyze or interpret. In an effort toward more interpretable BCIs, we studied a family of N-back working memory tasks using a machine learning model, Gaussian Process Regression (GPR), that is both powerful and amenable to analysis. Participants performed the N-back task with three stimulus variants (auditory-verbal, visual-spatial, and visual-numeric), each at three working memory loads. GPR models were trained and tested on EEG data from all three task variants combined, in an effort to identify a model that could predict mental workload demand regardless of stimulus modality. To provide a baseline for GPR performance, a model was additionally trained using multiple linear regression (MLR). The GPR model was effective when trained on individual participant EEG data, yielding an average standardized mean squared error (sMSE) between true and predicted N-back levels of 0.44. In comparison, the MLR model trained on the same data yielded an average sMSE of 0.55. We additionally demonstrate how GPR can be used to identify which EEG features are relevant for predicting cognitive workload in an individual participant. A fraction of EEG features accounted for the majority of the model’s predictive power; a model using only the top 25% of features performed nearly as well as one using 100% of features. Subsets of features identified by linear models (ANOVA) were not as efficient as subsets identified by GPR. This raises the possibility of BCIs that require fewer model features while capturing all of the information needed to achieve high predictive accuracy.
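To make the sMSE metric and the idea of GPR-based feature relevance concrete, the following is a minimal sketch and not the authors' implementation. It assumes the common definition of sMSE (mean squared error divided by the variance of the true targets, so that always predicting the mean scores about 1) and assumes a squared-exponential kernel with one length scale per feature (automatic relevance determination), which the abstract does not specify. The function names, the synthetic data, and the fixed hyperparameter values are all hypothetical; in practice the length scales would be learned from the data rather than set by hand.

```python
import numpy as np

def smse(y_true, y_pred):
    """Standardized MSE: mean squared error divided by the variance of the true targets."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return float(np.mean((y_true - y_pred) ** 2) / np.var(y_true))

def ard_kernel(A, B, length_scales, signal_var=1.0):
    """Squared-exponential kernel with a separate length scale per feature (ARD)."""
    scales = np.asarray(length_scales, dtype=float)
    A = np.asarray(A, dtype=float) / scales
    B = np.asarray(B, dtype=float) / scales
    sq_dist = (A ** 2).sum(axis=1)[:, None] + (B ** 2).sum(axis=1)[None, :] - 2.0 * A @ B.T
    return signal_var * np.exp(-0.5 * np.maximum(sq_dist, 0.0))

def gp_predict(X_train, y_train, X_test, length_scales, noise_var=0.1):
    """GP posterior mean with fixed (not optimized) hyperparameters."""
    y_mean = y_train.mean()
    K = ard_kernel(X_train, X_train, length_scales) + noise_var * np.eye(len(X_train))
    alpha = np.linalg.solve(K, y_train - y_mean)
    return ard_kernel(X_test, X_train, length_scales) @ alpha + y_mean

def rank_features(length_scales):
    """Order feature indices from most to least relevant (shortest length scale first)."""
    return [int(i) for i in np.argsort(length_scales)]

# Synthetic stand-in for EEG features: only feature 0 carries signal.
rng = np.random.default_rng(0)
X = rng.normal(size=(120, 3))
y = np.sin(X[:, 0]) + 0.1 * rng.normal(size=120)
X_train, y_train, X_test, y_test = X[:80], y[:80], X[80:], y[80:]

# Hypothetical length scales: a short scale marks feature 0 as relevant,
# long scales make the kernel nearly ignore features 1 and 2.
length_scales = np.array([1.0, 100.0, 100.0])

score = smse(y_test, gp_predict(X_train, y_train, X_test, length_scales))
ranking = rank_features(length_scales)
print(f"sMSE = {score:.3f}, feature ranking = {ranking}")
```

Under this kernel, a feature with a long length scale barely changes the covariance between trials, so ranking features by length scale gives one way to select a top fraction of features and refit, in the spirit of the top-25% comparison described above.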
