Real-Time Decoding of Auditory Attention from EEG via Bayesian Filtering

In a complex auditory scene comprising multiple sound sources, humans are able to target and track a single speaker. Recent studies have provided promising algorithms to decode the attentional state of a listener in a competing-speaker environment from non-invasive brain recordings sun exhibit poor performance at temporal resolutions suitable for real-time implementation, which hinders their utilization in emerging applications such as smart hearich as electroencephalography (EEG). These algorithms require substantial training datasets and ofteng aids. In this work, we propose a real-time attention decoding framework by integrating techniques from Bayesian filtering, $\ell_{1}$-regularization, state-space modeling, and Expectation Maximization, which is capable of producing robust and statistically interpretable measures of auditory attention at high temporal resolution. Application of our proposed algorithm to synthetic and real EEG data yields a performance close to the state-of-the-art offline methods, while operating in near real-time with a minimal amount of training data.

[1]  E. C. Cmm,et al.  on the Recognition of Speech, with , 2008 .

[2]  John J. Foxe,et al.  Attentional Selection in a Cocktail Party Environment Can Be Decoded from Single-Trial EEG. , 2015, Cerebral cortex.

[3]  J. Simon,et al.  Emergence of neural encoding of auditory objects while listening to competing speakers , 2012, Proceedings of the National Academy of Sciences.

[4]  Jonathan Z. Simon,et al.  Dynamic Estimation of the Auditory Temporal Response Function From MEG in Competing-Speaker Environments , 2017, IEEE Transactions on Biomedical Engineering.

[5]  Alexander Bertrand,et al.  The effect of head-related filtering and ear-specific decoding bias on auditory attention detection , 2016, Journal of neural engineering.

[6]  Jonathan Z. Simon,et al.  The Auditory System at the Cocktail Party , 2017 .

[7]  John J. Foxe,et al.  At what time is the cocktail party? A late locus of selective attention to natural speech , 2012, The European journal of neuroscience.

[8]  Alexander Bertrand,et al.  Auditory-Inspired Speech Envelope Extraction Methods for Improved EEG-Based Auditory Attention Detection in a Cocktail Party Scenario , 2017, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[9]  Shihab A. Shamma,et al.  Recursive Sparse Point Process Regression With Application to Spectrotemporal Receptive Field Plasticity Analysis , 2015, IEEE Transactions on Signal Processing.

[10]  Richard G. Baraniuk,et al.  A Field Guide to Forward-Backward Splitting with a FASTA Implementation , 2014, ArXiv.

[11]  Maarten De Vos,et al.  Decoding the attended speech stream with multi-channel EEG: implications for online, daily-life applications , 2015, Journal of neural engineering.

[12]  Zhuo Chen,et al.  Neural decoding of attentional selection in multi-speaker environments without access to clean sources , 2017, Journal of neural engineering.

[13]  Alessandro Presacco,et al.  Robust decoding of selective auditory attention from MEG in a competing-speaker environment via state-space modeling , 2016, NeuroImage.

[14]  Alexander Bertrand,et al.  Online detection of auditory attention with mobile EEG: closing the loop with neurofeedback , 2017, bioRxiv.

[15]  Sabine Van Huffel,et al.  Online detection of auditory attention in a neurofeedback application , 2016 .