An alternative approach for auditory attention tracking using single-trial EEG

Auditory selective attention plays a central role in the human capacity to reliably process complex sounds in multi-source environments. Stimulus reconstruction has been widely used for the investigation of selective auditory attention using multichannel electroencephalography (EEG). In particular, the influence of attention on sound representations in the brain has been modeled by linear time-variant filters and have been used to track the attentional state of individuals in multi-source environments. Detection of auditory attention is of interest and is important in the study of attention-related disorders and has potential application in the hearing aid and advertising industries. In analogy with the rake receiver from wireless communications, we propose a new strategy, adapting principles from minimum variance beamforming, to reconstruct stimuli for decoding the at-tentional state of listeners in a competing speaker environment. We show through experiments with real electrophysiological data how decoding accuracies can be improved using our proposed scheme.

[1]  Marc Moonen,et al.  Comparison of speech envelope extraction methods for EEG-based auditory attention detection in a cocktail party scenario , 2015, 2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[2]  Les E. Atlas,et al.  Flexible tracking of auditory attention , 2015, INTERSPEECH.

[3]  T. Picton,et al.  Human Cortical Responses to the Speech Envelope , 2008, Ear and hearing.

[4]  Brian N. Pasley,et al.  Reconstructing Speech from Human Auditory Cortex , 2012, PLoS biology.

[5]  D. Poeppel,et al.  Mechanisms Underlying Selective Neuronal Tracking of Attended Speech at a “Cocktail Party” , 2013, Neuron.

[6]  John J. Foxe,et al.  Attentional Selection in a Cocktail Party Environment Can Be Decoded from Single-Trial EEG. , 2015, Cerebral cortex.

[7]  Pamela Reinagel,et al.  Decoding visual information from a population of retinal ganglion cells. , 1997, Journal of neurophysiology.

[8]  Arnaud Delorme,et al.  EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis , 2004, Journal of Neuroscience Methods.

[9]  Ivan Dokmanic,et al.  Raking the Cocktail Party , 2015, IEEE Journal of Selected Topics in Signal Processing.

[10]  G B Stanley,et al.  Reconstruction of Natural Scenes from Ensemble Responses in the Lateral Geniculate Nucleus , 1999, The Journal of Neuroscience.

[11]  William Bialek,et al.  Reading a Neural Code , 1991, NIPS.

[12]  J. Simon,et al.  Neural coding of continuous speech in auditory cortex during monaural and dichotic listening. , 2012, Journal of neurophysiology.

[13]  John J. Foxe,et al.  Neural responses to uninterrupted natural speech can be extracted with precise temporal resolution , 2010, The European journal of neuroscience.

[14]  J. Capon High-resolution frequency-wavenumber spectrum analysis , 1969 .

[15]  Tzyy-Ping Jung,et al.  Independent Component Analysis of Electroencephalographic Data , 1995, NIPS.

[16]  Harry L. Van Trees,et al.  Optimum Array Processing: Part IV of Detection, Estimation, and Modulation Theory , 2002 .

[17]  Martin Vetterli,et al.  Raking echoes in the time domain , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[18]  Kyungwhoon Cheun Performance of direct-sequence spread-spectrum RAKE receivers with random spreading sequences , 1997, IEEE Trans. Commun..