Paper information - Adaptive attention-driven speech enhancement for EEG-informed hearing prostheses

State-of-the-art hearing prostheses are equipped with acoustic noise reduction algorithms to improve speech intelligibility. One of the major current challenges is to perform acoustic noise reduction in so-called cocktail party scenarios with multiple speakers, in particular because it is difficult, if not impossible, for the algorithm to determine which speaker(s) are the targets to be enhanced and which speaker(s) should be treated as interfering sources. Recently, it has been shown that electroencephalography (EEG) can be used to perform auditory attention detection, i.e., to detect which speaker a subject is attending to based on recordings of neural activity. In this paper, we combine such an EEG-based auditory attention detection (AAD) paradigm with an acoustic noise reduction algorithm based on the multi-channel Wiener filter (MWF), leading to a neuro-steered MWF. In particular, we analyze how the AAD accuracy affects the noise suppression performance of an adaptive MWF in a sliding-window implementation, where the user switches attention between two speakers.
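To make the idea of a neuro-steered MWF concrete, the following is a minimal NumPy sketch under simplifying assumptions, not the implementation evaluated in the paper. It assumes that a per-speaker voice activity mask is available, that the AAD decision is given as a speaker index, and that the filter is estimated in one batch rather than over sliding windows. The function names (`mwf_weights`, `neuro_steered_mwf`, `simulate_two_speakers`), the steering vectors, and the noise level are all hypothetical and chosen only for illustration.

```python
import numpy as np


def mwf_weights(y, target_active, ref=0):
    """Estimate MWF weights from a multi-microphone recording.

    y             : array (channels, samples), microphone signals
    target_active : boolean array (samples,), True where the target speaks
    ref           : index of the reference microphone
    """
    y_on = y[:, target_active]      # target + interferer + noise
    y_off = y[:, ~target_active]    # interferer + noise only
    r_yy = y_on @ y_on.T / y_on.shape[1]
    r_nn = y_off @ y_off.T / y_off.shape[1]
    r_xx = r_yy - r_nn              # estimated target correlation matrix
    # w = R_yy^{-1} R_xx e_ref: MMSE estimate of the target at mic `ref`
    return np.linalg.solve(r_yy, r_xx[:, ref])


def neuro_steered_mwf(y, vads, attended, ref=0):
    """Enhance the speaker picked by an (assumed) AAD decision `attended`."""
    w = mwf_weights(y, vads[attended], ref)
    return w @ y


def simulate_two_speakers(n=40000, seed=0):
    """Toy data: two white-noise 'speakers', 4 mics (all values assumed)."""
    rng = np.random.default_rng(seed)
    block = np.arange(n) // (n // 4)          # four equal blocks: 0..3
    # Speaker 0 talks in blocks 0-1, speaker 1 in blocks 0 and 2, so each
    # speaker is active half the time whether or not the other one talks.
    vads = np.stack([block <= 1, (block == 0) | (block == 2)])
    steering = np.array([[1.0, 0.8, 0.6, 0.4],
                         [0.4, 0.6, 0.8, 1.0]])
    sources = rng.standard_normal((2, n)) * vads
    noise = 0.1 * rng.standard_normal((4, n))
    y = steering.T @ sources + noise
    return y, vads, sources
```

In a real system the `attended` index would come from an EEG decoder and could be wrong for some windows, which is the effect the paper analyzes; here it is simply passed in. Calling `neuro_steered_mwf(y, vads, attended=0)` on the simulated data should yield an output dominated by speaker 0, and switching to `attended=1` re-estimates the filter for the other speaker.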
