A Binaural Steering Beamformer System for Enhancing a Moving Speech Source

In many daily life communication situations, several sound sources are simultaneously active. While normal-hearing listeners can easily distinguish the target sound source from interfering sound sources—as long as target and interferers are spatially or spectrally separated—and concentrate on the target, hearing-impaired listeners and cochlear implant users have difficulties in making such a distinction. In this article, we propose a binaural approach composed of a spatial filter controlled by a direction-of-arrival estimator to track and enhance a moving target sound. This approach was implemented on a real-time signal processing platform enabling experiments with test subjects in situ. To evaluate the proposed method, a data set of sound signals with a single moving sound source in an anechoic diffuse noise environment was generated using virtual acoustics. The proposed steering method was compared with a fixed (nonsteering) method that enhances sound from the frontal direction in an objective evaluation and subjective experiments using this database. In both cases, the obtained results indicated a significant improvement in speech intelligibility and quality compared with the unprocessed signal. Furthermore, the proposed method outperformed the nonsteering method.

[1]  Birger Kollmeier,et al.  Efficient adaptive procedures for threshold and concurrent slope estimates for psychophysics and speech intelligibility tests. , 2002, The Journal of the Acoustical Society of America.

[2]  Volker Hohmann,et al.  The ABCIT research platform , 2014, 2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC).

[3]  P. Peterson,et al.  Intelligibility-weighted measures of speech-to-interference ratio and speech system performance. , 1993, The Journal of the Acoustical Society of America.

[4]  Adam A. Hersbach,et al.  Combining Directional Microphone and Single-Channel Noise Reduction Algorithms: A Clinical Evaluation in Difficult Listening Conditions With Cochlear Implant Users , 2012, Ear and hearing.

[5]  Kamil Adiloglu,et al.  Estimation of inter-channel phase differences using non-negative matrix factorization , 2014, 2014 IEEE 8th Sensor Array and Multichannel Signal Processing Workshop (SAM).

[6]  Andries P. Hekstra,et al.  Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[7]  Giso Grimm,et al.  The master hearing Aid : A PC-based platform for algorithm development and evaluation , 2006 .

[8]  B Backus,et al.  A Binaural CI Research Platform for Oticon Medical SP/XP Implants Enabling ITD/ILD and Variable Rate Processing , 2015, Trends in hearing.

[9]  Volker Hohmann,et al.  Comparing Binaural Pre-processing Strategies II , 2015, Trends in hearing.

[10]  Guy J. Brown,et al.  Robust localisation of multiple speakers exploiting head movements and multi-conditional training of binaural cues , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[11]  Thomas Lenarz,et al.  Advanced Beamformers for Cochlear Implant Users: Acute Measurement of Speech Perception in Challenging Listening Conditions , 2014, PloS one.

[12]  Marc Moonen,et al.  Reduced-bandwidth Multi-channel Wiener Filter based binaural noise reduction and localization cue preservation in binaural hearing aids , 2014, Signal Process..

[13]  Guy J. Brown,et al.  A machine-hearing system exploiting head movements for binaural sound localisation in reverberant conditions , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[14]  Dragan Simic,et al.  A New Post-filter Algorithm Combined with Two-step Adaptive Beamformer , 2011, Circuits Syst. Signal Process..

[15]  Volker Hohmann,et al.  Comparing Binaural Pre-processing Strategies I , 2015, Trends in hearing.

[16]  Richard C. Hendriks,et al.  Unbiased MMSE-Based Noise Power Estimation With Low Complexity and Low Tracking Delay , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[17]  B. Kollmeier,et al.  Combined Source Tracking and Noise Reduction for Application in Hearing Aids , 2011 .

[18]  Marc Moonen,et al.  Speech enhancement with multichannel Wiener filter techniques in multimicrophone binaural hearing aids. , 2009, The Journal of the Acoustical Society of America.

[19]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .

[20]  Yi Hu,et al.  Speech recognition by bilateral cochlear implant users in a cocktail-party setting. , 2009, The Journal of the Acoustical Society of America.

[21]  Kostas Kokkinakis,et al.  Multi-microphone adaptive noise reduction strategies for coordinated stimulation in bilateral cochlear implant devices. , 2010, The Journal of the Acoustical Society of America.

[22]  Volker Hohmann,et al.  Database of Multichannel In-Ear and Behind-the-Ear Head-Related and Binaural Room Impulse Responses , 2009, EURASIP J. Adv. Signal Process..

[23]  Klaus Uwe Simmer,et al.  Superdirective Microphone Arrays , 2001, Microphone Arrays.

[24]  Jörn Anemüller,et al.  A discriminative learning approach to probabilistic acoustic source localization , 2014, 2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC).

[25]  Hiroshi Sawada,et al.  A Multichannel MMSE-Based Framework for Speech Source Separation and Noise Reduction , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[26]  Jesper Jensen,et al.  An evaluation of objective measures for intelligibility prediction of time-frequency weighted noisy speech. , 2011, The Journal of the Acoustical Society of America.

[27]  Pasi Pertilä,et al.  Microphone array post-filtering using supervised machine learning for speech enhancement , 2014, INTERSPEECH.

[28]  Nasser Kehtarnavaz,et al.  Environment-adaptive speech enhancement for bilateral cochlear implants using a single processor , 2013, Speech Commun..