Speech Enhancement Based on Binaural Sound Source Localization and Cosh Measure Wiener Filtering

The existing speech enhancement algorithm has shown poor performance under low Signal Noise Ratios (SNRs). To resolve this problem, a speech enhancement algorithm based on binaural sound source localization and cosh measure filtering is proposed. Firstly, the algorithm uses a sound source localization algorithm based on head correlation functions and two-level deep learning to extract the spatial information of the binaural sound source and determine the spatial position of the sound source. The beamforming method is then used to remove the noises in different directions from the speech. Finally, the Wiener filtering of cosh measure based on logarithmic relation is used to remove the noise in the same direction as the speech to achieve speech enhancement. Experiments show that the proposed algorithm has better robustness and denoising ability than the contrast algorithms.

[1]  Chen Youyuan,et al.  A binaural speech enhancement algorithm: Application to background and directional noise fields , 2015 .

[2]  Damián Marelli,et al.  Efficient Approximation of Head-Related Transfer Functions in Subbands for Accurate Sound Localization , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[3]  Zhou Chengxu Broadband beamforming for speech enhancement in reverberation environment , 2012 .

[4]  Chunhe Yu,et al.  Speech enhancement based on the generalized sidelobe cancellation and spectral subtraction for a microphone array , 2015, 2015 8th International Congress on Image and Signal Processing (CISP).

[5]  Yoshua. Bengio,et al.  Learning Deep Architectures for AI , 2007, Found. Trends Mach. Learn..

[6]  Hong Liu,et al.  A new hierarchical binaural sound source localization method based on Interaural Matching Filter , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[7]  Thushara D. Abhayapala,et al.  Spatial feature learning for robust binaural sound source localization using a composite feature vector , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[8]  Marc Moonen,et al.  Reduced-Bandwidth and Distributed MWF-Based Noise Reduction Algorithms for Binaural Hearing Aids , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[9]  Hong Liu,et al.  Binaural Sound Localization Based on Reverberation Weighting and Generalized Parametric Mapping , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[10]  Jesper Jensen,et al.  A short-time objective intelligibility measure for time-frequency weighted noisy speech , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[11]  A. Gray,et al.  Distance measures for speech processing , 1976 .

[12]  Shantanu Chakrabartty,et al.  A Min–Max Optimization Framework for Designing $\Sigma\Delta$ Learners: Theory and Hardware , 2010, IEEE Transactions on Circuits and Systems I: Regular Papers.

[13]  Ruth A. Bentler,et al.  New feedback detection method for performance evaluation of hearing aids , 2007 .

[14]  B Rafaely,et al.  Feedback path variability modeling for robust hearing aids. , 2000, The Journal of the Acoustical Society of America.

[15]  Dongmei Pan,et al.  Speech Enhancement Algorithm Based on Sound Source Localization and Scene Matching for Binaural Digital Hearing Aids , 2019 .

[16]  Harald Viste,et al.  Binaural Source Localization by Joint Estimation of ILD and ITD , 2010, IEEE Transactions on Audio, Speech, and Language Processing.