Post-processing for frequency-domain blind source separation in hearing aids

In this paper, we investigate post-processing for the frequency-domain blind source separation (FD-BSS) in hearing aids applications. It is known that the segregate quality of FD-BSS degrades severely in the challenging scenario of reverberant enclosures or moving source situations. A robust two-stage dynamic programming approach based on inter-frequency correlation is presented to solving the permutation ambiguity correction problem. Moreover, binary masking method and the non-stationary spectral subtraction techniques are combined to estimate and reduce the residual cross-talk components. The subtraction parameter is adaptively determined by estimated local Signal-to-Noise Ratio (SNR). An optimization scheme is proposed aiming at obtaining a tradeoff between the residual noise reduction and generated distortion suppression. Experimental results show that the post-processing procedure enhances the separated signals and the musical noise is controlled under moderate level.

[1]  Birger Kollmeier,et al.  Amplitude Modulation Decorrelation For Convolutive Blind Source Separation , 2000 .

[2]  Kostas Kokkinakis,et al.  Using blind source separation techniques to improve speech recognition in bilateral cochlear implant patients. , 2008, The Journal of the Acoustical Society of America.

[3]  Rainer Martin,et al.  Noise power spectral density estimation based on optimal smoothing and minimum statistics , 2001, IEEE Trans. Speech Audio Process..

[4]  Richard M. Schwartz,et al.  Enhancement of speech corrupted by acoustic noise , 1979, ICASSP.

[5]  Fan-Gang Zeng,et al.  Cochlear implant speech recognition with speech maskers. , 2004, The Journal of the Acoustical Society of America.

[6]  Hiroshi Sawada,et al.  A robust and precise method for solving the permutation problem of frequency-domain blind source separation , 2004, IEEE Transactions on Speech and Audio Processing.

[7]  Shiro Ikeda,et al.  A METHOD OF ICA IN TIME-FREQUENCY DOMAIN , 2003 .

[8]  Paris Smaragdis,et al.  Blind separation of convolved mixtures in the frequency domain , 1998, Neurocomputing.

[9]  Kiyohiro Shikano,et al.  High-Presence Hearing-Aid System using DSP-Based Real-Time Blind Source Separation Module , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[10]  Kiyohiro Shikano,et al.  Musical noise generation analysis for noise reduction methods based on spectral subtraction and MMSE STSA estimation , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[11]  Nikolaos Mitianoudis,et al.  Audio source separation of convolutive mixtures , 2003, IEEE Trans. Speech Audio Process..

[12]  Lucas C. Parra,et al.  A SURVEY OF CONVOLUTIVE BLIND SOURCE SEPARATION METHODS , 2007 .

[13]  Walter Kellermann,et al.  Post-Processing for Convolutive Blind Source Separation , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[14]  Kiyohiro Shikano,et al.  Enhancement of speech signals separated from their convolutive mixture by FDICA algorithm , 2009, Digit. Signal Process..

[15]  Jacob Benesty,et al.  Speech Enhancement , 2010 .

[16]  Andrzej Cichocki,et al.  Adaptive Blind Signal and Image Processing - Learning Algorithms and Applications , 2002 .

[17]  Mohsen Rahmani,et al.  An objective measure for the musical noise assessment in noise reduction systems , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[18]  Andrzej Cichocki,et al.  Adaptive blind signal and image processing , 2002 .

[19]  Junfeng Li,et al.  Noise reduction based on adaptive β-order generalized spectral subtraction for speech enhancement , 2007, INTERSPEECH.

[20]  S. Boll,et al.  Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[21]  Simon Haykin,et al.  The Cocktail Party Problem , 2005, Neural Computation.