Enhanced speech in noisy multiple speaker environment

Noisy environments seriously degrade the performance of speech recognition systems. Here we implement a high performance speech enhancement algorithm. Data from speech separation challenge were used to evaluate the method. It was observed that the enhanced speech significantly improved the recognition performance. In 2 out of 4 SNR cases, over 100% relative percentage improvements were achieved. Standalone software prototype has been developed and evaluated.

[1]  R. Xu,et al.  Multimodal speech enhancement in noisy environment , 2004, Proceedings of 2004 International Symposium on Intelligent Multimedia, Video and Speech Processing, 2004..

[2]  David Malah,et al.  Speech enhancement using a minimum mean-square error log-spectral amplitude estimator , 1984, IEEE Trans. Acoust. Speech Signal Process..

[3]  George S. Kang,et al.  Quality improvement of LPC-processed noisy speech by using spectral subtraction , 1989, IEEE Trans. Acoust. Speech Signal Process..

[4]  Nathalie Virag,et al.  Single channel speech enhancement based on masking properties of the human auditory system , 1999, IEEE Trans. Speech Audio Process..

[5]  S. Boll,et al.  Suppression of acoustic noise in speech using spectral subtraction , 1979 .