Mixed decision-based noise adaptation for speech enhancement

A robust noise adaptation method based on a mixed decision technique is proposed for speech enhancement. Objective speech quality tests, in terms of the SEGSNR improvement and the Itakura-Saito distortion, demonstrate its superiority in comparison with both hard- and soft-decision-based methods.

[1]  Alan V. Oppenheim,et al.  All-pole modeling of degraded speech , 1978 .

[2]  David Malah,et al.  Speech enhancement using a minimum mean-square error log-spectral amplitude estimator , 1984, IEEE Trans. Acoust. Speech Signal Process..

[3]  Wonyong Sung,et al.  A voice activity detector employing soft decision based noise spectrum adaptation , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[4]  Ahmet M. Kondoz,et al.  Improved voice activity detection based on a smoothed statistical likelihood ratio , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[5]  Ephraim Speech enhancement using a minimum mean square error short-time spectral amplitude estimator , 1984 .

[6]  Joon-Hyuk Chang,et al.  Spectral enhancement based on global soft decision , 2000, IEEE Signal Process. Lett..

[7]  H.J. Kim,et al.  A genetic algorithm-based segmentation of Markov random field modeled images , 2000, IEEE Signal Processing Letters.