论文信息 - A modular approach to speech enhancement with an application to speech coding

A modular approach to speech enhancement with an application to speech coding

Ephraim and Malah's (1984, 1985) MMSE-LSA speech enhancement algorithm, while robust and effective, is difficult to tune and adjust for the tradeoff between noise reduction and distortion. We suggest a means of generalizing this design, which allows for other estimators besides the MMSE-LSA to be used within the same supporting framework. When a modified version of Ephraim and Van Trees's (see IEEE Trans. Speech and Audio Proc., vol.3, p.251-66, 1995) spectral domain constrained signal subspace estimator is used in this manner, we obtain a system with greater flexibility and similar performance. We also explore the possibility of using different speech enhancement techniques as pre-processors for different parameter extraction modules of the IS-641 speech coder (a 7.4 kbit/s ACELP codec). We show that such a strategy can increase the quality of the coded speech and lead to a system that is more robust to differing noise types.

Richard V. Cox | Anthony J. Accardi | A. Accardi | R. Cox

[1] Jean-Pierre Adoul,et al. Enhanced full rate speech codec for IS-136 digital cellular system , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[2] R. McAulay,et al. Speech enhancement using a soft-decision noise suppression filter , 1980 .

[3] Yariv Ephraim,et al. A signal subspace approach for speech enhancement , 1995, IEEE Trans. Speech Audio Process..

[4] Ephraim. Speech enhancement using a minimum mean square error short-time spectral amplitude estimator , 1984 .

[5] Olivier Cappé,et al. Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor , 1994, IEEE Trans. Speech Audio Process..

[6] David Malah,et al. Tracking speech-presence uncertainty to improve speech enhancement in non-stationary noise environments , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[7] David Malah,et al. Speech enhancement using a minimum mean-square error log-spectral amplitude estimator , 1984, IEEE Trans. Acoust. Speech Signal Process..

[8] Michael J. McLaughlin,et al. Background noise suppression for speech enhancement and coding , 1997, 1997 IEEE Workshop on Speech Coding for Telecommunications Proceedings. Back to Basics: Attacking Fundamental Problems in Speech Coding.