Paper information - Speech Enhancement Using a Minimum Mean-Square Error Short-Time Spectral

Speech Enhancement Using a Minimum Mean-Square Error Short-Time Spectral

Abstract: This paper focuses on the class of speech enhancement systems which capitalize on the major importance of the short-time spectral amplitude (STSA) of the speech signal in its perception. A system which utilizes a minimum mean-square error (MMSE) STSA estimator is proposed and then compared with other widely used systems which are based on Wiener filtering and the “spectral subtraction” algorithm. In this paper we derive the MMSE STSA estimator, based on modeling speech and noise spectral components as statistically independent Gaussian random variables. We analyze the performance of the proposed STSA estimator and compare it with an STSA estimator derived from the Wiener estimator. We also examine the MMSE STSA estimator under uncertainty of signal presence in the noisy observations. In constructing the enhanced signal, the MMSE STSA estimator is combined with the complex exponential of the noisy phase. It is shown here that the latter is the MMSE estimator of the complex exponential of the original phase, which does not affect the STSA estimation. The proposed approach results in a significant reduction of the noise, and provides enhanced speech with colorless residual noise. The complexity of the proposed algorithm is approximately that of other systems in the discussed class.
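The construction described in the abstract (an amplitude estimate recombined with the noisy phase) can be sketched for a single frequency bin as follows. This is a minimal illustration, not the paper's implementation. The closed-form gain used here is the commonly quoted form of the MMSE STSA gain under the Gaussian model and is not stated in the abstract itself. The names `xi` (a priori SNR), `gamma` (a posteriori SNR), and all function names are assumptions introduced for the example, and the power-series Bessel routine is only meant for moderate arguments.

```python
import cmath
import math


def bessel_i(order, x, terms=40):
    """Modified Bessel function of the first kind, I_order(x), by power series.

    Adequate for moderate x only; a numerical library would be preferable
    in practice.
    """
    half = x / 2.0
    total = 0.0
    for k in range(terms):
        total += half ** (2 * k + order) / (
            math.factorial(k) * math.factorial(k + order)
        )
    return total


def mmse_stsa_gain(xi, gamma):
    """Commonly quoted MMSE STSA gain (assumed form, see lead-in).

    xi    -- a priori SNR of the bin (assumed symbol)
    gamma -- a posteriori SNR of the bin (assumed symbol)
    """
    v = xi / (1.0 + xi) * gamma
    bracket = (1.0 + v) * bessel_i(0, v / 2.0) + v * bessel_i(1, v / 2.0)
    return (
        (math.sqrt(math.pi) / 2.0)
        * (math.sqrt(v) / gamma)
        * math.exp(-v / 2.0)
        * bracket
    )


def wiener_gain(xi):
    """Amplitude gain implied by the Wiener estimator, for comparison."""
    return xi / (1.0 + xi)


def enhance_bin(noisy_coeff, xi, gamma):
    """Scale the noisy amplitude by the gain and reuse the noisy phase."""
    amplitude = mmse_stsa_gain(xi, gamma) * abs(noisy_coeff)
    phase = cmath.phase(noisy_coeff)
    return amplitude * cmath.exp(1j * phase)
```

In this sketch, only the amplitude of each spectral coefficient is modified. The phase of the output equals the phase of the noisy input, which mirrors the abstract's statement that the noisy phase is used when constructing the enhanced signal.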

A. Estimator

[1] I. S. Gradshteyn, et al. Table of Integrals, Series, and Products, 1976.

[2] D. Middleton. An Introduction to Statistical Communication Theory, 1960.

[3] T. Kadota. Optimum reception of binary Gaussian signals, 1964.

[4] David Middleton, et al. Simultaneous optimum detection and estimation of signals in noise, 1968, IEEE Trans. Inf. Theory.

[5] A. V. Oppenheim, et al. Enhancement and bandwidth compression of noisy speech, 1979, Proceedings of the IEEE.

[6] Ronald E. Crochiere, et al. Frequency domain coding of speech, 1979.

[7] Ronald E. Crochiere, et al. A weighted overlap-add method of short-time Fourier analysis/synthesis, 1980.

[8] M. Portnoff. Time-frequency representation of digital signals and systems based on short-time Fourier analysis, 1980.

[9] R. McAulay, et al. Speech enhancement using a soft-decision noise suppression filter, 1980.

[10] D. Paul. The spectral envelope estimation vocoder, 1981.