Speech enhancement for non-stationary noise environments

In this paper, we present an optimally-modi#ed log-spectral amplitude (OM-LSA) speech estimator and a minima controlled recursive averaging (MCRA) noise estimation approach for robust speech enhancement. The spectral gain function, which minimizes the mean-square error of the log-spectra, is obtained as a weighted geometric mean of the hypothetical gains associated with the speech presence uncertainty. The noise estimate is given by averaging past spectral power values, using a smoothing parameter that is adjusted bythe speech presence probabilityin subbands. We introduce two distinct speech presence probabilityfunctions, one for estimating the speech and one for controlling the adaptation of the noise spectrum. The former is based on the time–frequencydistribution of the a priori signal-to-noise ratio. The latter is determined bythe ratio between the local energyof the noisysignal and its minimum within a speci6ed time window. Objective and subjective evaluation under various environmental conditions con6rm the superiorityof the OM-LSA and MCRA estimators. Excellent noise suppression is achieved, while retaining weak speech components and avoiding the musical residual noise phenomena. ? 2001 Elsevier Science B.V. All rights reserved.

[1]  Gerhard Doblinger,et al.  Computationally efficient speech enhancement by spectral minima tracking in subbands , 1995, EUROSPEECH.

[2]  Jin Yang Frequency domain noise suppression approaches in mobile telephone systems , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  S. Biyiksiz,et al.  Multirate digital signal processing , 1985, Proceedings of the IEEE.

[4]  E. Bryan George Single-sensor speech enhancement using a soft-decision/variable attenuation algorithm , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[5]  Richard M. Schwartz,et al.  Enhancement of speech corrupted by acoustic noise , 1979, ICASSP.

[6]  Soo Ngee Koh,et al.  Improved noise suppression filter using self-adaptive estimator of probability of speech absence , 1999, Signal Process..

[7]  P. Green,et al.  Speech enhancement using a ternary-decision based filter , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[8]  Klaus Uwe Simmer,et al.  Kammeyer \Comparison of one-and two-channel noise-estimation techniques , 1997 .

[9]  Rainer Martin,et al.  Spectral Subtraction Based on Minimum Statistics , 2001 .

[10]  K F E2K,et al.  Spectral Enhancement By Tracking Speech Presence Probability In Subbands , 2001 .

[11]  R. McAulay,et al.  Speech enhancement using a soft-decision noise suppression filter , 1980 .

[12]  P. Laguna,et al.  Signal Processing , 2002, Yearbook of Medical Informatics.

[13]  Nam Soo Kim,et al.  Spectral enhancement based on global soft decision , 2000, IEEE Signal Processing Letters.

[14]  Ephraim Speech enhancement using a minimum mean square error short-time spectral amplitude estimator , 1984 .

[15]  Jason Wexler,et al.  Discrete Gabor expansions , 1990, Signal Process..

[16]  Rainer Martin,et al.  An efficient algorithm to estimate the instantaneous SNR of speech signals , 1993, EUROSPEECH.

[17]  Jae S. Lim,et al.  The unimportance of phase in speech enhancement , 1982 .

[18]  Olivier Cappé,et al.  Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor , 1994, IEEE Trans. Speech Audio Process..

[19]  Herman J. M. Steeneken,et al.  Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems , 1993, Speech Commun..

[20]  VargaAndrew,et al.  Assessment for automatic speech recognition II , 1993 .

[21]  Rainer Martin,et al.  Noise power spectral density estimation based on optimal smoothing and minimum statistics , 2001, IEEE Trans. Speech Audio Process..

[22]  Wonyong Sung,et al.  A statistical model-based voice activity detection , 1999, IEEE Signal Processing Letters.

[23]  David Malah,et al.  Speech enhancement using a minimum mean-square error log-spectral amplitude estimator , 1984, IEEE Trans. Acoust. Speech Signal Process..

[24]  Alexander Fischer,et al.  Quantile based noise estimation for spectral subtraction and Wiener filtering , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[25]  Pascal Scalart,et al.  Speech enhancement based on a priori signal to noise estimation , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[26]  David Malah,et al.  Tracking speech-presence uncertainty to improve speech enhancement in non-stationary noise environments , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[27]  Israel Cohen,et al.  On speech enhancement under signal presence uncertainty , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[28]  Schuyler Quackenbush,et al.  Objective measures of speech quality , 1995 .

[29]  Hans-Günter Hirsch,et al.  Noise estimation techniques for robust speech recognition , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[30]  Peter Jax,et al.  Optimized estimation of spectral parameters for the coding of noisy speech , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).