IMPROVED SPEECH SPECTRAL VARIANCE ESTIMATION UNDER THE GENERALIZED GAMMA DISTRIBUTION

DFT-based single-microphone speech enhancement methods need an estimate of the clean speech spectral variance. Often the ”decision-directed” spectral variance estimato r is used, because of its good performance: it strongly reduces the musical noise phenomenon. It has recently been shown that this estimator is severely biased at low SNRs when the smoothing factor approaches one [1]. Here we propose a variance estimator with reduced bias, under the assumption of a generalized Gamma distribution for the clean speech spectral amplitudes. For the reconstruction, the MMSE estimator of the amplitudes themselves is used, derived under the same distribution assumption. For the same speech quality versus noise reduction trade-off, the new variance estimator leads to less musicality.

[1]  Paul R. White,et al.  Mmse Speech Spectral Amplitude Estimators With Chi and Gamma Speech Priors , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[2]  Olivier Cappé,et al.  Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor , 1994, IEEE Trans. Speech Audio Process..

[3]  Rainer Martin,et al.  Speech enhancement based on minimum mean-square error estimation and supergaussian priors , 2005, IEEE Transactions on Speech and Audio Processing.

[4]  Susanto Rahardja,et al.  /spl beta/-order MMSE spectral amplitude estimation for speech enhancement , 2005, IEEE Transactions on Speech and Audio Processing.

[5]  Richard Heusdens,et al.  MINIMUM MEAN-SQUARE ERROR AMPLITUDE ESTIMATORS FOR SPEECH ENHANCEMENT UNDER THE GENERALIZED GAMMA DISTRIBUTION , 2006 .

[6]  Ephraim Speech enhancement using a minimum mean square error short-time spectral amplitude estimator , 1984 .

[7]  Peter Vary,et al.  Speech Enhancement by MAP Spectral Amplitude Estimation Using a Super-Gaussian Speech Model , 2005, EURASIP J. Adv. Signal Process..

[8]  Philipos C. Loizou,et al.  Speech enhancement based on perceptually motivated bayesian estimators of the magnitude spectrum , 2005, IEEE Transactions on Speech and Audio Processing.

[9]  Herman J. M. Steeneken,et al.  Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems , 1993, Speech Commun..

[10]  Jesper Jensen,et al.  A data-driven approach to optimizing spectral speech enhancement methods for various error criteria , 2007, Speech Commun..

[11]  Pierre Moulin,et al.  On the equivalence of set-theoretic and maxent MAP estimation , 2003, IEEE Trans. Signal Process..

[12]  Yi Hu,et al.  Subjective Comparison of Speech Enhancement Algorithms , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[13]  Irene A. Stegun,et al.  Handbook of Mathematical Functions. , 1966 .

[14]  R. Heusdens,et al.  MMSE Estimation of Discrete Fourier Coefficients with Generalized Gamma Priors , 2006 .