Paper information - Empirical Distributions of DFT-Domain Speech Coefficients Based on Estimated Speech Variances

We present a novel way to estimate the empirical distribution of clean speech spectral coefficients. Rather than computing the histogram of clean speech within a certain signal-to-noise ratio interval, we normalize the spectral coefficients by the square root of the spectral variance, which is estimated via recursive averaging, the decision-directed approach, or temporal cepstrum smoothing. We show that the estimated distributions depend significantly on the spectral variance estimator used. Further, if the speech spectral variance is estimated in noisy conditions, the resulting histograms exhibit heavier tails than in clean conditions. The cepstral variance estimation approach is shown to result in lighter tails than the decision-directed approach.
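The normalization step described in the abstract can be illustrated with a minimal sketch. It covers only the recursive-averaging variant; the decision-directed and temporal cepstrum smoothing estimators are not shown. The function names (`stft`, `recursive_variance`, `normalized_histogram`), the Hann window, the frame length, the smoothing constant `alpha`, the histogram range, and the synthetic Laplacian test signal are all assumptions made for illustration and are not taken from the paper.

```python
import numpy as np


def stft(x, frame_len=256, hop=128):
    """Short-time DFT with a Hann window; returns shape (frames, bins)."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(x) - frame_len) // hop
    frames = np.stack(
        [x[i * hop:i * hop + frame_len] * window for i in range(n_frames)]
    )
    return np.fft.rfft(frames, axis=1)


def recursive_variance(S, alpha=0.9):
    """Estimate the spectral variance by first-order recursive averaging
    of |S|^2 along the frame axis (alpha is an assumed smoothing constant)."""
    power = np.abs(S) ** 2
    var = np.empty_like(power)
    var[0] = power[0]  # initialize with the first periodogram
    for frame in range(1, len(power)):
        var[frame] = alpha * var[frame - 1] + (1.0 - alpha) * power[frame]
    return var


def normalized_histogram(S, var, bins=101, limit=5.0, floor=1e-12):
    """Histogram of the real parts of S / sqrt(var), pooled over all
    frames and frequency bins and scaled to integrate to one."""
    normalized = S / np.sqrt(np.maximum(var, floor))  # floor avoids 0-division
    pdf, edges = np.histogram(
        normalized.real.ravel(), bins=bins, range=(-limit, limit), density=True
    )
    centers = 0.5 * (edges[:-1] + edges[1:])
    return pdf, centers


# Synthetic stand-in for clean speech (assumption): amplitude-modulated
# Laplacian noise. Replace with real speech samples to obtain meaningful results.
rng = np.random.default_rng(0)
n = np.arange(16000)
x = rng.laplace(size=n.size) * (1.0 + 0.9 * np.sin(2 * np.pi * n / 4000))

S = stft(x)
pdf, centers = normalized_histogram(S, recursive_variance(S, alpha=0.9))
```

Changing `alpha`, or replacing `recursive_variance` with another variance estimator, changes the shape of the resulting histogram, which is the dependence the abstract refers to.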

Timo Gerkmann | Rainer Martin
