论文信息 - A speech enhancement approach using piecewise linear approximation of an explicit model of environmental distortions

A speech enhancement approach using piecewise linear approximation of an explicit model of environmental distortions

This paper presents a speech enhancement approach derived by using a piecewise linear approximation (PLA) of an explicit model of environmental distortions. PLA is a generalization of two traditional approaches, namely vector Taylor series (VTS) and MAX approximations. Formulations are described for both maximum likelihood (ML) estimation of noise model parameters and minimum mean-squared error (MMSE) estimation of clean speech. Evaluation experiments are conducted to enhance speech signals corrupted by several types of additive noises. Compared to the traditional MAX-approximation based approach, our PLA-based speech enhancement approach achieves better performance in terms of two objective quality measures, namely segmental SNR and log-spectral distortion.

Jun Du | Qiang Huo | Qiang Huo | Jun Du

[1] Jacob Benesty,et al. Spectral Enhancement Methods , 2009 .

[2] David Pearce,et al. The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions , 2000, INTERSPEECH.

[3] Schuyler Quackenbush,et al. Objective measures of speech quality , 1995 .

[4] Yariv Ephraim,et al. A Bayesian estimation approach for speech enhancement using hidden Markov models , 1992, IEEE Trans. Signal Process..

[5] A.V. Oppenheim,et al. Enhancement and bandwidth compression of noisy speech , 1979, Proceedings of the IEEE.

[6] Biing-Hwang Juang,et al. On the application of hidden Markov models for enhancing noisy speech , 1989, IEEE Trans. Acoust. Speech Signal Process..

[7] Chong Kwan Un,et al. Speech recognition in noisy environments using first-order vector Taylor series , 1998, Speech Commun..

[8] Jae S. Lim,et al. Signal estimation from modified short-time Fourier transform , 1983, ICASSP.

[9] Sharon Gannot,et al. Speech enhancement using a mixture-maximum model , 1999, IEEE Trans. Speech Audio Process..

[10] Yariv Ephraim,et al. Statistical-model-based speech enhancement systems , 1992, Proc. IEEE.

[11] Richard M. Stern,et al. A vector Taylor series approach for environment-independent speech recognition , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[12] Douglas A. Reynolds,et al. Integrated models of signal and background with application to speaker identification in noise , 1994, IEEE Trans. Speech Audio Process..

[13] Michael Picheny,et al. Speech recognition using noise-adaptive prototypes , 1989, IEEE Trans. Acoust. Speech Signal Process..

[14] Jun Du,et al. A feature compensation approach using piecewise linear approximation of an explicit distortion model for noisy speech recognition , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.