Paper information - A constrained sequential EM algorithm for speech enhancement

Speech enhancement is a fundamental problem whose goal is to estimate clean speech s(t) from a noise-contaminated signal s(t)+n(t), where n(t) is white or colored noise. The task can be viewed as a probabilistic inference problem: estimating the posterior distribution of the hidden clean speech given a noisy observation. The Kalman filter is a representative method, but it is restricted to Gaussian distributions. We consider the generalized auto-regressive (GAR) model in order to capture the non-Gaussian characteristics of speech. We then present a constrained sequential EM algorithm in which Rao-Blackwellized particle filters (RBPFs) are used in the E-step and the model parameters are updated sequentially in the M-step under positivity constraints on the noise variance parameters. Numerical experiments confirm the high performance of the proposed method, compared with Kalman filter-based methods, in the task of sequential speech enhancement.
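For orientation, the Gaussian baseline that the abstract contrasts against can be sketched in a few lines. This is only a minimal illustration, not the paper's method: it assumes a scalar AR(1) speech model s_t = a*s_{t-1} + v_t observed as y_t = s_t + n_t with white Gaussian noise, and it treats the coefficient and variances as known. The names `kalman_ar1`, `simulate`, `mse`, and the parameters `a`, `q`, `r` are hypothetical and chosen here for illustration. The GAR model, the RBPF E-step, and the constrained M-step are not implemented.

```python
import random


def kalman_ar1(y, a, q, r, x0=0.0, p0=1.0):
    """Scalar Kalman filter for s_t = a*s_{t-1} + v_t, y_t = s_t + n_t.

    a: AR(1) coefficient (assumed known)
    q: variance of the driving noise v_t
    r: variance of the observation noise n_t
    Returns the filtered estimates of s_t.
    """
    x, p = x0, p0
    estimates = []
    for obs in y:
        # Predict step: propagate the mean and variance through the AR model.
        x_pred = a * x
        p_pred = a * a * p + q
        # Update step: blend prediction and observation via the Kalman gain.
        k = p_pred / (p_pred + r)
        x = x_pred + k * (obs - x_pred)
        p = (1.0 - k) * p_pred
        estimates.append(x)
    return estimates


def simulate(n, a, q, r, seed=0):
    """Draw a synthetic AR(1) 'clean' signal and its noisy observation."""
    rng = random.Random(seed)
    s, clean, noisy = 0.0, [], []
    for _ in range(n):
        s = a * s + rng.gauss(0.0, q ** 0.5)
        clean.append(s)
        noisy.append(s + rng.gauss(0.0, r ** 0.5))
    return clean, noisy


def mse(u, v):
    """Mean squared error between two equal-length sequences."""
    return sum((ui - vi) ** 2 for ui, vi in zip(u, v)) / len(u)


if __name__ == "__main__":
    clean, noisy = simulate(2000, a=0.95, q=0.1, r=1.0)
    filtered = kalman_ar1(noisy, a=0.95, q=0.1, r=1.0)
    print("MSE of noisy signal:   ", mse(noisy, clean))
    print("MSE of filtered signal:", mse(filtered, clean))
```

On synthetic data of this kind the filtered signal should sit closer to the clean one than the raw observation does. The restriction noted in the abstract is visible in the code: both noise terms enter only through their variances, which is exact only in the Gaussian case.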

Sunho Park | Seungjin Choi
