A constrained sequential EM algorithm for speech enhancement

Speech enhancement is a fundamental problem, the goal of which is to estimate clean speech s(t), given a noise-contaminated signal s(t)+n(t), where n(t) is white or colored noise. This task can be viewed as a probabilistic inference problem which involves estimating the posterior distribution of hidden clean speech, given a noisy observation. Kalman filter is a representative method but is restricted to Gaussian distributions only. We consider the generalized auto-regressive (GAR) model in order to capture the non-Gaussian characteristics of speech. Then we present a constrained sequential EM algorithm where Rao-Blackwellized particle filters (RBPFs) are used in the E-step and model parameters are updated in a sequential manner in the M-step under positivity constraints for noise variance parameters. Numerical experiments confirm the high performance of our proposed method, compared to Kalman filter-based methods, in the task of sequential speech enhancement.

[1]  Nando de Freitas,et al.  Rao-Blackwellised Particle Filtering for Dynamic Bayesian Networks , 2000, UAI.

[2]  Sunho Park,et al.  Rao-Blackwellized Particle Filtering for Sequential Speech Enhancement , 2006, The 2006 IEEE International Joint Conference on Neural Network Proceedings.

[3]  G. C. Tiao,et al.  Bayesian inference in statistical analysis , 1973 .

[4]  G. Casella,et al.  Rao-Blackwellisation of sampling schemes , 1996 .

[5]  S. Boll,et al.  Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[6]  A. Gray,et al.  Distance measures for speech processing , 1976 .

[7]  Alan V. Oppenheim,et al.  All-pole modeling of degraded speech , 1978 .

[8]  N. D. Freitas Rao-Blackwellised particle filtering for fault diagnosis , 2002 .

[9]  Andrzej Cichocki,et al.  Flexible Independent Component Analysis , 2000, J. VLSI Signal Process..

[10]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[11]  Xuemin Shen,et al.  A dynamic system approach to speech enhancement using the H∞ filtering algorithm , 1999, IEEE Trans. Speech Audio Process..

[12]  Ehud Weinstein,et al.  Iterative and sequential algorithms for multisensor signal enhancement , 1994, IEEE Trans. Signal Process..

[13]  Simon J. Godsill,et al.  Particle methods for Bayesian modeling and enhancement of speech signals , 2002, IEEE Trans. Speech Audio Process..

[14]  Simon J. Godsill,et al.  On sequential Monte Carlo sampling methods for Bayesian filtering , 2000, Stat. Comput..

[15]  Kuldip K. Paliwal,et al.  A speech enhancement method based on Kalman filtering , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[16]  Meir Feder,et al.  Recursive expectation-maximization (EM) algorithms for time-varying parameters with applications to multiple target tracking , 1999, IEEE Trans. Signal Process..

[17]  D. Titterington Recursive Parameter Estimation Using Incomplete Data , 1984 .

[18]  Ehud Weinstein,et al.  Iterative and sequential Kalman filter-based speech enhancement algorithms , 1998, IEEE Trans. Speech Audio Process..

[19]  G. Kitagawa Monte Carlo Filter and Smoother for Non-Gaussian Nonlinear State Space Models , 1996 .

[20]  Yariv Ephraim,et al.  A signal subspace approach for speech enhancement , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[21]  Hamid Sheikhzadeh,et al.  HMM-based strategies for enhancement of speech signals embedded in nonstationary noise , 1998, IEEE Trans. Speech Audio Process..