Speech enhancement as a realisation issue

When enhancing a speech signal using a single microphone system, various approaches based on an autoregressive speech model are referenced in the literature. Using a Kalman filter, they operate in two steps: (1) the noise variances and the autoregressive parameters are estimated, (2) the speech signal is retrieved using standard Kalman filtering. However the existing methods are usually iterative and a voice activity detector (VAD) is often required to find the silent frames for the estimation of the variance of the white noise. To avoid these drawbacks, we propose to consider Kalman filter-based speech enhancement as a realisation issue, i.e. as the estimation of the system matrices in the state space representation using the estimation of the correlation function of the observations. For this purpose, we first present various solutions, based on works initially developed in the field of identification by Van Overschee et al. and Verhaegen. Their non-iterative extensions to coloured noise are also addressed and used with car noise. In the second part of the paper we propose an alternative approach based on Mehra et al. and Belanger's approaches dealing with the estimation of the steady Kalman gain and previously derived in the framework of identification. This approach still avoids the use of a VAD.

[1]  Mats Viberg,et al.  Subspace-based methods for the identification of linear time-invariant systems , 1995, Autom..

[2]  Charles W. Therrien,et al.  Discrete Random Signals and Statistical Signal Processing , 1992 .

[3]  Marc Moonen,et al.  A stochastic subspace algorithm for blind channel identification in noise fields with unknown spatial covariance , 2000, Signal Process..

[4]  B. Friedlander,et al.  The Modified Yule-Walker Method of ARMA Spectral Estimation , 1984, IEEE Transactions on Aerospace and Electronic Systems.

[6]  Petre Stoica,et al.  A test for whiteness , 1977 .

[7]  P. Belanger,et al.  Identification of optimum filter steady-state gain for systems with unknown noise covariances , 1973 .

[8]  A. Bryson,et al.  Linear filtering for time-varying systems using measurements containing colored noise , 1965 .

[9]  Patrick Dewilde,et al.  Subspace model identification Part 1. The output-error state-space model identification class of algorithms , 1992 .

[10]  Bart De Moor,et al.  The singular value decomposition and long and short spaces of noisy matrices , 1993, IEEE Trans. Signal Process..

[11]  Ronald W. Schafer,et al.  Digital Processing of Speech Signals , 1978 .

[12]  Søren Holdt Jensen,et al.  Reduction of broad-band noise in speech by truncated QSVD , 1995, IEEE Trans. Speech Audio Process..

[13]  Eric A. Wan,et al.  Removal of noise from speech using the dual EKF algorithm , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[14]  S. D. Gray,et al.  Filtering of colored noise for speech enhancement and coding , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[15]  Kah-Chye Tan,et al.  Kalman-filtering speech enhancement method based on a voiced-unvoiced speech model , 1999, IEEE Trans. Speech Audio Process..

[16]  R. Mehra On the identification of variances and adaptive Kalman filtering , 1970 .

[17]  R. Mehra On-line identification of linear dynamic systems with applications to Kalman filtering , 1971 .

[18]  Mohamed Najim,et al.  A single microphone Kalman filter-based noise canceller , 1999, IEEE Signal Processing Letters.

[19]  Steven F. Boll A spectral subtraction algorithm for suppression of acoustic noise in speech , 1979, ICASSP.

[20]  Kuldip K. Paliwal,et al.  A speech enhancement method based on Kalman filtering , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[21]  Ephraim Speech enhancement using a minimum mean square error short-time spectral amplitude estimator , 1984 .

[22]  Wallace E. Larimore,et al.  Canonical variate analysis in identification, filtering, and adaptive control , 1990, 29th IEEE Conference on Decision and Control.

[23]  Bart De Moor,et al.  A unifying theorem for three subspace system identification algorithms , 1995, Autom..

[24]  P. Overschee,et al.  About the choice of state space basis in combined deterministic-stochastic subspace identification , 1995 .

[25]  Bart De Moor,et al.  Subspace algorithms for the stochastic identification problem, , 1993, Autom..

[26]  J. Mendel Lessons in Estimation Theory for Signal Processing, Communications, and Control , 1995 .

[27]  H. Akaike Markovian Representation of Stochastic Processes by Canonical Variables , 1975 .

[28]  John E. Markel,et al.  Linear Prediction of Speech , 1976, Communication and Cybernetics.

[29]  S. Huffel,et al.  A fast algorithm for subspace state-space system identification via exploitation of the displacement structure , 2001 .

[30]  Bart De Moor,et al.  N4SID: Subspace algorithms for the identification of combined deterministic-stochastic systems , 1994, Autom..

[31]  Ehud Weinstein,et al.  Iterative and sequential Kalman filter-based speech enhancement algorithms , 1998, IEEE Trans. Speech Audio Process..

[32]  Michel Verhaegen,et al.  Identification of the deterministic part of MIMO state space models given in innovations form from input-output data , 1994, Autom..

[33]  Lennart Ljung,et al.  System Identification: Theory for the User , 1987 .

[34]  R. McAulay,et al.  Speech enhancement using a soft-decision noise suppression filter , 1980 .

[35]  Gene H. Golub,et al.  Matrix computations , 1983 .

[36]  Dimitrie C. Popescu,et al.  Kalman filtering of colored noise for speech enhancement , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[37]  Mohamed A. Deriche AR parameter estimation from noisy data using the EM algorithm , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[38]  Ehud Weinstein,et al.  Single-sensor active noise cancellation , 1994, IEEE Trans. Speech Audio Process..

[39]  Thomas Kailath,et al.  Linear Systems , 1980 .