Subband Kalman filtering for speech enhancement

Kalman filtering is an effective speech-enhancement technique in which speech signals are usually modeled as autoregressive (AR) processes and represented in the state-space domain. Because both AR-coefficient identification and Kalman filtering are computationally expensive, real-time implementation of this approach is difficult. This paper proposes a simple and practical scheme that overcomes these obstacles. Speech signals are first decomposed into subbands. The subband signals are then modeled as low-order AR processes, so that low-order Kalman filters can be applied. The enhanced fullband speech signal is finally obtained by combining the enhanced subband signals. To identify the AR coefficients, prediction-error filters adapted by the LMS algorithm are applied. Because the inputs are noisy, the LMS algorithm converges to biased solutions. The performance of the Kalman filter with biased parameters is analyzed, and it is shown that accurate estimates of the AR coefficients are not required when the driving-noise variance is properly estimated. New methods for estimating the driving-noise variance are proposed. Thus, biased AR coefficients can be tolerated and the simple structure of the LMS algorithm can be exploited. Simulation results show that speech enhancement in the subband domain not only greatly reduces computational complexity but also achieves better performance than enhancement in the fullband domain.
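
As an illustration of the processing applied within a single subband, the following minimal Python sketch adapts a low-order predictor with plain LMS on the noisy signal and then runs a Kalman filter of the same order. It is a sketch under stated assumptions, not the paper's implementation: the function names, the step size, the synthetic AR(2) test signal, and the known observation-noise variance `r` are all hypothetical choices, and the filter-bank analysis and synthesis stages are omitted. In particular, `estimate_driving_variance` is a simple heuristic stand-in, not one of the estimators proposed in the paper.

```python
import random


def lms_predictor(y, order=2, mu=0.005):
    """Adapt a linear predictor on y with plain LMS; return final weights and errors."""
    w = [0.0] * order
    errors = []
    for n in range(order, len(y)):
        past = [y[n - 1 - i] for i in range(order)]   # y(n-1), ..., y(n-order)
        e = y[n] - sum(wi * pi for wi, pi in zip(w, past))
        w = [wi + mu * e * pi for wi, pi in zip(w, past)]
        errors.append(e)
    return w, errors


def estimate_driving_variance(errors, w, r, floor=1e-6):
    """Heuristic (an assumption, not the paper's method): remove the share of the
    prediction-error power attributable to observation noise of variance r."""
    mse = sum(e * e for e in errors) / len(errors)
    return max(mse - r * (1.0 + sum(wi * wi for wi in w)), floor)


def kalman_ar(y, a, q, r):
    """Kalman filter for s(n) = sum_i a[i] s(n-1-i) + u(n), y(n) = s(n) + v(n)."""
    p = len(a)
    x = [0.0] * p                                    # state: s(n), ..., s(n-p+1)
    P = [[float(i == j) for j in range(p)] for i in range(p)]
    # Companion matrix: first row holds the AR coefficients, the rest shift the state.
    F = [list(a)] + [[float(j == i) for j in range(p)] for i in range(p - 1)]
    out = []
    for yn in y:
        # Time update: x <- F x, P <- F P F^T + q g g^T with g = [1, 0, ..., 0]
        x = [sum(F[i][j] * x[j] for j in range(p)) for i in range(p)]
        FP = [[sum(F[i][k] * P[k][j] for k in range(p)) for j in range(p)]
              for i in range(p)]
        P = [[sum(FP[i][k] * F[j][k] for k in range(p)) for j in range(p)]
             for i in range(p)]
        P[0][0] += q
        # Measurement update with h = [1, 0, ..., 0]
        gain = [P[i][0] / (P[0][0] + r) for i in range(p)]
        innov = yn - x[0]
        x = [x[i] + gain[i] * innov for i in range(p)]
        P = [[P[i][j] - gain[i] * P[0][j] for j in range(p)] for i in range(p)]
        out.append(x[0])
    return out


def demo(n=4000, seed=0):
    """Synthetic AR(2) signal in white noise (hypothetical test case)."""
    rng = random.Random(seed)
    a_true, q_true, r = [1.2, -0.6], 1.0, 1.0
    s = [0.0, 0.0]
    for _ in range(n):
        s.append(a_true[0] * s[-1] + a_true[1] * s[-2] + rng.gauss(0.0, q_true ** 0.5))
    s = s[2:]
    y = [sn + rng.gauss(0.0, r ** 0.5) for sn in s]

    w, errors = lms_predictor(y)                     # biased AR estimate from noisy data
    q_hat = estimate_driving_variance(errors, w, r)
    enhanced = kalman_ar(y, w, q_hat, r)
    reference = kalman_ar(y, a_true, q_true, r)      # filter with the true parameters

    def mse(est):
        return sum((e - t) ** 2 for e, t in zip(est, s)) / n

    return mse(y), mse(enhanced), mse(reference)


if __name__ == "__main__":
    print("MSE noisy / LMS-driven Kalman / true-parameter Kalman:", demo())
```

Running the script prints the mean-square error of the noisy input, of the Kalman filter driven by the LMS estimate, and of a reference Kalman filter that uses the true parameters, so the effect of the biased coefficients can be inspected directly. In a subband scheme, the same two routines would be applied independently to each decimated band before synthesis.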
