Robustness metric-based tuning of the augmented Kalman filter for the enhancement of speech corrupted with coloured noise

Abstract In this paper, we describe a tuning method based on a robustness metric and extended to work with the augmented Kalman filter for enhancing coloured-noise-corrupted speech. The method proposed within utilises the robustness metric to provide dynamic and adaptive tuning of the Kalman filter gain in order to reduce the residual noise that results from poor speech model estimates. An analysis of the Kalman filter recursion equations is presented that augments the robustness metric equations to include coloured noise model parameters. Objective and blind AB subjective listening tests were performed on the NOIZEUS speech corpus for both white and coloured noises with the results being compared with the MMSE method. In the blind AB subjective testing, the 15 English-speaking listeners showed preference for the proposed method over both the MMSE and oracle Kalman filter methods (where clean speech parameters were used). These results imply that the proposed tuned Kalman filter produces more perceptibly-acceptable enhanced speech than the oracle Kalman filter, which is considered the ideal for this enhancement technique.

[1]  Kuldip K. Paliwal,et al.  Single-channel speech enhancement using spectral subtraction in the short-time modulation domain , 2010, Speech Commun..

[2]  Jun Du,et al.  An Experimental Study on Speech Enhancement Based on Deep Neural Networks , 2014, IEEE Signal Processing Letters.

[3]  S. Boll,et al.  Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[4]  Susanto Rahardja,et al.  Subband Kalman filtering incorporating masking properties for noisy speech signal , 2007, Speech Commun..

[5]  Kuldip K. Paliwal,et al.  Suppressing the influence of additive noise on the Kalman gain for low residual noise speech enhancement , 2011, Speech Commun..

[6]  Yariv Ephraim,et al.  A signal subspace approach for speech enhancement , 1995, IEEE Trans. Speech Audio Process..

[7]  Kuldip K. Paliwal,et al.  A speech enhancement method based on Kalman filtering , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[8]  Kuldip K. Paliwal,et al.  Kalman fitler with phase spectrum compensation algorithm for speech enhancement , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[9]  Kuldip K. Paliwal,et al.  Modulation-domain Kalman filtering for single-channel speech enhancement , 2011, Speech Commun..

[10]  Li-Rong Dai,et al.  A Regression Approach to Speech Enhancement Based on Deep Neural Networks , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[11]  Philipos C. Loizou,et al.  Speech Enhancement: Theory and Practice , 2007 .

[12]  S. D. Gray,et al.  Filtering of colored noise for speech enhancement and coding , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[13]  Andries P. Hekstra,et al.  Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[14]  A. D. Hestenes The extrapolation, interpolation and smoothing of stationary time series with engineering applications: by Norbert Wiener. 163 pages, 15 × 24 cm. New York, John Wiley & Sons, Inc., 1949. Price, $4.00 , 1950 .

[15]  David Malah,et al.  Speech enhancement using a minimum mean-square error log-spectral amplitude estimator , 1984, IEEE Trans. Acoust. Speech Signal Process..

[16]  Kuldip K. Paliwal,et al.  A Non-Iterative Kalman Filtering Algorithm with Dynamic Gain Adjustment for Single-Channel Speech Enhancement , 2016 .

[17]  Kuldip K. Paliwal,et al.  Kalman Filter with Sensitivity Tuning for Improved Noise Reduction in Speech , 2017, Circuits Syst. Signal Process..

[18]  Yi Hu,et al.  Speech enhancement based on wavelet thresholding the multitaper spectrum , 2004, IEEE Transactions on Speech and Audio Processing.

[19]  Wen-Rong Wu,et al.  Subband Kalman filtering for speech enhancement , 1998 .

[20]  Manika Saha,et al.  Robustness and Sensitivity Metrics for Tuning the Extended Kalman Filter , 2014, IEEE Transactions on Instrumentation and Measurement.

[21]  Kuldip K. Paliwal,et al.  Single Channel Speech Enhancement Using MMSE Estimation of Short-Time Modulation Magnitude Spectrum , 2011, INTERSPEECH.

[22]  J. Makhoul,et al.  Linear prediction: A tutorial review , 1975, Proceedings of the IEEE.

[23]  Kuldip K. Paliwal,et al.  Exploiting Conjugate Symmetry of the Short-Time Fourier Spectrum for Speech Enhancement , 2008, IEEE Signal Processing Letters.

[24]  T. Başar,et al.  A New Approach to Linear Filtering and Prediction Problems , 2001 .