Evaluation of Robust Constrained MFMVDR Filtering for Single-Channel Speech Enhancement

By considering the multi-frame signal model, speech correlation between different time-frames can be exploited. Based on this signal model, the multi-frame minimum variance distortionless response (MFMVDR) filter for single-channel speech enhancement has been derived, which minimizes the total signal output power while avoiding speech distortion. It has been shown that the MFMVDR filter is very sensitive to estimation errors in the speech correlation vector resulting in correlated speech components being mistakenly suppressed. Inspired by robust beamforming approaches, in this paper we propose a robust constrained MFMVDR filter for single-channel speech enhancement by estimating the speech correlation vector that maximizes the total signal output power within a spherical uncertainty set. For the upper bound of the spherical uncertainty set, we propose to use a trained mapping function that depends on the a-priori signal-to-noise ratio (SNR). Experimental results for different noise types and SNRs show that the proposed robust approach yields a more accurate estimate of the speech correlation vector. A perceptual evaluation shows that the robust constrained MFMVDR filter leads to an improved speech quality but a lower noise reduction than the original non-robust MFMVDR filter, while still being preferred in overall quality.

[1]  R. McAulay,et al.  Speech enhancement using a soft-decision noise suppression filter , 1980 .

[2]  Emanuel A. P. Habets,et al.  Combined Single-Microphone Wiener and MVDR Filtering based on Speech Interframe Correlations and Speech Presence Probability , 2016, ITG Symposium on Speech Communication.

[3]  Sven Nordholm,et al.  Multichannel Signal Enhancement Algorithms for Assisted Listening Devices: Exploiting spatial diversity using multiple microphones , 2015, IEEE Signal Processing Magazine.

[4]  Timo Gerkmann,et al.  An evaluation of the perceptual quality of phase-aware single-channel speech enhancement. , 2016, The Journal of the Acoustical Society of America.

[5]  Jacob Benesty,et al.  A Multi-Frame Approach to the Frequency-Domain Single-Channel Noise Reduction Problem , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[6]  W. Marsden I and J , 2012 .

[7]  Emanuel A. P. Habets,et al.  Speech Enhancement in the STFT Domain , 2011, Springer Briefs in Electrical and Computer Engineering.

[8]  E. Hänsler,et al.  Acoustic Echo and Noise Control: A Practical Approach , 2004 .

[9]  Sergiy A. Vorobyov,et al.  Robust Adaptive Beamforming Based on Steering Vector Estimation With as Little as Possible Prior Information , 2012, IEEE Transactions on Signal Processing.

[10]  Jingdong Chen,et al.  Experimental study of robust beamforming techniques for acoustic applications , 2017, 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[11]  Jacob Benesty,et al.  A single-channel noise reduction MVDR filter , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[12]  Simon Doclo,et al.  Sensitivity analysis of the multi-frame MVDR filter for single-microphone speech enhancement , 2017, 2017 25th European Signal Processing Conference (EUSIPCO).

[13]  Jesper Jensen,et al.  DFT-Domain Based Single-Microphone Noise Reduction for Speech Enhancement , 2013, DFT-Domain Based Single-Microphone Noise Reduction for Speech Enhancement.

[14]  Sergiy A. Vorobyov,et al.  Principles of minimum variance robust adaptive beamforming design , 2013, Signal Process..

[15]  Rainer Martin,et al.  Estimation of Subband Speech Correlations for Noise Reduction via MVDR Processing , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[16]  Jian Li,et al.  On robust Capon beamforming and diagonal loading , 2003, IEEE Trans. Signal Process..

[17]  Marc Moonen,et al.  Robust Speech-Distortion Weighted Interframe Wiener Filters for Single-Channel Noise Reduction , 2018, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[18]  B.D. Van Veen,et al.  Beamforming: a versatile approach to spatial filtering , 1988, IEEE ASSP Magazine.