Sensitivity analysis of the multi-frame MVDR filter for single-microphone speech enhancement

Recently, a multi-frame minimum variance distortionless response (MFMVDR) filter for single-microphone noise reduction has been proposed, which exploits speech correlation across consecutive time frames. It has been shown that the MFMVDR filter achieves impressive results when the speech interframe correlation vector can be accurately estimated. In this paper, we analyze the influence of estimation errors for all required parameters, i.e., the speech interframe correlation vector and the undesired correlation matrix, on the performance of the MFMVDR filter. We compare the performance difference between oracle estimators and practically feasible blind estimators. Experimental results show that even small estimation errors substantially degrade the speech quality, where the most critical parameter is the speech interframe correlation vector.

[1]  R. McAulay,et al.  Speech enhancement using a soft-decision noise suppression filter , 1980 .

[2]  Jacob Benesty,et al.  A single-channel noise reduction MVDR filter , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[3]  Philipos C. Loizou,et al.  Speech Enhancement: Theory and Practice , 2007 .

[4]  B.D. Van Veen,et al.  Beamforming: a versatile approach to spatial filtering , 1988, IEEE ASSP Magazine.

[5]  Ephraim Speech enhancement using a minimum mean square error short-time spectral amplitude estimator , 1984 .

[6]  Jesper Jensen,et al.  DFT-Domain Based Single-Microphone Noise Reduction for Speech Enhancement , 2013, DFT-Domain Based Single-Microphone Noise Reduction for Speech Enhancement.

[7]  E. Hänsler,et al.  Acoustic Echo and Noise Control: A Practical Approach , 2004 .

[8]  Jacob Benesty,et al.  A Multi-Frame Approach to the Frequency-Domain Single-Channel Noise Reduction Problem , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[9]  Olivier Cappé,et al.  Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor , 1994, IEEE Trans. Speech Audio Process..

[10]  Emanuel A. P. Habets,et al.  Combined Single-Microphone Wiener and MVDR Filtering based on Speech Interframe Correlations and Speech Presence Probability , 2016, ITG Symposium on Speech Communication.

[11]  David Malah,et al.  Speech enhancement using a minimum mean-square error log-spectral amplitude estimator , 1984, IEEE Trans. Acoust. Speech Signal Process..

[12]  Rainer Martin,et al.  Estimation of Subband Speech Correlations for Noise Reduction via MVDR Processing , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[13]  Schuyler Quackenbush,et al.  Objective measures of speech quality , 1995 .