Multichannel Wiener filter estimation using source location knowledge for speech enhancement

In this paper a technique for estimating the single channel Wiener filter post-processor using two complementary adaptive near-field beamformers is presented as an alternative to voice activity detection for speech enhancement applications. Two near-field beamformers, the MVDR beamformer and an adaptive nullformer based on noise to signal maximisation, are used to generate estimates of signal and noise statistics which can be used to compute an estimate of the single channel Wiener filter for noise reduction. It is demonstrated that the performance of the estimated filter compares well with the perfect Wiener filtering case, and shows good improvement in speech intelligibility.

[1]  H. Vincent Poor,et al.  IEEE Workshop on Statistical Signal Processing, SSP 2014, Gold Coast, Australia, June 29 - July 2, 2014 , 2014, Symposium on Software Performance.

[2]  Jont B. Allen,et al.  Image method for efficiently simulating small‐room acoustics , 1976 .

[3]  Sven Nordholm,et al.  A subband space constrained beamformer incorporating voice activity detection [speech enhancement applications] , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[4]  Methods for objective and subjective assessment of quality Perceptual evaluation of speech quality ( PESQ ) : An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs , 2002 .

[5]  R.A. Kennedy,et al.  Spatial correlation for general distributions of scatterers , 2002, IEEE Signal Processing Letters.

[6]  Hervé Bourlard,et al.  Microphone array post-filter based on noise field coherence , 2003, IEEE Trans. Speech Audio Process..

[7]  R. Kress,et al.  Inverse Acoustic and Electromagnetic Scattering Theory , 1992 .

[8]  Israel Cohen,et al.  Speech enhancement based on the general transfer function GSC and postfiltering , 2003, IEEE Transactions on Speech and Audio Processing.

[9]  Petros Maragos,et al.  A generalized estimation approach for linear and nonlinear microphone array post-filters , 2007, Speech Commun..

[10]  L. J. Griffiths,et al.  An alternative approach to linearly constrained adaptive beamforming , 1982 .

[11]  Sven Nordholm,et al.  Space constrained beamforming with source PSD updates , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[12]  Torsten Dau,et al.  The Effect of a Voice Activity Detector on the Speech Enhancement Performance of the Binaural Multichannel Wiener Filter , 2010, EURASIP J. Audio Speech Music. Process..

[13]  Stephen P. Boyd,et al.  Robust minimum variance beamforming , 2005, IEEE Transactions on Signal Processing.

[14]  Cha Zhang,et al.  Enhanced MVDR Beamforming for Arrays of Directional Microphones , 2007, 2007 IEEE International Conference on Multimedia and Expo.

[15]  Ehud Weinstein,et al.  Signal enhancement using beamforming and nonstationarity with applications to speech , 2001, IEEE Trans. Signal Process..