Speech Enhancement With a GSC-Like Structure Employing Eigenvector-Based Transfer Function Ratios Estimation

In this paper, we present a novel blocking matrix and fixed beamformer design for a generalized sidelobe canceler (GSC) for speech enhancement in a reverberant enclosure. Both are based on a new method for estimating the acoustic transfer function ratios in the presence of stationary noise. The estimation method relies on solving a generalized eigenvalue problem in each frequency bin. Adaptive eigenvector tracking based on the power iteration method is employed and shown to achieve a high convergence speed. Simulation results demonstrate that the proposed beamformer achieves better noise and interference reduction and lower speech distortion than other blocking matrix designs from the literature.
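As an illustration of the idea of obtaining transfer function ratios from a generalized eigenvalue problem via power iteration, the following is a minimal sketch for a single frequency bin. It is not the paper's algorithm: it assumes a rank-one speech model, in which the noisy-speech PSD matrix equals sigma_s^2 h h^H plus the noise PSD matrix, and it assumes both matrices are known exactly. All names (`phi_x`, `phi_n`, `principal_generalized_eigenvector`, `transfer_function_ratios`) are hypothetical. Under that model, the principal generalized eigenvector w satisfies phi_n w ∝ h, so normalizing phi_n w by its reference-microphone entry gives the ratios.

```python
import numpy as np


def principal_generalized_eigenvector(phi_x, phi_n, num_iters=100):
    """Power iteration for the dominant solution of phi_x w = lambda phi_n w.

    Assumption (not from the abstract): phi_n is Hermitian positive definite,
    so the iteration w <- phi_n^{-1} phi_x w is well defined.
    """
    m = phi_x.shape[0]
    w = np.ones(m, dtype=complex)  # arbitrary non-zero starting vector
    for _ in range(num_iters):
        # One power-iteration step on phi_n^{-1} phi_x, without forming the inverse.
        w = np.linalg.solve(phi_n, phi_x @ w)
        w /= np.linalg.norm(w)  # renormalize to avoid overflow/underflow
    return w


def transfer_function_ratios(phi_x, phi_n, ref=0, num_iters=100):
    """Estimate transfer function ratios relative to microphone `ref`.

    Under the assumed rank-one model, phi_n @ w is proportional to the
    acoustic transfer function vector h, so dividing by its reference
    entry removes the unknown scale.
    """
    w = principal_generalized_eigenvector(phi_x, phi_n, num_iters)
    h_scaled = phi_n @ w
    return h_scaled / h_scaled[ref]


# Synthetic single-bin example with M = 4 microphones (illustrative data only).
rng = np.random.default_rng(0)
M = 4
h_true = rng.standard_normal(M) + 1j * rng.standard_normal(M)
a = rng.standard_normal((M, M)) + 1j * rng.standard_normal((M, M))
phi_n = a @ a.conj().T + M * np.eye(M)                  # noise PSD matrix
phi_x = 10.0 * np.outer(h_true, h_true.conj()) + phi_n  # speech-plus-noise PSD matrix

tfr_est = transfer_function_ratios(phi_x, phi_n)
tfr_true = h_true / h_true[0]
print(np.max(np.abs(tfr_est - tfr_true)))
```

In an adaptive setting, one would typically run only one or a few such iterations per frame, starting from the previous frame's eigenvector and using recursively updated PSD estimates; the sketch above omits that tracking and simply iterates to convergence on fixed matrices.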
