Paper information - Speech Enhancement With a GSC-Like Structure Employing Eigenvector-Based Transfer Function Ratios Estimation

In this paper, we present a novel blocking matrix and fixed beamformer design for a generalized sidelobe canceler for speech enhancement in a reverberant enclosure. Both are based on a new method for estimating the acoustic transfer function ratios in the presence of stationary noise. The estimation method relies on solving a generalized eigenvalue problem in each frequency bin. Adaptive eigenvector tracking based on the power iteration method is employed and shown to achieve a high convergence speed. Simulation results demonstrate that the proposed beamformer achieves better noise and interference reduction and lower speech distortion than other blocking matrix designs from the literature.
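The estimation idea can be illustrated for a single frequency bin with a minimal sketch. It is not the paper's adaptive tracking algorithm: it runs a plain batch power iteration, and it assumes a rank-one speech model, i.e. that the microphone-signal covariance is `phi_xx = sigma2 * h h^H + phi_nn` with a known stationary noise covariance `phi_nn`. Under that assumption the principal generalized eigenvector `f` of the pair `(phi_xx, phi_nn)` satisfies `phi_nn @ f ∝ h`, so dividing by a reference element yields the transfer function ratios. The function name `estimate_tf_ratios` and all parameter names are hypothetical.

```python
import numpy as np


def estimate_tf_ratios(phi_xx, phi_nn, ref=0, n_iter=50):
    """Sketch: transfer function ratios for one frequency bin.

    Assumes phi_xx = sigma2 * h h^H + phi_nn (rank-one speech model)
    and a Hermitian positive definite noise covariance phi_nn.
    """
    m = phi_xx.shape[0]
    # Start vector; it must not be orthogonal to h (assumed here).
    f = np.ones(m, dtype=complex)
    for _ in range(n_iter):
        # One power-iteration step on phi_nn^{-1} phi_xx,
        # computed with a linear solve instead of an explicit inverse.
        f = np.linalg.solve(phi_nn, phi_xx @ f)
        f /= np.linalg.norm(f)
    # Under the rank-one model, phi_nn @ f is proportional to h.
    h_scaled = phi_nn @ f
    # Normalizing by the reference microphone removes the unknown scale.
    return h_scaled / h_scaled[ref]


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    m = 4  # number of microphones (illustrative value)
    h = rng.standard_normal(m) + 1j * rng.standard_normal(m)
    a = rng.standard_normal((m, m)) + 1j * rng.standard_normal((m, m))
    phi_nn = a @ a.conj().T + m * np.eye(m)  # synthetic noise covariance
    phi_xx = 5.0 * np.outer(h, h.conj()) + phi_nn
    ratios = estimate_tf_ratios(phi_xx, phi_nn)
    print(np.max(np.abs(ratios - h / h[0])))  # estimation error
```

In this toy setting the non-principal generalized eigenvalues all equal one, so the iteration converges faster as the speech-to-noise ratio grows. A real system would estimate both covariance matrices from data and update the eigenvector recursively per frame, which this sketch does not attempt.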
