Identification of the Relative Transfer Function between Sensors in the Short-Time Fourier Transform Domain

In this chapter, we delve into the problem of relative transfer function (RTF) identification. First, we focus on identification algorithms that exploit specific properties of the input data. In particular, we exploit the non-stationarity of speech signals and the existence of segments where speech is absent in arbitrary utterances. Second, we explore approaches that aim at better modeling the signals and systems. We describe a common approach to represent a linear convolution in the short-time Fourier transform (STFT) domain as a multiplicative transfer function (MTF). Then, we present a new modeling approach for a linear convolution in the STFT domain as a convolution transfer function (CTF). The new approach is associated with larger model complexity and enables better representation of the signals and systems in the STFT domain. Then, we employ RTF identification algorithms based on the new model, and demonstrate improved results.

[1]  Sharon Gannot,et al.  Time difference of arrival estimation of speech source in a noisy and reverberant environment , 2005, Signal Process..

[2]  Israel Cohen,et al.  System Identification in the Short-Time Fourier Transform Domain With Crossband Filtering , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[3]  Jacob Benesty,et al.  A Minimum Distortion Noise Reduction Algorithm With Multiple Microphones , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[4]  Akihiko Sugiyama,et al.  A robust adaptive beamformer for microphone arrays with a blocking matrix using constrained adaptive filters , 1999, IEEE Trans. Signal Process..

[5]  Eap Emanuël Habets Single- and multi-microphone speech dereverberation using spectral enhancement , 2007 .

[6]  Israel Cohen,et al.  Dual-Source Transfer-Function Generalized Sidelobe Canceller , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[7]  Jont B. Allen,et al.  Image method for efficiently simulating small‐room acoustics , 1976 .

[8]  Israel Cohen,et al.  Relative Transfer Function Identification Using Convolutive Transfer Function Approximation , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[9]  Sharon Gannot,et al.  Theoretical Performance Analysis of the General Transfer Function GSC , 2002 .

[10]  S. Gannot,et al.  Speech enhancement based on the general transfer function GSC and postfiltering , 2004, IEEE Trans. Speech Audio Process..

[11]  Ehud Weinstein,et al.  System identification using nonstationary signals , 1996, IEEE Trans. Signal Process..

[12]  Israel Cohen,et al.  Relative transfer function identification using speech signals , 2004, IEEE Transactions on Speech and Audio Processing.

[13]  Israel Cohen,et al.  On Multiplicative Transfer Function Approximation in the Short-Time Fourier Transform Domain , 2007, IEEE Signal Processing Letters.

[14]  Ehud Weinstein,et al.  Signal enhancement using beamforming and nonstationarity with applications to speech , 2001, IEEE Trans. Signal Process..

[15]  Israel Cohen,et al.  Joint noise reduction and acoustic echo cancellation using the transfer-function generalized sidelobe canceller , 2007, Speech Commun..