An informed MMSE filter based on multiple instantaneous direction-of-arrival estimates

Sound acquisition in noisy and reverberant conditions where the acoustic scene changes rapidly remains a challenging task. In this work, we consider the problem of obtaining a desired, arbitrary spatial response for at most L sound sources being simultaneously active per time-frequency instant. We propose a minimum mean-squared error spatial filter that adapts quickly to changes in the acoustic scene by incorporating instantaneous parametric information on the sound field. In addition, an estimator for the power spectral densities of the L sources is developed that exhibits a sufficiently high temporal and spectral resolution to achieve both dereverberation and noise reduction. Simulation results demonstrate that a strong attenuation of undesired noise and interfering components can be achieved with a tolerable amount of signal distortion.

[1]  Giovanni Del Galdo,et al.  Spatial filtering using directional audio coding parameters , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[2]  Emanuel A. P. Habets,et al.  A Two-Stage Beamforming Approach for Noise Reduction and Dereverberation , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[3]  Giovanni Del Galdo,et al.  Generating virtual microphone signals using geometrical information gathered by distributed arrays , 2011, 2011 Joint Workshop on Hands-free Speech Communication and Microphone Arrays.

[4]  Thomas Kailath,et al.  ESPRIT-estimation of signal parameters via rotational invariance techniques , 1989, IEEE Trans. Acoust. Speech Signal Process..

[5]  Emanuel A. P. Habets,et al.  MMSE-Based Blind Source Extraction in Diffuse Noise Fields Using a Complex Coherence-Based a Priori SAP Estimator , 2012, IWAENC.

[6]  Emanuel A. P. Habets,et al.  Sound Field Model Violations in Parametric Spatial Sound Processing , 2012, IWAENC.

[7]  R. K. Cook,et al.  Measurement of Correlation Coefficients in Reverberant Sound Fields , 1955 .

[8]  Akihiko Sugiyama,et al.  A robust adaptive beamformer for microphone arrays with a blocking matrix using constrained adaptive filters , 1999, IEEE Trans. Signal Process..

[9]  Israel Cohen,et al.  Convolutive Transfer Function Generalized Sidelobe Canceler , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[10]  Bhaskar D. Rao,et al.  Performance analysis of Root-Music , 1989, IEEE Trans. Acoust. Speech Signal Process..

[11]  Jont B. Allen,et al.  Image method for efficiently simulating small‐room acoustics , 1976 .

[12]  O. L. Frost,et al.  An algorithm for linearly constrained adaptive array processing , 1972 .

[13]  Israel Cohen,et al.  Dual-Source Transfer-Function Generalized Sidelobe Canceller , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[14]  Ivan Tashev,et al.  Microphone Array for Headset with Spatial Noise Suppressor , 2005 .

[15]  Ehud Weinstein,et al.  Signal enhancement using beamforming and nonstationarity with applications to speech , 2001, IEEE Trans. Signal Process..

[16]  Oliver Thiergart,et al.  An informed LCMV filter based on multiple instantaneous direction-of-arrival estimates , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[17]  Sven Nordholm,et al.  Adaptive array noise suppression of handsfree speaker input in cars , 1993 .

[18]  Israel Cohen,et al.  Multichannel Eigenspace Beamforming in a Reverberant Noisy Environment With Multiple Interfering Speech Signals , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[19]  Reinhold Häb-Umbach,et al.  Speech Enhancement With a GSC-Like Structure Employing Eigenvector-Based Transfer Function Ratios Estimation , 2011, IEEE Transactions on Audio, Speech, and Language Processing.