DOA estimation in noisy environment with unknown noise power using the EM algorithm

A direction of arrival (DOA) estimator for concurrent speakers in a noisy environment with unknown noise power is presented. Spatially colored noise, if not properly addressed, is known to degrade the performance of DOA estimators. In our contribution, the DOA estimation task is formulated as a maximum likelihood (ML) problem, which is solved using the expectation-maximization (EM) procedure. The received microphone signals are modelled as a sum of the speech and noise components. The noise power spectral density (PSD) matrix is modelled by a time-invariant full-rank coherence matrix multiplied by the noise power. The PSDs of the speech and noise components are estimated as part of the EM procedure. The benefit of the presented algorithm in a simulated noisy environment using measured room impulse responses is demonstrated.

[1]  Hao Ye,et al.  Maximum likelihood DOA estimation and asymptotic Cramer-Rao bounds for additive unknown colored noise , 1995, IEEE Trans. Signal Process..

[2]  Emanuel A. P. Habets,et al.  Multiple DOA estimation and blind source separation using estimation-maximization , 2016, 2016 IEEE International Conference on the Science of Electrical Engineering (ICSEE).

[3]  Michael S. Brandstein,et al.  Microphone Arrays - Signal Processing Techniques and Applications , 2001, Microphone Arrays.

[4]  E. Habets,et al.  Generating sensor signals in isotropic noise fields. , 2007, The Journal of the Acoustical Society of America.

[5]  Peter Vary,et al.  Multichannel audio database in various acoustic environments , 2014, 2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC).

[6]  Sharon Gannot,et al.  Speaker Tracking Using Recursive EM Algorithms , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[7]  Thomas Hofmann,et al.  An EM Algorithm for Localizing Multiple Sound Sources in Reverberant Environments , 2007 .

[8]  Sergiy A. Vorobyov,et al.  Maximum likelihood direction-of-arrival estimation in unknown noise fields using sparse sensor arrays , 2005, IEEE Transactions on Signal Processing.

[9]  Scott Rickard,et al.  Blind separation of speech mixtures via time-frequency masking , 2004, IEEE Transactions on Signal Processing.

[10]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .