Time-difference of arrival model for spherical microphone arrays and application to direction of arrival estimation

This paper investigates different steering techniques for spherical microphone arrays and proposes a time-difference of arrival (TDOA) model for microphones on surface of a rigid sphere. The model is based on geometric interpretation of wavefront incident angle and the extra distance the wavefront needs to travel to reach microphones on the opposite side of a sphere. We evaluate the proposed model by comparing analytic TDOAs to measured TDOAs extracted from impulse responses (IR) of a rigid sphere (r = 7.5cm). The proposed method achieves over 40% relative improvement in TDOA accuracy in comparison to free-field propagation and TDOAs extracted from analytic IRs of a spherical microphone array provide an additional 10% improvement. We test the proposed model for the application of source direction of arrival (DOA) estimation using steered response power (SRP) with real reverberant recordings of moving speech sources. All tested methods perform equally well in noise-free scenario, while the proposed model and simulated IRs improve over free-field assumption in low SNR conditions. The proposed model has the benefit of only using single delay for steering the array.

[1]  Jacob Benesty,et al.  Steered Beamforming Approaches for Acoustic Source Localization , 2010 .

[2]  Jacob Benesty,et al.  Direction of Arrival Estimation using Eigenanalysis of the Parameterized Spatial Correlation Matrix , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[3]  Michael S. Brandstein,et al.  Robust Localization in Reverberant Rooms , 2001, Microphone Arrays.

[4]  Archontis Politis,et al.  Microphone array processing for parametric spatial audio techniques , 2016 .

[5]  Jacob Benesty,et al.  A Generalized Steered Response Power Method for Computationally Viable Source Localization , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[6]  B. Rafaely,et al.  Sound-field analysis by plane-wave decomposition using spherical microphone array , 2005 .

[7]  Jacob Benesty,et al.  Direction of Arrival Estimation Using the Parameterized Spatial Correlation Matrix , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  R. Duda,et al.  Range dependence of the response of a spherical head model , 1998 .

[9]  Tuomas Virtanen,et al.  Ieee Transactions on Audio, Speech and Language Processing Direction of Arrival Based Spatial Covariance Model for Blind Sound Source Separation , 2022 .

[10]  John W. McDonough,et al.  Kalman Filters for Time Delay of Arrival-Based Source Localization , 2005, EURASIP J. Adv. Signal Process..

[11]  Paris Smaragdis,et al.  Directional NMF for joint source localization and separation , 2015, 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[12]  Gary W. Elko,et al.  A highly scalable spherical microphone array based on an orthonormal decomposition of the soundfield , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[13]  Walter Kellermann,et al.  Robust localization of multiple sources in reverberant environments using EB-ESPRIT with spherical microphone arrays , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[14]  Ivan Tashev,et al.  Sound Capture and Processing: Practical Approaches , 2009 .