Direction of Arrival Estimation of Reflections from Room Impulse Responses Using a Spherical Microphone Array

This paper studies the direction of arrival estimation of reflections in short time windows of room impulse responses measured with a spherical microphone array. Spectral-based methods, such as multiple signal classification (MUSIC) and beamforming, are commonly used in the analysis of spatial room impulse responses. However, the room acoustic reflections are highly correlated or even coherent in a single analysis window and this imposes limitations on the use of spectral-based methods. Here, we apply maximum likelihood (ML) methods, which are suitable for direction of arrival estimation of coherent reflections. These methods have been earlier developed in the linear space domain and here we present the ML methods in the context of spherical microphone array processing and room impulse responses. Experiments are conducted with simulated and real data using the em32 Eigenmike. The results show that direction estimation with ML methods is more robust against noise and less biased than MUSIC or beamforming.

[1]  Angelo Farina,et al.  Simultaneous Measurement of Impulse Response and Distortion with a Swept-Sine Technique , 2000 .

[2]  Boaz Rafaely,et al.  Acoustic analysis by spherical microphone array processing of room impulse responses. , 2012, The Journal of the Acoustical Society of America.

[3]  Juha Merimaa,et al.  Spatial Impulse Response Rendering I: Analysis and Synthesis , 2005 .

[4]  S. Unnikrishna Pillai,et al.  Forward/backward spatial smoothing techniques for coherent signal identification , 1989, IEEE Trans. Acoust. Speech Signal Process..

[5]  B. Rafaely Plane-wave decomposition of the sound field on a sphere by spherical convolution , 2004 .

[6]  Ying Han,et al.  Spatial difference smoothing for DOA estimation of coherent signals , 2005, IEEE Signal Process. Lett..

[7]  Zhongfu Ye,et al.  Efficient Method of DOA Estimation for Uncorrelated and Coherent Signals , 2008, IEEE Antennas and Wireless Propagation Letters.

[8]  Angelo Farina,et al.  Spatial Analysis of Room Impulse Responses Captured with a 32-Capsule Microphone Array , 2011 .

[9]  Ramani Duraiswami,et al.  Imaging concert hall acoustics using visual and audio cameras , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[10]  Boaz Rafaely,et al.  Analysis and design of spherical microphone arrays , 2005, IEEE Transactions on Speech and Audio Processing.

[11]  Ramani Duraiswami,et al.  Flexible and Optimal Design of Spherical Microphone Arrays for Beamforming , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[12]  Sverre Holm,et al.  Robust 3-D Sound Source Localization Using Spherical Microphone Arrays , 2013 .

[13]  Xian-Da Zhang,et al.  An ESPRIT-like algorithm for coherent DOA estimation , 2005, IEEE Antennas and Wireless Propagation Letters.

[14]  Juha Merimaa,et al.  Measurement, Analysis, and Visualization of Directional Room Responses , 2001 .

[15]  Sverre Holm,et al.  Transformation Between Uniform Linear and Spherical Microphone Arrays With Symmetric Responses , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[16]  Craig T. Jin,et al.  Design, Optimization and Evaluation of a Dual-Radius Spherical Microphone Array , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[17]  Stephanie Bertet,et al.  3D Sound Field Recording with Higher Order Ambisonics – Objective Measurements and Validation of a 4th order Spherical Microphone , 2006 .

[18]  M. Viberg,et al.  Two decades of array signal processing research: the parametric approach , 1996, IEEE Signal Process. Mag..

[19]  Heinrich Kuttruff,et al.  Room acoustics , 1973 .

[20]  Gary W. Elko,et al.  A highly scalable spherical microphone array based on an orthonormal decomposition of the soundfield , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[21]  Björn E. Ottersten,et al.  Direction-of-arrival estimation for wide-band signals using the ESPRIT algorithm , 1990, IEEE Trans. Acoust. Speech Signal Process..

[22]  Rajesh M. Hegde,et al.  Stochastic Cramér-Rao Bound Analysis for DOA Estimation in Spherical Harmonics Domain , 2015, IEEE Signal Processing Letters.

[23]  Bjorn Ottersten,et al.  Exact and Large Sample ML Techniques for Parameter Estimation and Detection in Array Processing , 1993 .

[24]  S. Kay Fundamentals of statistical signal processing: estimation theory , 1993 .

[25]  Walter Kellermann,et al.  Comparison of subspace-based and steered beamformer-based reflection localization methods , 2011, 2011 19th European Signal Processing Conference.

[26]  G. Carter Coherence and time delay estimation , 1987, Proceedings of the IEEE.

[27]  Thushara D. Abhayapala,et al.  Theory and design of high order sound field microphones using spherical microphone array , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[28]  Tapio Lokki,et al.  Spatial Decomposition Method for Room Impulse Responses , 2013 .

[29]  Tapio Lokki,et al.  Analysis of concert hall acoustics via visualizations of time-frequency and spatiotemporal responses. , 2013, The Journal of the Acoustical Society of America.

[30]  B. Rafaely,et al.  Sound-field analysis by plane-wave decomposition using spherical microphone array , 2005 .

[31]  Boaz Rafaely,et al.  Spherical array processing for acoustic analysis using room impulse responses and time-domain smoothing. , 2013, The Journal of the Acoustical Society of America.

[32]  Hong Wang,et al.  Coherent signal-subspace processing for the detection and estimation of angles of arrival of multiple wide-band sources , 1985, IEEE Trans. Acoust. Speech Signal Process..

[33]  Ilan Ziskind,et al.  Maximum likelihood localization of multiple sources by alternating projection , 1988, IEEE Trans. Acoust. Speech Signal Process..

[34]  Boaz Rafaely,et al.  Phase-mode versus delay-and-sum spherical microphone array processing , 2005, IEEE Signal Processing Letters.

[35]  Walter Kellermann,et al.  Robust localization of multiple sources in reverberant environments using EB-ESPRIT with spherical microphone arrays , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[36]  Mostafa Kaveh,et al.  Focussing matrices for coherent signal-subspace processing , 1988, IEEE Trans. Acoust. Speech Signal Process..