Augmented Intensity Vectors for Direction of Arrival Estimation in the Spherical Harmonic Domain

Pseudointensity vectors (PIVs) provide a means of direction of arrival (DOA) estimation for spherical microphone arrays using only the zeroth and the first-order spherical harmonics. An augmented intensity vector (AIV) is proposed which improves the accuracy of PIVs by exploiting higher order spherical harmonics. We compared DOA estimation using our proposed AIVs against PIVs, steered response power (SRP) and subspace methods where the number of sources, their angular separation, the reverberation time of the room and the sensor noise level are varied. The results show that the proposed approach outperforms the baseline methods and performs at least as accurately as the state-of-the-art method with strong robustness to reverberation, sensor noise, and number of sources. In the single and multiple source scenarios tested, which include realistic levels of reverberation and noise, the proposed method had average error of <inline-formula><tex-math notation="LaTeX">$1.5^{\circ }$</tex-math></inline-formula> and <inline-formula> <tex-math notation="LaTeX">$2^{\circ }$</tex-math></inline-formula>, respectively.

[1]  Jont B. Allen,et al.  Image method for efficiently simulating small‐room acoustics , 1976 .

[2]  Sakari Tervo Direction estimation based on sound intensity vectors , 2009, 2009 17th European Signal Processing Conference.

[3]  B.D. Van Veen,et al.  Beamforming: a versatile approach to spatial filtering , 1988, IEEE ASSP Magazine.

[4]  Athanasios Mouchtaris,et al.  3D localization of multiple audio sources utilizing 2D DOA histograms , 2016, 2016 24th European Signal Processing Conference (EUSIPCO).

[5]  Craig T. Jin,et al.  Direction of arrival estimation for spherical microphone arrays by combination of independent component analysis and sparse recovery , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[6]  Sina Hafezi,et al.  Multiple source localization in the spherical harmonic domain using augmented intensity vectors based on grid search , 2016, 2016 24th European Signal Processing Conference (EUSIPCO).

[7]  Alastair H. Moore,et al.  Direction of Arrival Estimation in the Spherical Harmonic Domain Using Subspace Pseudointensity Vectors , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[8]  Thomas Kailath,et al.  ESPRIT-estimation of signal parameters via rotational invariance techniques , 1989, IEEE Trans. Acoust. Speech Signal Process..

[9]  Patrick A. Naylor,et al.  Speech Dereverberation , 2010 .

[10]  Shefeng Yan,et al.  Optimal Modal Beamforming for Spherical Microphone Arrays , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[11]  Boaz Rafaely,et al.  Coherent signals direction-of-arrival estimation using a spherical microphone array: Frequency smoothing approach , 2009, 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[12]  Xiaojun Qiu,et al.  A maximum likelihood direction of arrival estimation method for open-sphere microphone arrays in the spherical harmonic domain. , 2015, The Journal of the Acoustical Society of America.

[13]  Alastair H. Moore,et al.  Multiple source localisation in the spherical harmonic domain , 2014, 2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC).

[14]  Kung Yao,et al.  Maximum-likelihood source localization and unknown sensor location estimation for wideband signals in the near-field , 2002, IEEE Trans. Signal Process..

[15]  Boaz Rafaely,et al.  Analysis and design of spherical microphone arrays , 2005, IEEE Transactions on Speech and Audio Processing.

[16]  Jonathan G. Fiscus,et al.  Darpa Timit Acoustic-Phonetic Continuous Speech Corpus CD-ROM {TIMIT} | NIST , 1993 .

[17]  H. C. Schau,et al.  Passive source localization employing intersecting spherical surfaces from time-of-arrival differences , 1987, IEEE Trans. Acoust. Speech Signal Process..

[18]  Boaz Rafaely,et al.  Spatial Aliasing in Spherical Microphone Arrays , 2007, IEEE Transactions on Signal Processing.

[19]  Emanuel A. P. Habets,et al.  3D source localization in the spherical harmonic domain using a pseudointensity vector , 2010, 2010 18th European Signal Processing Conference.

[20]  Boaz Rafaely The Spherical-Shell Microphone Array , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[21]  Emanuel A. P. Habets,et al.  Simulating room impulse responses for spherical microphone arrays , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[22]  Archontis Politis,et al.  Direction of Arrival Estimation of Reflections from Room Impulse Responses Using a Spherical Microphone Array , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[23]  Craig Jin,et al.  Independent Component Analysis Using Spherical Microphone Arrays , 2012 .

[24]  Jens Meyer,et al.  Spherical Microphone Array for Spatial Sound Recording , 2002 .

[25]  Boaz Rafaely,et al.  The nearfield spherical microphone array , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[26]  Alastair H. Moore,et al.  Direction of arrival estimation using pseudo-intensity vectors with direct-path dominance test , 2015, 2015 23rd European Signal Processing Conference (EUSIPCO).

[27]  B. Rafaely Plane-wave decomposition of the sound field on a sphere by spherical convolution , 2004 .

[28]  Ramani Duraiswami,et al.  Flexible and Optimal Design of Spherical Microphone Arrays for Beamforming , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[29]  Boaz Rafaely,et al.  Fundamentals of Spherical Array Processing , 2015, Springer Topics in Signal Processing.

[30]  Carla Teixeira Lopes,et al.  TIMIT Acoustic-Phonetic Continuous Speech Corpus , 2012 .

[31]  Sailes K. Sengijpta Fundamentals of Statistical Signal Processing: Estimation Theory , 1995 .

[32]  Boaz Rafaely,et al.  Localization of Multiple Speakers under High Reverberation using a Spherical Microphone Array and the Direct-Path Dominance Test , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[33]  Thushara D. Abhayapala,et al.  Theory and design of high order sound field microphones using spherical microphone array , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[34]  R. O. Schmidt,et al.  Multiple emitter location and signal Parameter estimation , 1986 .

[35]  Sina Hafezi,et al.  3D acoustic source localization in the spherical harmonic domain based on optimized grid search , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[36]  Holography Book,et al.  Fourier Acoustics Sound Radiation And Nearfield Acoustical Holography , 2016 .

[37]  Sina Hafezi,et al.  Multiple source localization using Estimation Consistency in the Time-Frequency domain , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[38]  Athanasios Mouchtaris,et al.  3D DOA estimation of multiple sound sources based on spatially constrained beamforming driven by intensity vectors , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[39]  Boaz Rafaely,et al.  Open-Sphere Designs for Spherical Microphone Arrays , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[40]  Walter Kellermann,et al.  EB-ESPRIT: 2D localization of multiple wideband acoustic sources using eigen-beams , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[41]  Athanasios Mouchtaris,et al.  3D localization of multiple sound sources with intensity vector estimates in single source zones , 2015, 2015 23rd European Signal Processing Conference (EUSIPCO).

[42]  Gary W. Elko,et al.  Spherical Microphone Arrays for 3D Sound Recording , 2004 .

[43]  Boaz Rafaely,et al.  Near-Field Spherical Microphone Array Processing With Radial Filtering , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[44]  E. Habets,et al.  On the angular error of intensity vector based direction of arrival estimation in reverberant sound fields. , 2010, The Journal of the Acoustical Society of America.

[45]  Boaz Rafaely,et al.  Spherical Microphone Array Beamforming , 2010 .

[46]  Boaz Rafaely,et al.  Acoustic analysis by spherical microphone array processing of room impulse responses. , 2012, The Journal of the Acoustical Society of America.

[47]  Boaz Rafaely,et al.  Phase-mode versus delay-and-sum spherical microphone array processing , 2005, IEEE Signal Processing Letters.

[48]  Ramani Duraiswami,et al.  Plane-Wave Decomposition of Acoustical Scenes Via Spherical and Cylindrical Microphone Arrays , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[49]  Alastair H. Moore,et al.  Direction of Arrival Estimation in the Spherical Harmonic Domain using Subspace Pseudo-Intensity Vectors , 2016 .