Closed-Form DOA Estimation Using First-Order Differential Microphone Arrays via Joint Temporal-Spectral-Spatial Processing

Sound source direction-of-arrival (DOA) estimation in reverberant environments with low-computational complexity remains a challenging problem, especially for small-sized microphone arrays. To address the problem, a promising method is based on the sound intensity (SI) measurement using differential microphone arrays (DMAs), which can give a closed-form solution and, hence, is computationally efficient. Unfortunately, the SI-based method has been shown to be sensitive to room reverberation, and therefore, more works need to be done to improve its performance in reverberant environments. In this paper, we propose an SI-based closed-form DOA estimation algorithm through the joint temporal-spectral-spatial processing by using two orthogonal first-order DMAs. The proposed method consists of two stages. In the first stage, it focuses on temporal-spectral processing to improve the SI-based DOA estimation, and a preliminary DOA estimate is made through a dual-threshold technique inspired by the local DOA variance weighting scheme in the time-frequency domain. In the second stage, the preliminary DOA estimate is then refined through the spatial processing, where the two first-order orthogonal DMAs are further utilized to construct a first-order steerable DMA for dereverberation, which is shown to be useful when the signal to noise ratio is not too low. Simulations and real-world experiments in reverberant environments have demonstrated the superior performance of the proposed closed-form DOA estimation method.

[1]  F. Fahy Measurement of acoustic intensity using the cross‐spectral density of two microphone signals , 1977 .

[2]  Atiyeh Alinaghi,et al.  Reverberant speech separation with probabilistic time-frequency masking for B-format recordings , 2015, Speech Commun..

[3]  Yannick Deville,et al.  A time-frequency blind signal separation method applicable to underdetermined mixtures of dependent sources , 2005, Signal Process..

[4]  Emanuel A. P. Habets,et al.  3D source localization in the spherical harmonic domain using a pseudointensity vector , 2010, 2010 18th European Signal Processing Conference.

[5]  Krishnaraj M. Varma,et al.  Time-Delay-Estimate Based Direction-of-Arrival Estimation for Speech in Reverberant Environments , 2002 .

[6]  Jacob Benesty,et al.  Direction of Arrival Estimation Using the Parameterized Spatial Correlation Matrix , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[7]  Jingdong Chen,et al.  Time Difference of Arrival Estimation Exploiting Multichannel Spatio-Temporal Prediction , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  Michael S. Brandstein,et al.  A closed-form location estimator for use with room environment microphone arrays , 1997, IEEE Trans. Speech Audio Process..

[9]  Athanasios Mouchtaris,et al.  3D localization of multiple sound sources with intensity vector estimates in single source zones , 2015, 2015 23rd European Signal Processing Conference (EUSIPCO).

[10]  Dovid Levin,et al.  Impact of source signal coloration on intensity vector based DOA estimation , 2010 .

[11]  Robert Hickling,et al.  Use of pitch‐azimuth plots in determining the direction of a noise source in water with a vector sound‐intensity probe , 1995 .

[12]  Scott Rickard,et al.  Blind separation of speech mixtures via time-frequency masking , 2004, IEEE Transactions on Signal Processing.

[13]  Sven Nordholm,et al.  A novel fuzzy clustering algorithm using observation weighting and context information for reverberant blind speech separation , 2010, Signal Process..

[14]  K. C. Ho,et al.  A simple and efficient estimator for hyperbolic location , 1994, IEEE Trans. Signal Process..

[15]  Amir Said,et al.  A Steered-Response Power Algorithm Employing Hierarchical Search for Acoustic Source Localization Using Microphone Arrays , 2014, IEEE Transactions on Signal Processing.

[16]  Ahmet M. Kondoz,et al.  Acoustic Source Separation of Convolutive Mixtures Based on Intensity Vector Statistics , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[17]  A. A. Handzel,et al.  Biomimetic sound-source localization , 2002 .

[18]  E. Habets,et al.  On the angular error of intensity vector based direction of arrival estimation in reverberant sound fields. , 2010, The Journal of the Acoustical Society of America.

[19]  Sergio Silvestri,et al.  Calibration and Uncertainty Evaluation Using Monte Carlo Method of a Simple 2D Sound Localization System , 2013, IEEE Sensors Journal.

[20]  T. Engin Tuncer,et al.  Classical and Modern Direction-of-Arrival Estimation , 2009 .

[21]  Joseph H. DiBiase A High-Accuracy, Low-Latency Technique for Talker Localization in Reverberant Environments Using Microphone Arrays , 2000 .

[22]  Alper Bozkurt,et al.  Sound Localization Sensors for Search and Rescue Biobots , 2016, IEEE Sensors Journal.

[23]  Emanuel A. P. Habets,et al.  Multiple-Hypothesis Extended Particle Filter for Acoustic Source Localization in Reverberant Environments , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[24]  Robert Hickling,et al.  Finding the direction of a sound source using a vector sound‐intensity probe , 1993 .

[25]  René Martinus Maria Derkx,et al.  Theoretical Analysis of a First-Order Azimuth-Steerable Superdirective Microphone Array , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[26]  Walter Kellermann,et al.  Beamforming for Speech and Audio Signals , 2008 .

[27]  Hüseyin Hacihabiboglu,et al.  Theoretical Analysis of Open Spherical Microphone Arrays for Acoustic Intensity Measurements , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[28]  Jont B. Allen,et al.  Image method for efficiently simulating small‐room acoustics , 1976 .

[29]  Sven Nordholm,et al.  Robust Source Localization in Reverberant Environments Based on Weighted Fuzzy Clustering , 2009, IEEE Signal Processing Letters.

[30]  Jacob Benesty,et al.  Robust time delay estimation exploiting redundancy among multiple microphones , 2003, IEEE Trans. Speech Audio Process..

[31]  Michael S. Brandstein,et al.  Robust Localization in Reverberant Rooms , 2001, Microphone Arrays.

[32]  Gary W. Elko,et al.  Superdirectional microphone arrays , 2000 .

[33]  Hong Wang,et al.  Voice source localization for automatic camera pointing system in videoconferencing , 1997, Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics.