Real Time High Accuracy 3-D PHAT-Based Sound Source Localization Using a Simple 4-Microphone Arrangement

This paper investigates wideband sound source localization in outdoor cases. In such cases, time difference of arrival (TDOA)-based methods are commonly used for 2-D and 3-D wideband sound source localization. These methods have lower accuracy in comparison with direction of arrival-based approaches. However, they feature fewer microphones and less computation time. High accuracy sound source localization using these methods needs highly accurate time delay measurement, and therefore, high frequency signal sampling rates. Moreover, the need to use numerical analysis methods for local calculations (solving nonlinear equations of closed-form methods) will increase computation time while the calculations may still not converge. Also, a good initial guess close to the true solution is needed to avoid local minima. In this paper, a simple, fast (real time) and accurate pure geometric phase transform-based exact location calculation approach for 3-D localization of wideband sound sources in outdoor far-field and low degree reverberation cases, using only four microphones, is proposed. Based on the proposed method, a simple arrangement of microphones is implemented. Experimental results show that the proposed method has more accuracy and less computation time simultaneously, in comparison with previous closed-form hyperbolic intersection and other TDOA-based state-of-the-art location calculation methods, overcoming their major weaknesses. Also, as the nonlinear closed-form equations are linearized, no initial guess is required. It features less than 0.2° error for angle of arrival, less than 5% error for 3-D location finding, and computation times as low as 250 ms for the localization of a typical wideband sound source such as a flying object (helicopter).

[1]  H. C. Schau,et al.  Passive source localization employing intersecting spherical surfaces from time-of-arrival differences , 1987, IEEE Trans. Acoust. Speech Signal Process..

[2]  K. C. Ho,et al.  Passive Source Localization Using Time Differences of Arrival and Gain Ratios of Arrival , 2008, IEEE Transactions on Signal Processing.

[3]  Zhengyou Zhang,et al.  Why does PHAT work well in lownoise, reverberative environments? , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[4]  Sheldon Razin Explicit (Noniterative) Loran Solution , 1967 .

[5]  K. C. Ho,et al.  An Accurate Algebraic Closed-Form Solution for Energy-Based Source Localization , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[6]  Maurizio Omologo,et al.  Acoustic source location in a three-dimensional space using crosspower spectrum phase , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7]  Jhing-Fa Wang,et al.  A Design of Far-Field Speaker Localization System Using Independent Component Analysis with Subspace Speech Enhancement , 2009, 2009 11th IEEE International Symposium on Multimedia.

[8]  Michael S. Brandstein,et al.  A closed-form location estimator for use with room environment microphone arrays , 1997, IEEE Trans. Speech Audio Process..

[9]  Harry Lee,et al.  A Novel Procedure for Assessing the Accuracy of Hyperbolic Multilateration Systems , 1975, IEEE Transactions on Aerospace and Electronic Systems.

[10]  Jian Li,et al.  Exact and Approximate Solutions of Source Localization Problems , 2008, IEEE Transactions on Signal Processing.

[11]  Julius O. Smith,et al.  Closed-form least-squares source location estimation from range-difference measurements , 1987, IEEE Trans. Acoust. Speech Signal Process..

[12]  Brian M. Sadler,et al.  A Simple Closed-Form Linear Source Localization Algorithm , 2007, MILCOM 2007 - IEEE Military Communications Conference.

[13]  G.B. Giannakis,et al.  Localization via ultra-wideband radios: a look at positioning aspects for future sensor networks , 2005, IEEE Signal Processing Magazine.

[14]  Ba-Ngu Vo,et al.  Tracking an unknown time-varying number of speakers using TDOA measurements: a random finite set approach , 2006, IEEE Transactions on Signal Processing.

[15]  R.L. Moses,et al.  Locating the nodes: cooperative localization in wireless sensor networks , 2005, IEEE Signal Processing Magazine.

[16]  Shantanu Chakrabartty,et al.  Far-Field Acoustic Source Localization and Bearing Estimation Using $\Sigma\Delta$ Learners , 2010, IEEE Transactions on Circuits and Systems I: Regular Papers.

[17]  W. R. Hahn Optimum signal processing for passive sonar range and bearing estimation , 1975 .

[18]  van J Etten NAVIGATION SYSTEMS. FUNDAMENTALS OF LOW- AND VERY-LOW-FREQUENCY HYPERBOLIC TECHNIQUES , 1970 .

[19]  G. Carter Time delay estimation for passive sonar signal processing , 1981 .

[20]  Alfred O. Hero,et al.  Locating the Nodes , 2005 .

[21]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .

[22]  Gordon L. Stüber,et al.  Subscriber location in CDMA cellular networks , 1998 .

[23]  La-or Kovavisaruch,et al.  Source Localization Using TDOA and FDOA Measurements in the Presence of Receiver Location Errors: Analysis and Solution , 2007, IEEE Transactions on Signal Processing.

[24]  Jacob Benesty,et al.  Real-time passive source localization: a practical linear-correction least-squares approach , 2001, IEEE Trans. Speech Audio Process..

[25]  B. T. Fang,et al.  Simple solutions for hyperbolic and related position fixes , 1990 .

[26]  Benjamin Friedlander,et al.  On the Cramer-Rao bound for time delay and Doppler estimation , 1984, IEEE Trans. Inf. Theory.

[27]  B. Friedlander A passive localization algorithm and its accuracy analysis , 1987 .

[28]  Li Sha,et al.  An algorithm for locating microseismic events , 2004, Canadian Conference on Electrical and Computer Engineering 2004 (IEEE Cat. No.04CH37513).

[29]  E. Weinstein Optimal source localization and tracking from passive array measurements , 1982 .

[30]  WADE FOY,et al.  Position-Location Solutions by Taylor-Series Estimation , 1976, IEEE Transactions on Aerospace and Electronic Systems.

[31]  Hong Wang,et al.  Voice source localization for automatic camera pointing system in videoconferencing , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[32]  Michael S. Brandstein,et al.  A practical methodology for speech source localization with microphone arrays , 1997, Comput. Speech Lang..

[33]  Hong Wang,et al.  Voice source localization for automatic camera pointing system in videoconferencing , 1997, Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics.

[34]  Maurizio Omologo,et al.  Acoustic event localization using a crosspower-spectrum phase based technique , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[35]  Erik G. Larsson,et al.  Accuracy Comparison of LS and Squared-Range LS for Source Localization , 2010, IEEE Transactions on Signal Processing.

[36]  Shigeki Sagayama,et al.  R-means localization: A simple iterative algorithm for range-difference-based source localization , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[37]  James B. Y. Tsui,et al.  Fundamentals of global positioning system receivers , 2000 .

[38]  R. Schmidt A New Approach to Geometry of Range Difference Location , 1972, IEEE Transactions on Aerospace and Electronic Systems.

[39]  J. Abel A divide and conquer approach to least-squares estimation , 1990 .

[40]  Shantanu Chakrabartty,et al.  Far-field Acoustic Source Localization and Bearing Estimation using Σ ∆ Learners , 2009 .

[41]  Julius O. Smith,et al.  Source range and depth estimation from multipath range difference measurements , 1989, IEEE Trans. Acoust. Speech Signal Process..

[42]  Eduardo Lleida,et al.  Robust continuous speech recognition system based on a microphone array , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[43]  Minsoo Hahn,et al.  Real-time sound localization using time difference for human robot interaction , 2005 .

[44]  G. M.,et al.  A Treatise on the Differential Geometry of Curves and Surfaces , 1910, Nature.

[45]  D. M. Y. Sommerville The elements of non-Euclidean geometry , 1914 .

[46]  Jean Rouat,et al.  Robust sound source localization using a microphone array on a mobile robot , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[47]  Volkan Cevher,et al.  Target Tracking Using a Joint Acoustic Video System , 2007, IEEE Transactions on Multimedia.

[48]  Benoît Champagne,et al.  Cepstral prefiltering for time delay estimation in reverberant environments , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[49]  Frankie K. W. Chan,et al.  Closed-Form Formulae for Time-Difference-of-Arrival Estimation , 2008, IEEE Transactions on Signal Processing.

[50]  Harvey F. Silverman,et al.  A Linear Closed-Form Algorithm for Source Localization From Time-Differences of Arrival , 2008, IEEE Signal Processing Letters.