Measurement Combination for Acoustic Source Localization in a Room Environment

The behavior of time delay estimation (TDE) is well understood and therefore attractive to apply in acoustic source localization (ASL). A time delay between microphones maps into a hyperbola. Furthermore, the likelihoods for different time delays are mapped into a set of weighted nonoverlapping hyperbolae in the spatial domain. Combining TDE functions from several microphone pairs results in a spatial likelihood function (SLF) which is a combination of sets of weighted hyperbolae. Traditionally, the maximum SLF point is considered as the source location but is corrupted by reverberation and noise. Particle filters utilize past source information to improve localization performance in such environments. However, uncertainty exists on how to combine the TDE functions. Results from simulated dialogues in various conditions favor TDE combination using intersection-based methods over union. The real-data dialogue results agree with the simulations, showing a 45% RMSE reduction when choosing the intersection over union of TDE functions.

[1]  Parham Aarabi,et al.  Robust sound localization using conditional time-frequency histograms , 2003, Inf. Fusion.

[2]  R. Moses,et al.  Acoustic time delay estimation and sensor network self-localization: Experimental results , 2005 .

[3]  Jacob Benesty,et al.  Time Delay Estimation in Room Acoustic Environments: An Overview , 2006, EURASIP J. Adv. Signal Process..

[4]  N. Bershad,et al.  Time delay estimation using the LMS adaptive filter--Dynamic behavior , 1981 .

[5]  M. Schroeder New Method of Measuring Reverberation Time , 1965 .

[6]  Kung Yao,et al.  A maximum-likelihood parametric approach to source localizations , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[7]  Jacob Benesty,et al.  A Generalized Steered Response Power Method for Computationally Viable Source Localization , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  Pasi Pertilä,et al.  TUT Acoustic Source Tracking System 2006 , 2006, CLEAR.

[9]  Yu Hen Hu,et al.  Maximum likelihood multiple-source localization using acoustic energy measurements with wireless sensor networks , 2005, IEEE Trans. Signal Process..

[10]  Michael S. Brandstein,et al.  Robust Localization in Reverberant Rooms , 2001, Microphone Arrays.

[11]  John Vanderkooy,et al.  Transfer-Function Measurement with Maximum-Length Sequences , 1989 .

[12]  Benoît Champagne,et al.  Performance of time-delay estimation in the presence of room reverberation , 1996, IEEE Trans. Speech Audio Process..

[13]  Jont B. Allen,et al.  Image method for efficiently simulating small‐room acoustics , 1976 .

[14]  Ramani Duraiswami,et al.  Accelerated speech source localization via a hierarchical search of steered response power , 2004, IEEE Transactions on Speech and Audio Processing.

[15]  Neil J. Gordon,et al.  A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking , 2002, IEEE Trans. Signal Process..

[16]  Jacob Benesty,et al.  Performance of GCC- and AMDF-Based Time-Delay Estimation in Practical Reverberant Environments , 2005, EURASIP J. Adv. Signal Process..

[17]  R. Boucher,et al.  Performance of the generalized cross correlator in the presence of a strong spectral peak in the signal , 1981 .

[18]  Maurizio Omologo,et al.  Use of the crosspower-spectrum phase in acoustic event location , 1997, IEEE Trans. Speech Audio Process..

[19]  Eric A. Lehmann,et al.  Particle Filter with Integrated Voice Activity Detection for Acoustic Source Tracking , 2007, EURASIP J. Adv. Signal Process..

[20]  Benesty Adaptive eigenvalue decomposition algorithm for passive acoustic source localization , 2000, The Journal of the Acoustical Society of America.

[21]  Pasi Pertilä,et al.  TUT Acoustic Source Tracking System 2007 , 2007, CLEAR.

[22]  Parham Aarabi,et al.  EURASIP Journal on Applied Signal Processing 2003:4, 338–347 c ○ 2003 Hindawi Publishing Corporation The Fusion of Distributed Microphone Arrays for Sound Localization , 2002 .

[23]  E. Mizutani,et al.  Neuro-Fuzzy and Soft Computing-A Computational Approach to Learning and Machine Intelligence [Book Review] , 1997, IEEE Transactions on Automatic Control.

[24]  Rainer Stiefelhagen,et al.  Eval-ware: Multimodal Interaction , 2007 .

[25]  Harvey F. Silverman,et al.  A Fast Microphone Array SRP-PHAT Source Location Implementation using Coarse-To-Fine Region Contraction(CFRC) , 2007, 2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[26]  Jean Rouat,et al.  Robust localization and tracking of simultaneous moving sound sources using beamforming and particle filtering , 2007, Robotics Auton. Syst..

[27]  A. Weiss,et al.  Fundamental limitations in passive time delay estimation--Part I: Narrow-band systems , 1983 .

[28]  Fredrik Gustafsson,et al.  On Resampling Algorithms for Particle Filters , 2006, 2006 IEEE Nonlinear Statistical Signal Processing Workshop.

[29]  John Garofoio,et al.  Eval-Ware: Multimodal Interaction [Best of the Web] , 2007, IEEE Signal Processing Magazine.

[30]  Darren B. Ward,et al.  Particle filtering algorithms for tracking an acoustic source in a reverberant environment , 2003, IEEE Trans. Speech Audio Process..

[31]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .

[32]  Ying Yu,et al.  A Real-Time SRP-PHAT Source Location Implementation using Stochastic Region Contraction(SRC) on a Large-Aperture Microphone Array , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[33]  Eric A. Lehmann,et al.  Particle filtering methods for acoustic source localisation and tracking , 2004 .

[34]  Jingdong Chen,et al.  TIME DELAY ESTIMATION USING SPATIAL CORRELATION TECHNIQUES , 2003 .

[35]  Mohan M. Trivedi,et al.  Source localization in reverberant environments: modeling and statistical analysis , 2003, IEEE Trans. Speech Audio Process..

[36]  Jacck Izydorczyk,et al.  Time delay estimation using the LMS adaptive filter , 2006 .

[37]  Eric A. Lehmann,et al.  Modeling of Motion Dynamics and its Influence on the Performance of a Particle Filter for Acoustic Speaker Tracking , 2007, 2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.