Acoustic source tracking in reverberant environment using regional steered response power measurement

Acoustic source localization and tracking using a microphone array is challenging due to the presence of background noise and room reverberation. Conventional algorithms employ the steered response power (SRP) as the measurement function in a particle filter based tracking framework. The particle weight is updated according to a pseudo-likelihood derived from the SRP value of each particle position. The performance of this approach reduces in a noisy and reverberant environment. In this paper, instead of evaluating the SRP value for each discrete particle position, we propose to apply a regional SRP beamformer which takes into account a circular region centered on each particle position, in order to provide a more robust particle likelihood evaluation. In addition, a proper mapping function is proposed to transform the regional SRP value to the likelihood. Simulation results show that the proposed method achieves robustness in tracking a speech source in a noisy and reverberant environment.

[1]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .

[2]  Jonathan G. Fiscus,et al.  Darpa Timit Acoustic-Phonetic Continuous Speech Corpus CD-ROM {TIMIT} | NIST , 1993 .

[3]  Joseph H. DiBiase A High-Accuracy, Low-Latency Technique for Talker Localization in Reverberant Environments Using Microphone Arrays , 2000 .

[4]  Neil J. Gordon,et al.  A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking , 2002, IEEE Trans. Signal Process..

[5]  E. Lehmann,et al.  Prediction of energy decay in room impulse responses simulated with an image-source model. , 2008, The Journal of the Acoustical Society of America.

[6]  Darren B. Ward,et al.  Particle filtering algorithms for tracking an acoustic source in a reverberant environment , 2003, IEEE Trans. Speech Audio Process..

[7]  Jacob Benesty,et al.  Real-time passive source localization: a practical linear-correction least-squares approach , 2001, IEEE Trans. Speech Audio Process..

[8]  Michael S. Brandstein,et al.  Robust Localization in Reverberant Rooms , 2001, Microphone Arrays.

[9]  Simon J. Godsill,et al.  Acoustic Source Localization and Tracking Using Track Before Detect , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[10]  Jacob Benesty,et al.  A Generalized Steered Response Power Method for Computationally Viable Source Localization , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[11]  Maximo Cobos,et al.  A Modified SRP-PHAT Functional for Robust Real-Time Sound Source Localization With Scalable Spatial Sampling , 2011, IEEE Signal Processing Letters.

[12]  Eric A. Lehmann,et al.  Particle Filter with Integrated Voice Activity Detection for Acoustic Source Tracking , 2007, EURASIP J. Adv. Signal Process..

[13]  Andy W. H. Khong,et al.  Speaker localization and tracking in the presence of sound interference by exploiting speech harmonicity , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[14]  Andrew Blake,et al.  Nonlinear filtering for speaker tracking in noisy and reverberant environments , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).