Speaker tracking based on distributed particle filter and interacting multiple model in distributed microphone networks

Abstract The reverberations and Gaussian noises from indoor environments pollute speech signals and bring the degradation of speaker tracking performance. In the paper, a speaker tracking method based on the distributed particle filter (DPF) and the interacting multiple model (IMM) is proposed in distributed microphone networks. The generalized cross-correlation (GCC) function is first employed to estimate the time difference of arrival (TDOA) of speech signals received by two microphones at each node. To overcome the adverse effects of reverberation and noise, multiple TDOAs are selected to calculate the multiple-hypothesis model as the local weight of the DPF. To simulate a real speaker motion, the IMM algorithm is applied and a calculation method of the local measurement is presented based on multiple TDOAs. Finally, the speaker tracking method based on the DPF and IMM is performed to track a moving speaker and obtain a global consistent position estimate. The proposed method can track the moving speaker in reverberant and noisy environments with high tracking accuracy in distributed networks, and it is robust against the fault nodes. Simulation results demonstrate the validity of the proposed speaker tracking method.

[1]  James R. Hopgood,et al.  A Time–Frequency Masking Based Random Finite Set Particle Filtering Method for Multiple Acoustic Source Detection and Tracking , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[2]  P. H. Foo,et al.  Combining the interacting multiple model method with particle filters for manoeuvring target tracking , 2011 .

[3]  Sharon Gannot,et al.  Microphone Array Speaker Localizers Using Spatial-Temporal Information , 2006, EURASIP J. Adv. Signal Process..

[4]  Nando de Freitas,et al.  The Unscented Particle Filter , 2000, NIPS.

[5]  Y. Boers,et al.  Interacting multiple model particle filter , 2003 .

[6]  Augusto Sarti,et al.  Acoustic Source Localization With Distributed Asynchronous Microphone Networks , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[7]  Y. Bar-Shalom,et al.  The interacting multiple model algorithm for systems with Markovian switching coefficients , 1988 .

[8]  K. C. Ho,et al.  Sensor Allocation for Source Localization With Decoupled Range and Bearing Estimation , 2008, IEEE Transactions on Signal Processing.

[9]  Petar M. Djuric,et al.  Distributed particle filtering in agent networks: A survey, classification, and comparison , 2013, IEEE Signal Processing Magazine.

[10]  Robert Babuska,et al.  Parametric Bayesian Filters for Nonlinear Stochastic Dynamical Systems: A Survey , 2013, IEEE Transactions on Cybernetics.

[11]  V. Jilkov,et al.  Survey of maneuvering target tracking. Part V. Multiple-model methods , 2005, IEEE Transactions on Aerospace and Electronic Systems.

[12]  Michael R. M. Jenkin,et al.  Audiovisual localization of multiple speakers in a video teleconferencing setting , 2003, Int. J. Imaging Syst. Technol..

[13]  Fuliang Yin,et al.  Speaker Tracking Based on Distributed Particle Filter in Distributed Microphone Networks , 2017, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[14]  Zhe Chen,et al.  Microphone Clustering and BP Network based Acoustic Source Localization in Distributed Microphone Arrays , 2013 .

[15]  Parham Aarabi,et al.  Self-localizing dynamic microphone arrays , 2002 .

[16]  Zhe Chen,et al.  Distributed iterated extended Kalman filter for speaker tracking in microphone array networks , 2017 .

[17]  Amir Averbuch,et al.  Interacting Multiple Model Methods in Target Tracking: A Survey , 1988 .

[18]  Zhengyou Zhang,et al.  Maximum Likelihood Sound Source Localization and Beamforming for Directional Microphone Arrays in Distributed Meetings , 2008, IEEE Transactions on Multimedia.

[19]  Augusto Sarti,et al.  A Robust and Low-Complexity Source Localization Algorithm for Asynchronous Distributed Microphone Networks , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[20]  Ye Tian,et al.  Distributed IMM-Unscented Kalman Filter for Speaker Tracking in Microphone Array Networks , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[21]  Neil J. Gordon,et al.  A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking , 2002, IEEE Trans. Signal Process..

[22]  Zhe Chen,et al.  Speaker Tracking Based on Distributed Particle Filter and Iterative Covariance Intersection in Distributed Microphone Networks , 2019, IEEE Journal of Selected Topics in Signal Processing.

[23]  Amir Asif,et al.  Consensus-based distributed dynamic sensor selection in decentralised sensor networks using the posterior Cramér-Rao lower bound , 2015, Signal Process..

[24]  Fuliang Yin,et al.  Distributed Marginalized Auxiliary Particle Filter for Speaker Tracking in Distributed Microphone Networks , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[25]  Darren B. Ward,et al.  Particle filtering algorithms for tracking an acoustic source in a reverberant environment , 2003, IEEE Trans. Speech Audio Process..

[26]  William Read,et al.  TDOA Estimation With Compressive Sensing Measurements and Hadamard Matrix , 2018, IEEE Transactions on Aerospace and Electronic Systems.