Global Coherence Field and Distributed Particle Filter-Based Speaker Tracking in Distributed Microphone Networks

Based on the combination of global coherence field (GCF) and distributed particle filter (DPF) a speaker tracking method is proposed for distributed microphone networks in this paper. In the distributed microphone network, each node comprises a microphone pair, and its generalized cross-correlation (GCC) function is estimated. Based on the average over all local GCC observations, a global coherence field-based pseudo-likelihood (GCF-PL) function is developed as the likelihood for a DPF. In the proposed method, all nodes share an identical particle set, and each node performs local particle filtering simultaneously. In the local particle filter, the likelihood GCF-PL for each particle weight is computed with an average consensus algorithm. With an identical particle set and the consistent estimate of GCF-PL for each particle weight, all individual nodes possess a common particle presentation for the global posterior of the speaker state, which is utilized by each node for an estimated global speaker position. Employing the GCF-PL as the likelihood for DPF, no assumption is required about the independence of nodes observations as well as observation noise statistics. Additionally, only local information exchange occurs among neighboring nodes; and finally each node has a global estimate of the speaker position. Simulation results demonstrate the validity of the proposed method.

[1]  Mohammad Ali Tinati,et al.  Distributed multi-target tracking using joint probabilistic data association and average consensus filter , 2011, Ann. des Télécommunications.

[2]  Darren B. Ward,et al.  Particle filtering algorithms for tracking an acoustic source in a reverberant environment , 2003, IEEE Trans. Speech Audio Process..

[3]  Seiichi Nakagawa,et al.  Automatic estimation of position and orientation of an acoustic source by a microphone array network. , 2009, The Journal of the Acoustical Society of America.

[4]  Simon J. Godsill,et al.  Acoustic Source Localization and Tracking of a Time-Varying Number of Speakers , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[5]  Na Zhu,et al.  Locating arbitrarily time-dependent sound sources in three dimensional space in real time. , 2010, The Journal of the Acoustical Society of America.

[6]  Anthony G. Constantinides,et al.  Audio–Visual Active Speaker Tracking in Cluttered Indoors Environments , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[7]  Augusto Sarti,et al.  Acoustic Source Localization With Distributed Asynchronous Microphone Networks , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .

[9]  Neil J. Gordon,et al.  A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking , 2002, IEEE Trans. Signal Process..

[10]  Tapio Lokki,et al.  Speaker Tracking for Teleconferencing via Binaural Headset Microphones , 2012, IWAENC.

[11]  Petar M. Djuric,et al.  Distributed particle filtering in agent networks: A survey, classification, and comparison , 2013, IEEE Signal Processing Magazine.

[12]  Eric A. Lehmann,et al.  Reverberation-Time Prediction Method for Room Impulse Responses Simulated with the Image-Source Model , 2007, 2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[13]  Amir Asif,et al.  Distributed Particle Filter Implementation With Intermittent/Irregular Consensus Convergence , 2013, IEEE Transactions on Signal Processing.

[14]  Eric A. Lehmann,et al.  Particle Filter Design Using Importance Sampling for Acoustic Source Localisation and Tracking in Reverberant Environments , 2006, EURASIP J. Adv. Signal Process..

[15]  Zhe Chen,et al.  A Time Delay Estimation Method Based on Wavelet Transform and Speech Envelope for Distributed Microphone Arrays , 2013 .

[16]  Zhiyong Yu,et al.  Capture, recognition, and visualization of human semantic interactions in meetings , 2010, 2010 IEEE International Conference on Pervasive Computing and Communications (PerCom).

[17]  Jacob Benesty,et al.  Time-delay estimation via linear interpolation and cross correlation , 2004, IEEE Transactions on Speech and Audio Processing.

[18]  Ari Visa,et al.  Measurement Combination for Acoustic Source Localization in a Room Environment , 2008, EURASIP J. Audio Speech Music. Process..

[19]  Takanobu Nishiura,et al.  Robust speaker localization in a disturbance noise environment using a distributed microphone system , 2010, 2010 7th International Symposium on Chinese Spoken Language Processing.

[20]  Reza Olfati-Saber,et al.  Consensus and Cooperation in Networked Multi-Agent Systems , 2007, Proceedings of the IEEE.

[21]  Finn Jacobsen,et al.  Beamforming with a circular microphone array for localization of environmental noise sources. , 2010, The Journal of the Acoustical Society of America.

[22]  Na Zhu,et al.  SOUND SOURCE LOCALIZATION IN THREE-DIMENSIONAL SPACE IN REAL TIME WITH REDUNDANCY CHECKS , 2012 .

[23]  Sven Nordholm,et al.  Real-Time Implementation of a Particle Filter with Integrated Voice Activity Detector for Acoustic Speaker Tracking , 2006, APCCAS 2006 - 2006 IEEE Asia Pacific Conference on Circuits and Systems.

[24]  Fotios Talantzis An Acoustic Source Localization and Tracking Framework Using Particle Filtering and Information Theory , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[25]  Stephen P. Boyd,et al.  Fast linear iterations for distributed averaging , 2003, 42nd IEEE International Conference on Decision and Control (IEEE Cat. No.03CH37475).

[26]  Na Zhu,et al.  Passive sonic detection and ranging for locating sound sources. , 2013, The Journal of the Acoustical Society of America.

[27]  Alessio Brutti,et al.  Classification of Acoustic Maps to Determine Speaker Position and Orientation from a Distributed Microphone Network , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[28]  Seong Ho Kang,et al.  Object tracking for security monitoring system using microphone array , 2007, 2007 International Conference on Control, Automation and Systems.