Tracking an unknown time-varying number of speakers using TDOA measurements: a random finite set approach

Speaker location estimation techniques based on time-difference-of-arrival measurements have attracted much attention recently. Many existing localization ideas assume that only one speaker is active at a time. In this paper, we focus on a more realistic assumption that the number of active speakers is unknown and time-varying. Such an assumption results in a more complex localization problem, and we employ the random finite set (RFS) theory to deal with that problem. The RFS concepts provide us with an effective, solid foundation where the multispeaker locations and the number of speakers are integrated to form a single set-valued variable. By applying a sequential Monte Carlo implementation, we develop a Bayesian RFS filter that simultaneously tracks the time-varying speaker locations and number of speakers. The tracking capability of the proposed filter is demonstrated in simulated reverberant environments

[1]  G. Matheron Random Sets and Integral Geometry , 1976 .

[2]  Jont B. Allen,et al.  Image method for efficiently simulating small‐room acoustics , 1976 .

[3]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .

[4]  Hong Wang,et al.  Coherent signal-subspace processing for the detection and estimation of angles of arrival of multiple wide-band sources , 1985, IEEE Trans. Acoust. Speech Signal Process..

[5]  Anthony J. Weiss,et al.  Direction finding for wide-band signals using an interpolated array , 1993, IEEE Trans. Signal Process..

[6]  K. C. Ho,et al.  A simple and efficient estimator for hyperbolic location , 1994, IEEE Trans. Signal Process..

[7]  T. Mattfeldt Stochastic Geometry and Its Applications , 1996 .

[8]  M. Viberg,et al.  Two decades of array signal processing research: the parametric approach , 1996, IEEE Signal Process. Mag..

[9]  Michael S. Brandstein,et al.  A closed-form location estimator for use with room environment microphone arrays , 1997, IEEE Trans. Speech Audio Process..

[10]  I. R. Goodman,et al.  Mathematics of Data Fusion , 1997 .

[11]  Jacob Benesty,et al.  Microphone arrays for video camera steering , 2000 .

[12]  van Marie-Colette Lieshout,et al.  Markov Point Processes and Their Applications , 2000 .

[13]  Satoshi Nakamura,et al.  Speech enhancement based on the subspace method , 2000, IEEE Trans. Speech Audio Process..

[14]  Benesty,et al.  Adaptive eigenvalue decomposition algorithm for passive acoustic source localization , 2000, The Journal of the Acoustical Society of America.

[15]  Nando de Freitas,et al.  An Introduction to Sequential Monte Carlo Methods , 2001, Sequential Monte Carlo Methods in Practice.

[16]  Nando de Freitas,et al.  Sequential Monte Carlo Methods in Practice , 2001, Statistics for Engineering and Information Science.

[17]  Andrew Blake,et al.  Nonlinear filtering for speaker tracking in noisy and reverberant environments , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[18]  Jacob Benesty,et al.  Real-time passive source localization: a practical linear-correction least-squares approach , 2001, IEEE Trans. Speech Audio Process..

[19]  Patrick Pérez,et al.  Sequential Monte Carlo methods for multiple target tracking and data fusion , 2002, IEEE Trans. Signal Process..

[20]  Neil J. Gordon,et al.  A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking , 2002, IEEE Trans. Signal Process..

[21]  Ronald P. S. Mahler,et al.  Particle-systems implementation of the PHD multitarget-tracking filter , 2003, SPIE Defense + Commercial Sensing.

[22]  R. Mahler Multitarget Bayes filtering via first-order multitarget moments , 2003 .

[23]  Jacob Benesty,et al.  A class of frequency-domain adaptive approaches to blind multichannel identification , 2003, IEEE Trans. Signal Process..

[24]  Freda Kemp,et al.  An Introduction to Sequential Monte Carlo Methods , 2003 .

[25]  Darren B. Ward,et al.  Particle filtering algorithms for tracking an acoustic source in a reverberant environment , 2003, IEEE Trans. Speech Audio Process..

[26]  Hedvig Kjellström,et al.  Tracking Random Sets of Vehicles in Terrain , 2003, 2003 Conference on Computer Vision and Pattern Recognition Workshop.

[27]  Mohan M. Trivedi,et al.  Source localization in reverberant environments: modeling and statistical analysis , 2003, IEEE Trans. Speech Audio Process..

[28]  Sumeetpal S. Singh,et al.  Sequential monte carlo implementation of the phd filter for multi-target tracking , 2003, Sixth International Conference of Information Fusion, 2003. Proceedings of the.

[29]  R.P.S. Mahler,et al.  "Statistics 101" for multisensor, multitarget data fusion , 2004, IEEE Aerospace and Electronic Systems Magazine.

[30]  Ba-Ngu Vo,et al.  Joint detection and tracking of multiple maneuvering targets in clutter using random finite sets , 2004, ICARCV 2004 8th Control, Automation, Robotics and Vision Conference, 2004..

[31]  Huimin Chen,et al.  Tracking of multiple moving speakers with multiple microphone arrays , 2004, IEEE Transactions on Speech and Audio Processing.

[32]  Ramani Duraiswami,et al.  Accelerated speech source localization via a hierarchical search of steered response power , 2004, IEEE Transactions on Speech and Audio Processing.

[33]  Ba-Ngu Vo,et al.  Probability hypothesis density filter versus multiple hypothesis tracking , 2004, SPIE Defense + Commercial Sensing.

[34]  Ronald P. S. Mahler,et al.  Multitarget miss distance via optimal assignment , 2004, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[35]  Thiagalingam Kirubarajan,et al.  Data association combined with the probability hypothesis density filter for multitarget tracking , 2004, SPIE Defense + Commercial Sensing.

[36]  Ba-Ngu Vo,et al.  Localizing an unknown time-varying number of speakers: a Bayesian random finite set approach , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[37]  A. Doucet,et al.  Sequential Monte Carlo methods for multitarget filtering with random finite sets , 2005, IEEE Transactions on Aerospace and Electronic Systems.

[38]  Charles J. Geyer,et al.  Likelihood inference for spatial point processes , 2019, Stochastic Geometry.

[39]  Jesper Møller,et al.  Markov chain Monte Carlo and spatial point processes , 2019, Stochastic Geometry.