Cartesian tracking of unknown time-varying number of speakers using distributed microphone pairs

This paper considers the challenging problem of Cartesian tracking of multiple sources using multiple distributed microphone arrays when the number of sources is unknown and varies with time due to new sources appearing and existing sources disappearing or undergoing long silence periods. The problem is posed in a bearings-only tracking framework. Frequency-domain independent component analysis (ICA) in conjunction with state coherence transform (SCT) is used as a robust method to extract the bearing information of the speakers. Also, by exploiting the frequency sparsity of the sources, ICA/SCT has proven to be effective even when the number of simultaneous speakers is larger than the number of microphones in an array. Next, the bearing information for each array is fused using a sequential-corrector probability hypothesis density (PHD) filter with a limited field of view (FOV) for each microphone array. The limited FOV is essential for applications like speech in order to account for the more distant sources not registering detections with respect to a sensor array. The promising tracking capability of the proposed method is demonstrated using simulations of multiple speakers in a reverberant environment.

[1]  Hiroshi Sawada,et al.  Direction of arrival estimation for multiple source signals using independent component analysis , 2003, Seventh International Symposium on Signal Processing and Its Applications, 2003. Proceedings..

[2]  Francesco Nesta,et al.  Generalized State Coherence Transform for Multidimensional TDOA Estimation of Multiple Sources , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[3]  Ronald P. S. Mahler The multisensor PHD filter: I. General solution via multitarget calculus , 2009, Defense + Commercial Sensing.

[4]  E. Lehmann,et al.  Prediction of energy decay in room impulse responses simulated with an image-source model. , 2008, The Journal of the Acoustical Society of America.

[5]  Bhaskar D. Rao,et al.  An ICA-SCT-PHD Filter Approach for Tracking and Separation of Unknown Time-Varying Number of Sources , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[6]  R. Mahler Multitarget Bayes filtering via first-order multitarget moments , 2003 .

[7]  Bhaskar D. Rao,et al.  An ICA-based RFS approach for DOA tracking of unknown time-varying numberof sources , 2012, 2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO).

[8]  Ronald P. S. Mahler,et al.  Statistical Multisource-Multitarget Information Fusion , 2007 .

[9]  Ba-Ngu Vo,et al.  The Gaussian Mixture Probability Hypothesis Density Filter , 2006, IEEE Transactions on Signal Processing.

[10]  Sim Heng Ong,et al.  Multiple sensor multiple object tracking with GMPHD filter , 2007, 2007 10th International Conference on Information Fusion.