Indoor Multiple Sound Source Localization via Multi-Dimensional Assignment Data Association

In this paper, we address the multiple sound source localization problem by associating and fusing the direction of arrival (DOA) estimates from multiple microphone arrays. For multi-source scenarios especially in indoor environments, a critical issue is to tell the correspondence among DOA estimates across different arrays, which is known as the data association problem. We propose a multi-dimensional assignment-based data association approach to find the optimal associations of DOA estimates from the same source. First, in the sense of maximum likelihood, the data association problem is formulated by finding the most likely partition of the measurement set into the source-originated and false alarm-originated subsets. Next, by defining the association costs appropriately, the problem of finding the most likely measurement partition is transformed into a generalized multi-dimensional assignment problem which can be solved efficiently by a Lagrangian relaxation algorithm. After the optimal associations of DOA estimates across different arrays are obtained, the locations of sources can be estimated by fusing the same source-originated DOA estimates. In the presence of missed detections, false alarms and the unknown number of sources, the proposed method achieves high accuracy in data association and localization, and outperforms the competing method in reverberant and noisy environments. In addition, since our method does not require additional features and uses DOA estimates only, it is more computationally efficient than the competing method.

[1]  Qi Cheng,et al.  Multiple Sound Source Localization Based on a Multi-Dimensional Assignment Model , 2018, 2018 21st International Conference on Information Fusion (FUSION).

[2]  Y. Bar-Shalom,et al.  A new relaxation algorithm and passive sensor data association , 1992 .

[3]  Athanasios Mouchtaris,et al.  Source counting in real-time sound source localization using a circular microphone array , 2012, 2012 IEEE 7th Sensor Array and Multichannel Signal Processing Workshop (SAM).

[4]  Kutluyil Dogançay,et al.  Bearings-only target localization using total least squares , 2005, Signal Process..

[5]  Emmanuel Vincent,et al.  Multi-source TDOA estimation in reverberant audio using angular spectra and clustering , 2012, Signal Process..

[6]  Maximo Cobos,et al.  Maximum a Posteriori Binary Mask Estimation for Underdetermined Source Separation Using Smoothed Posteriors , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[7]  Athanasios Mouchtaris,et al.  Addressing the data-association problem for multiple sound source localization using DOA estimates , 2015, 2015 23rd European Signal Processing Conference (EUSIPCO).

[8]  J.D. Reed,et al.  Multiple-source localization using line-of-bearing measurements: Approaches to the data association problem , 2008, MILCOM 2008 - 2008 IEEE Military Communications Conference.

[9]  Athanasios Mouchtaris,et al.  Localizing multiple audio sources from DOA estimates in a wireless acoustic sensor network , 2013, 2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[10]  Alexander Bertrand,et al.  Applications and trends in wireless acoustic sensor networks: A signal processing perspective , 2011, 2011 18th IEEE Symposium on Communications and Vehicular Technology in the Benelux (SCVT).

[11]  Yu Hen Hu,et al.  Maximum likelihood multiple-source localization using acoustic energy measurements with wireless sensor networks , 2005, IEEE Transactions on Signal Processing.

[12]  M. Morf,et al.  The signal subspace approach for multiple wide-band emitter location , 1983 .

[13]  R. O. Schmidt,et al.  Multiple emitter location and signal Parameter estimation , 1986 .

[14]  Gang Wang,et al.  Efficient Convex Relaxation Methods for Robust Target Localization by a Sensor Network Using Time Differences of Arrivals , 2009, IEEE Transactions on Signal Processing.

[15]  Athanasios Mouchtaris,et al.  Localizing multiple audio sources in a wireless acoustic sensor network , 2015, Signal Process..

[16]  Peter Molnar,et al.  Bearings-only target localization for an acoustical unattended ground sensor network , 2001, SPIE Defense + Commercial Sensing.

[17]  Helen J Simon,et al.  Bilateral amplification and sound localization: then and now. , 2005, Journal of rehabilitation research and development.

[18]  Y. Bar-Shalom,et al.  m-best S-D assignment algorithm with application to multitarget tracking , 2001 .

[19]  Krishna R. Pattipati,et al.  Fast data association using multidimensional assignment with clustering , 2001 .

[20]  Y. Bar-Shalom,et al.  A generalized S-D assignment algorithm for multisensor-multitarget state estimation , 1997, IEEE Transactions on Aerospace and Electronic Systems.

[21]  Michael S. Brandstein,et al.  Robust automatic video-conferencing with multiple cameras and microphones , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[22]  Athanasios Mouchtaris,et al.  Real-Time Multiple Sound Source Localization and Counting Using a Circular Microphone Array , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[23]  Yangsheng Xu,et al.  Surveillance Robot Utilizing Video and Audio Information , 2009, J. Intell. Robotic Syst..

[24]  Wing-Kin Ma,et al.  Least squares algorithms for time-of-arrival-based mobile location , 2004, IEEE Transactions on Signal Processing.

[25]  Qi Cheng,et al.  Indoor multiple sound source localization using a novel data selection scheme , 2014, 2014 48th Annual Conference on Information Sciences and Systems (CISS).

[26]  Akihiko Sugiyama,et al.  A new DOA estimation method using a circular microphone array , 2007, 2007 15th European Signal Processing Conference.

[27]  Hong Wang,et al.  Coherent signal-subspace processing for the detection and estimation of angles of arrival of multiple wide-band sources , 1985, IEEE Trans. Acoust. Speech Signal Process..

[28]  Yu Hen Hu,et al.  Energy-Based Collaborative Source Localization Using Acoustic Microsensor Array , 2003, EURASIP J. Adv. Signal Process..

[29]  Zhi Ding,et al.  Multiple Source Localization in Wireless Sensor Networks Based on Time of Arrival Measurement , 2014, IEEE Transactions on Signal Processing.

[30]  Eric A. Lehmann,et al.  Diffuse Reverberation Model for Efficient Image-Source Simulation of Room Impulse Responses , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[31]  Nedelko Grbic,et al.  Source localization for multiple speech sources using low complexity non-parametric source separation and clustering , 2011, Signal Process..

[32]  Qi Cheng,et al.  Indoor multi-sound source localization based on nonparametric Bayesian clustering , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[33]  R. G. Stansfield,et al.  Statistical theory of d.f. fixing , 1947 .

[34]  Qi Cheng,et al.  Real-time microphone array processing for sound source separation and localization , 2013, 2013 47th Annual Conference on Information Sciences and Systems (CISS).

[35]  Thomas Kailath,et al.  ESPRIT-estimation of signal parameters via rotational invariance techniques , 1989, IEEE Trans. Acoust. Speech Signal Process..

[36]  Emmanuel Vincent,et al.  The 2008 Signal Separation Evaluation Campaign: A Community-Based Approach to Large-Scale Evaluation , 2009, ICA.

[37]  Péter Molnár,et al.  Maximum likelihood methods for bearings-only target localization , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[38]  Patrick Danès,et al.  Broadband variations of the MUSIC high-resolution method for Sound Source Localization in Robotics , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[39]  Athanasios Mouchtaris,et al.  Multiple Sound Source Location Estimation in Wireless Acoustic Sensor Networks Using DOA Estimates: The Data-Association Problem , 2018, IEEE/ACM Transactions on Audio, Speech, and Language Processing.