Indoor multiple sound source localization using a novel data selection scheme

The multiple sound source localization problem in reverberant environments is studied by using a circular microphone array. The problem is approached by performing single source localization at each of the selected time-frequency (TF) data points from the received signals followed by a histogram technique to achieve robust localization. A novel TF data point selection scheme is proposed based on the ratio of the power of a TF point to that of its adjacent preceding TF point at the same frequency. The method introduces little extra computation and is suitable for real-time implementation. The proposed algorithm is compared with the algorithm in [1] to show its effectiveness and efficiency using extensive simulations.

[1]  Jont B. Allen,et al.  Image method for efficiently simulating small‐room acoustics , 1976 .

[2]  Athanasios Mouchtaris,et al.  Real-Time Multiple Sound Source Localization and Counting Using a Circular Microphone Array , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[3]  R. O. Schmidt,et al.  Multiple emitter location and signal Parameter estimation , 1986 .

[4]  Thomas Kailath,et al.  ESPRIT-estimation of signal parameters via rotational invariance techniques , 1989, IEEE Trans. Acoust. Speech Signal Process..

[5]  Parham Aarabi,et al.  Self-localizing dynamic microphone arrays , 2002 .

[6]  Thomas Hofmann,et al.  An EM Algorithm for Localizing Multiple Sound Sources in Reverberant Environments , 2007 .

[7]  Akihiko Sugiyama,et al.  A new DOA estimation method using a circular microphone array , 2007, 2007 15th European Signal Processing Conference.

[8]  Michael S. Brandstein,et al.  Robust Localization in Reverberant Rooms , 2001, Microphone Arrays.

[9]  Bhaskar D. Rao,et al.  A Two Microphone-Based Approach for Source Localization of Multiple Speech Sources , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[10]  J. Rissanen,et al.  Modeling By Shortest Data Description* , 1978, Autom..

[11]  H. Akaike A new look at the statistical model identification , 1974 .

[12]  Yannick Deville,et al.  A Time-Frequency CORRelation-Based Blind Source Separation Method for Time-Delayed Mixtures , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[13]  Scott Rickard,et al.  Blind separation of speech mixtures via time-frequency masking , 2004, IEEE Transactions on Signal Processing.

[14]  Ying Yu,et al.  A Real-Time SRP-PHAT Source Location Implementation using Stochastic Region Contraction(SRC) on a Large-Aperture Microphone Array , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[15]  Ying Yu,et al.  Performance of real-time source-location estimators for a large-aperture microphone array , 2005, IEEE Transactions on Speech and Audio Processing.

[16]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .