Performance enhancement of SSC sound source localization for indoor environment

The steered response power-phase transform algorithm (SRP-PHAT) has been widely utilized for robust sound source localization for indoor environment. Searching space clustering algorithm (SSC) is the improved version of SRP-PHAT, in which the computational complexity could be greatly reduced via the space division and clustering. However, SSC has to frequently perform the space division and clustering when the positions of microphone arrays are changed, which will induce additional computational complexity. In this paper, we proposed a coarse-to-fine region contraction SSC (CFRC-SSC) method to reduce the computational complexity of SSC for the sound source localization algorithm. The coarse level SSC with limited computational complexity will contract the whole searching space to several candidate spaces with limited size, which will reduce the searching volume for fine level SSC without omitting the actual sound source localization. Simulation results demonstrate that the proposed CFRC-SSC show a lower computational complexity in terms of SRP function evaluation times and space clustering calculation times compared to SSC.

[1]  Chris Kyriakakis,et al.  Hybrid algorithm for robust, real-time source localization in reverberant environments , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[2]  Jacob Benesty,et al.  Fast steered response power source localization using inverse mapping of relative delays , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[3]  Harvey F. Silverman,et al.  Microphone array optimization by stochastic region contraction , 1991, IEEE Trans. Signal Process..

[4]  Ramani Duraiswami,et al.  Accelerated speech source localization via a hierarchical search of steered response power , 2004, IEEE Transactions on Speech and Audio Processing.

[5]  B. Kwon,et al.  Sound source localization for robot auditory system using the summed GCC method , 2008, 2008 International Conference on Control, Automation and Systems.

[6]  Hyunsoo Kim,et al.  Sound source localization for robot auditory systems , 2009, IEEE Transactions on Consumer Electronics.

[7]  Ying Yu,et al.  A Real-Time SRP-PHAT Source Location Implementation using Stochastic Region Contraction(SRC) on a Large-Aperture Microphone Array , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[8]  Jont B. Allen,et al.  Image method for efficiently simulating small‐room acoustics , 1976 .

[9]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .

[10]  Hoang Tran Huy Do,et al.  Robust cross-correlation-based methods for sound-source localization and separation using a large-aperture microphone array , 2011 .