Accelerated steered response power method for sound source localization via clustering search

The steered response power-phase transform (SRP-PHAT) sound source localization algorithm is robust in a real environment. However, the large computation complexity limits the practical application of SRP-PHAT. For a microphone array, each location corresponds to a set of time differences of arrival (TDOAs), and this paper collects them into a TDOA vector. Since the TDOA vectors in the adjacent regions are similar, we present a fast algorithm based on clustering search to reduce the computation complexity of SRP-PHAT. In the training stage, the K-means or Iterative Self-Organizing Data Analysis Technique (ISODATA) clustering algorithm is used to find the centroid in each cluster with similar TDOA vectors. In the procedure of sound localization, the optimal cluster is found by comparing the steered response powers (SRPs) of all centroids. The SRPs of all candidate locations in the optimal cluster are compared to localize the sound source. Experiments both in simulation environments and real environments have been performed to compare the localization accuracy and computational load of the proposed method with those of the conventional SRP-PHAT algorithm. The results show that the proposed method is able to reduce the computational load drastically and maintains almost the same localization accuracy and robustness as those of the conventional SRP-PHAT algorithm. The difference in localization performance brought by different clustering algorithms used in the training stage is trivial.

[1]  Zhenyang Wu,et al.  Accelerated steered response power method for sound source localization using orthogonal linear array , 2010 .

[2]  Chris Kyriakakis,et al.  Hybrid algorithm for robust, real-time source localization in reverberant environments , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[3]  Maurizio Omologo,et al.  Speech Recognition with Microphone Arrays , 2001, Microphone Arrays.

[4]  Hong Kim,et al.  A name recognition based call-and-come service for home robots , 2008, IEEE Transactions on Consumer Electronics.

[5]  Xiaoyu Ma,et al.  Study on 2-dimensional microphone circular array speech enhancement algorithm based on the DOA , 2011, Proceedings of 2011 International Conference on Electronic & Mechanical Engineering and Information Technology.

[6]  Ramani Duraiswami,et al.  Accelerated speech source localization via a hierarchical search of steered response power , 2004, IEEE Transactions on Speech and Audio Processing.

[7]  Jean-Marc Odobez,et al.  AV16.3: An Audio-Visual Corpus for Speaker Localization and Tracking , 2004, MLMI.

[8]  Hyunsoo Kim,et al.  Sound source localization for robot auditory systems , 2009, IEEE Transactions on Consumer Electronics.

[9]  Michael S. Brandstein,et al.  Real-Time Automated Video and Audio Capture with Multiple Cameras and Microphones , 2001, J. VLSI Signal Process..

[10]  Mengyin Fu,et al.  Sound Source Localization Sensor of Robot for TDOA Method , 2011, 2011 Third International Conference on Intelligent Human-Machine Systems and Cybernetics.

[11]  Jacob Benesty,et al.  A Generalized Steered Response Power Method for Computationally Viable Source Localization , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[12]  Steve Renals,et al.  A digital microphone array for distant speech recognition , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[13]  Jont B. Allen,et al.  Image method for efficiently simulating small‐room acoustics , 1976 .

[14]  Joseph H. DiBiase A High-Accuracy, Low-Latency Technique for Talker Localization in Reverberant Environments Using Microphone Arrays , 2000 .

[15]  J.M. Peterson,et al.  Analysis of Fast Localization Algorithms for Acoustical Environments , 2005, Conference Record of the Thirty-Ninth Asilomar Conference onSignals, Systems and Computers, 2005..

[16]  Zhu Liang Yu,et al.  A simple microphone array for source direction and distance estimation , 2011, 2011 6th IEEE Conference on Industrial Electronics and Applications.

[17]  Michael S. Brandstein,et al.  Microphone Arrays - Signal Processing Techniques and Applications , 2001, Microphone Arrays.

[18]  François Michaud,et al.  Evaluating real-time audio localization algorithms for artificial audition in robotics , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[19]  Mohammad Sharifkhani,et al.  Implementation of a cost efficient SSL based on an Angular beamformer SRP-PHAT , 2011, 2011 18th IEEE International Conference on Electronics, Circuits, and Systems.