Multiple Speech Sources Localization in Room Reverberant Environment Using Spherical Harmonic Sparse Bayesian Learning

In recent years, sparse representation techniques have been proposed for source localization using spherical microphone arrays (SMAs). However, the performance of these sparse representation techniques for SMAs degrades for speech source localization in the room environment due to sound reverberation. This article proposes a robust sparse presentation method for localization of multiple speech sources in the room environment using an SMA, which employs the spherical harmonic temporal extension of multiple response model sparse Bayesian learning. Real-world experimental results demonstrate that the proposed method outperforms its existing counterparts for speech source localization in the real room environment.

[1]  Boaz Rafaely,et al.  Fundamentals of Spherical Array Processing , 2015, Springer Topics in Signal Processing.

[2]  Rajesh M. Hegde,et al.  Near-Field Acoustic Source Localization and Beamforming in Spherical Harmonics Domain , 2016, IEEE Transactions on Signal Processing.

[3]  Walter Kellermann,et al.  EB-ESPRIT: 2D localization of multiple wideband acoustic sources using eigen-beams , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[4]  Boaz Rafaely,et al.  Analysis and design of spherical microphone arrays , 2005, IEEE Transactions on Speech and Audio Processing.

[5]  Shefeng Yan,et al.  Spherical harmonics MUSIC versus conventional MUSIC , 2011 .

[6]  Boaz Rafaely,et al.  Spherical array processing for acoustic analysis using room impulse responses and time-domain smoothing. , 2013, The Journal of the Acoustical Society of America.

[7]  Rajesh M. Hegde,et al.  Joint source localization and separation in spherical harmonic domain using a sparsity based method , 2015, INTERSPEECH.

[8]  Bhaskar D. Rao,et al.  Sparse Signal Recovery With Temporally Correlated Source Vectors Using Sparse Bayesian Learning , 2011, IEEE Journal of Selected Topics in Signal Processing.

[9]  Craig T. Jin,et al.  A dereverberation algorithm for spherical microphone arrays using compressed sensing techniques , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[10]  Boaz Rafaely,et al.  Coherent signals direction-of-arrival estimation using a spherical microphone array: Frequency smoothing approach , 2009, 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[11]  Qinghua Huang,et al.  Real-valued DOA estimation for spherical arrays using sparse Bayesian learning , 2016, Signal Process..

[12]  Walter Kellermann,et al.  Joint DOA and TDOA estimation for 3D localization of reflective surfaces using eigenbeam MVDR and spherical microphone arrays , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[13]  Walter Kellermann,et al.  Robust localization of multiple sources in reverberant environments using EB-ESPRIT with spherical microphone arrays , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[14]  Dmitry M. Malioutov,et al.  A sparse signal reconstruction perspective for source localization with sensor arrays , 2005, IEEE Transactions on Signal Processing.

[15]  Igal Bilik,et al.  Spatial Compressive Sensing for Direction-of-Arrival Estimation of Multiple Sources using Dynamic Sensor Arrays , 2011, IEEE Transactions on Aerospace and Electronic Systems.

[16]  Boaz Rafaely,et al.  Acoustic analysis by spherical microphone array processing of room impulse responses. , 2012, The Journal of the Acoustical Society of America.