Joint Source Localization and Dereverberation by Sound Field Interpolation Using Sparse Regularization

In this paper, source localization and dereverberation are formulated jointly as an inverse problem. The inverse problem consists in the interpolation of the sound field measured by a set of microphones by matching the recorded sound pressure with that of a particular acoustic model. This model is based on a collection of equivalent sources creating either spherical or plane waves. In order to achieve meaningful results, spatial, spatio-temporal and spatio-spectral sparsity can be promoted in the signals originating from the equivalent sources. The inverse problem consists of a large-scale optimization problem that is solved using a first order matrix-free optimization algorithm. It is shown that once the equivalent source signals capable of effectively interpolating the sound field are obtained, they can be readily used to localize a speech sound source in terms of Direction of Arrival (DOA) and to perform dereverberation in a highly reverberant environment.

[1]  Rémi Gribonval,et al.  Physics-Driven Inverse Problems Made Tractable With Cosparse Regularization , 2016, IEEE Transactions on Signal Processing.

[2]  Álvaro González Measurement of Areas on a Sphere Using Fibonacci and Latitude–Longitude Lattices , 2009, 0912.4540.

[3]  Walter Kellermann,et al.  Coherent-to-Diffuse Power Ratio Estimation for Dereverberation , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[4]  Ralf Hiptmair,et al.  Vekua theory for the Helmholtz operator , 2011 .

[5]  Jesper Jensen,et al.  An Algorithm for Intelligibility Prediction of Time–Frequency Weighted Noisy Speech , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[6]  Philippe Souères,et al.  A survey on sound source localization in robotics: From binaural to array processing methods , 2015, Comput. Speech Lang..

[7]  Marc Moonen,et al.  Source localization and signal reconstruction in a reverberant field using the FDTD method , 2014, 2014 22nd European Signal Processing Conference (EUSIPCO).

[8]  Stephen P. Boyd,et al.  Proximal Algorithms , 2013, Found. Trends Optim..

[9]  Marc Moonen,et al.  Room Impulse Response Interpolation Using a Sparse Spatio-Temporal Representation of the Sound Field , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[10]  Stefan Goetze,et al.  Regularization for Partial Multichannel Equalization for Speech Dereverberation , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[11]  Laurent Daudet,et al.  Robust source localization from wavefield separation including prior information. , 2017, The Journal of the Acoustical Society of America.

[12]  Biing-Hwang Juang,et al.  Speech Dereverberation Based on Maximum-Likelihood Estimation With Time-Varying Gaussian Source Model , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[13]  Toon van Waterschoot,et al.  Multi-Channel Linear Prediction-Based Speech Dereverberation With Sparse Priors , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[14]  Michael S. Brandstein,et al.  Microphone Arrays - Signal Processing Techniques and Applications , 2001, Microphone Arrays.

[15]  Marc Moonen,et al.  On the Modeling of Rectangular Geometries in Room Acoustic Simulations , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[16]  Emanuel A. P. Habets,et al.  Dual-Microphone Speech Dereverberation using a Reference Signal , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[17]  Martin Vetterli,et al.  Room helps: Acoustic localization with finite elements , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[18]  Julien de Rosny,et al.  A Blind Dereverberation Method for Narrowband Source Localization , 2015, IEEE Journal of Selected Topics in Signal Processing.