A plenacoustic approach to acoustic signal extraction

This paper considers the problem of separation of acoustic sources from convolutive mixtures captured by a microphone array. The problem is approached through Plane Wave Decomposition (PWD) of the sound field measured at multiple points along the extension of the array. The directional components of the sound field are analyzed by means of the plenacoustic framework to accurately estimate the direction of arrival of the desired and undesired sources at every point at which the PWD is measured. Multiple spatial filters are designed, one for each PWD measurement point, to leave undistorted the desired source and attenuate the interferer. A successive stage of delay and sum of the outputs of the individual spatial filters enables the reconstruction of the desired source. The use of the plenacoustic framework allows us to gather intuitive and immediate interpretation of the acoustic scene. We prove the effectiveness of the proposed solution through simulations on speech data.

[1]  Emanuel A. P. Habets,et al.  Speech Enhancement in the STFT Domain , 2011, Springer Briefs in Electrical and Computer Engineering.

[2]  R. K. Cook,et al.  Measurement of Correlation Coefficients in Reverberant Sound Fields , 1955 .

[3]  O. L. Frost,et al.  An algorithm for linearly constrained adaptive array processing , 1972 .

[4]  Bhaskar D. Rao,et al.  Separation and tracking of multiple speakers in a reverberant environment using a multiple model particle filter glimpsing method , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[5]  Harry L. Van Trees,et al.  Optimum Array Processing , 2002 .

[6]  Jingdong Chen,et al.  Microphone Array Signal Processing , 2008 .

[7]  E. C. Cmm,et al.  on the Recognition of Speech, with , 2008 .

[8]  Augusto Sarti,et al.  Tracking Multiple Acoustic Sources in Reverberant Environments using Regularized Particle Filter , 2007, 2007 15th International Conference on Digital Signal Processing.

[9]  Richard O. Duda,et al.  Use of the Hough transformation to detect lines and curves in pictures , 1972, CACM.

[10]  Jont B. Allen,et al.  Short term spectral analysis, synthesis, and modification by discrete Fourier transform , 1977 .

[11]  Jacob Benesty,et al.  Speech Enhancement , 2010 .

[12]  Darren B. Ward,et al.  Particle filtering algorithms for tracking an acoustic source in a reverberant environment , 2003, IEEE Trans. Speech Audio Process..

[13]  Rémi Gribonval,et al.  Performance measurement in blind audio source separation , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[14]  Augusto Sarti,et al.  Deconvolution of plenacoustic images , 2013, 2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[15]  Fabian J. Theis,et al.  The signal separation evaluation campaign (2007-2010): Achievements and remaining challenges , 2012, Signal Process..

[16]  Israel Cohen,et al.  Multichannel Eigenspace Beamforming in a Reverberant Noisy Environment With Multiple Interfering Speech Signals , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[17]  Israel Cohen,et al.  Dual-Source Transfer-Function Generalized Sidelobe Canceller , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[18]  Augusto Sarti,et al.  Soundfield Imaging in the Ray Space , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[19]  Petre Stoica,et al.  Spectral Analysis of Signals , 2009 .

[20]  Augusto Sarti,et al.  Fast Tracing of Acoustic Beams and Paths Through Visibility Lookup , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[21]  Mahmood R. Azimi-Sadjadi,et al.  Wideband DOA estimation algorithms for multiple target detection and tracking using unattended acoustic sensors , 2004, SPIE Defense + Commercial Sensing.

[22]  Oliver Thiergart,et al.  An informed LCMV filter based on multiple instantaneous direction-of-arrival estimates , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.