Soundfield Imaging in the Ray Space

In this work we propose a general approach to acoustic scene analysis based on a novel data structure (ray-space image) that encodes the directional plenacoustic function over a line segment (Observation Window, OW). We define and describe a system for acquiring a ray-space image using a microphone array and refer to it as ray-space (or “soundfield”) camera. The method consists of acquiring the pseudo-spectra corresponding to a grid of sampling points over the OW, and remapping them onto the ray space, which parameterizes acoustic paths crossing the OW. The resulting ray-space image displays the information gathered by the sensors in such a way that the elements of the acoustic scene (sources and reflectors) will be easy to discern, recognize and extract. The key advantage of this method is that ray-space images, irrespective of the application, are generated by a common (and highly parallelizable) processing layer, and can be processed using methods coming from the extensive literature of pattern analysis. After defining the ideal ray-space image in terms of the directional plenacoustic function, we show how to acquire it using a microphone array. We also discuss resolution and aliasing issues and show two simple examples of applications of ray-space imaging.

[1]  Parhi,et al.  Wideband DOA Estimation Algorithms for Multiple Moving Sources using Unattended Acoustic Sensors , 2008 .

[2]  Richard O. Duda,et al.  Use of the Hough transformation to detect lines and curves in pictures , 1972, CACM.

[3]  Marc Moonen,et al.  Joint DOA and multi-pitch estimation based on subspace techniques , 2012, EURASIP J. Adv. Signal Process..

[4]  Walter Kellermann,et al.  Localization of distinct reflections in rooms using spherical microphone array eigenbeam processing. , 2012, The Journal of the Acoustical Society of America.

[5]  Tsuhan Chen,et al.  A survey on image-based rendering - representation, sampling and compression , 2004, Signal Process. Image Commun..

[6]  Terence Betlehem,et al.  Theory and design of sound field reproduction in reverberant rooms. , 2005, The Journal of the Acoustical Society of America.

[7]  Marc Levoy,et al.  Light field rendering , 1996, SIGGRAPH.

[8]  Martin Vetterli,et al.  The plenacoustic function, sampling and reconstruction , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[9]  T. Ajdler,et al.  The Plenacoustic Function and Its Sampling , 2006, IEEE Transactions on Signal Processing.

[10]  Augusto Sarti,et al.  A room-compensated virtual surround system exploiting early reflections in a reverberant room , 2012, 2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO).

[11]  Flavio P. Ribeiro,et al.  Fast Transforms for Acoustic Imaging— Part I: Theory , 2011, IEEE Transactions on Image Processing.

[12]  F. Alton Everest,et al.  Master handbook of acoustics , 1981 .

[13]  Tapio Lokki,et al.  Augmented reality audio for location-based games , 2009 .

[14]  Cha Zhang,et al.  Using Reverberation to Improve Range and Elevation Discrimination for Small Array Sound Source Localization , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[15]  Walter Kellermann,et al.  TDOA Estimation for Multiple Sound Sources in Noisy and Reverberant Environments Using Broadband Independent Component Analysis , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[16]  Emanuel A. P. Habets,et al.  On the Noise Reduction Performance of a Spherical Harmonic Domain Tradeoff Beamformer , 2012, IEEE Signal Processing Letters.

[17]  Emanuel A. P. Habets,et al.  Joint Dereverberation and Residual Echo Suppression of Speech Signals in Noisy Environments , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[18]  Harry L. Van Trees,et al.  Optimum Array Processing , 2002 .

[19]  Alessio Brutti,et al.  An environment aware ML estimation of acoustic radiation pattern with distributed microphone pairs , 2013, Signal Process..

[20]  Richard Szeliski,et al.  The lumigraph , 1996, SIGGRAPH.

[21]  Walter Kellermann,et al.  An Acoustic Human-Machine Front-End for Multimedia Applications , 2003, EURASIP J. Adv. Signal Process..

[22]  Pier Luigi Dragotti,et al.  Layer-based sparse representation of multiview images , 2012, EURASIP J. Adv. Signal Process..

[23]  Edward H. Adelson,et al.  Single Lens Stereo with a Plenoptic Camera , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[24]  Augusto Sarti,et al.  Plenacoustic Imaging in the Ray Space , 2012, IWAENC.

[25]  Augusto Sarti,et al.  Two-Dimensional Beam Tracing from Visibility Diagrams for Real-Time Acoustic Rendering , 2010, EURASIP J. Adv. Signal Process..

[26]  E. Adelson,et al.  The Plenoptic Function and the Elements of Early Vision , 1991 .

[27]  Augusto Sarti,et al.  Fast Tracing of Acoustic Beams and Paths Through Visibility Lookup , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[28]  Vítor H. Nascimento,et al.  Fast Transforms for Acoustic Imaging—Part II: Applications , 2011, IEEE Transactions on Image Processing.

[29]  Bin Yang,et al.  Disambiguation of TDOA Estimation for Multiple Sources in Reverberant Environments , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[30]  Boaz Rafaely,et al.  Near-Field Spherical Microphone Array Processing With Radial Filtering , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[31]  Augusto Sarti,et al.  From direction of arrival estimates to localization of planar reflectors in a two dimensional geometry , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[32]  Augusto Sarti,et al.  Reflection Coefficient Estimation by Pseudospectrum Matching , 2012, IWAENC.

[33]  Petre Stoica,et al.  Spectral Analysis of Signals , 2009 .

[34]  Augusto Sarti,et al.  Visibility-based beam tracing for soundfield rendering , 2010, 2010 IEEE International Workshop on Multimedia Signal Processing.