论文信息 - Soundfield Imaging in the Ray Space

Soundfield Imaging in the Ray Space

In this work we propose a general approach to acoustic scene analysis based on a novel data structure (ray-space image) that encodes the directional plenacoustic function over a line segment (Observation Window, OW). We define and describe a system for acquiring a ray-space image using a microphone array and refer to it as ray-space (or “soundfield”) camera. The method consists of acquiring the pseudo-spectra corresponding to a grid of sampling points over the OW, and remapping them onto the ray space, which parameterizes acoustic paths crossing the OW. The resulting ray-space image displays the information gathered by the sensors in such a way that the elements of the acoustic scene (sources and reflectors) will be easy to discern, recognize and extract. The key advantage of this method is that ray-space images, irrespective of the application, are generated by a common (and highly parallelizable) processing layer, and can be processed using methods coming from the extensive literature of pattern analysis. After defining the ideal ray-space image in terms of the directional plenacoustic function, we show how to acquire it using a microphone array. We also discuss resolution and aliasing issues and show two simple examples of applications of ray-space imaging.

[1] Parhi,et al. Wideband DOA Estimation Algorithms for Multiple Moving Sources using Unattended Acoustic Sensors , 2008 .

[2] Richard O. Duda,et al. Use of the Hough transformation to detect lines and curves in pictures , 1972, CACM.

[3] Marc Moonen,et al. Joint DOA and multi-pitch estimation based on subspace techniques , 2012, EURASIP J. Adv. Signal Process..

[4] Walter Kellermann,et al. Localization of distinct reflections in rooms using spherical microphone array eigenbeam processing. , 2012, The Journal of the Acoustical Society of America.

[5] Tsuhan Chen,et al. A survey on image-based rendering - representation, sampling and compression , 2004, Signal Process. Image Commun..

[6] Terence Betlehem,et al. Theory and design of sound field reproduction in reverberant rooms. , 2005, The Journal of the Acoustical Society of America.

[7] Marc Levoy,et al. Light field rendering , 1996, SIGGRAPH.

[8] Martin Vetterli,et al. The plenacoustic function, sampling and reconstruction , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[9] T. Ajdler,et al. The Plenacoustic Function and Its Sampling , 2006, IEEE Transactions on Signal Processing.

[10] Augusto Sarti,et al. A room-compensated virtual surround system exploiting early reflections in a reverberant room , 2012, 2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO).

[11] Flavio P. Ribeiro,et al. Fast Transforms for Acoustic Imaging— Part I: Theory , 2011, IEEE Transactions on Image Processing.

[12] F. Alton Everest,et al. Master handbook of acoustics , 1981 .

[13] Tapio Lokki,et al. Augmented reality audio for location-based games , 2009 .

[14] Cha Zhang,et al. Using Reverberation to Improve Range and Elevation Discrimination for Small Array Sound Source Localization , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[15] Walter Kellermann,et al. TDOA Estimation for Multiple Sound Sources in Noisy and Reverberant Environments Using Broadband Independent Component Analysis , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[16] Emanuel A. P. Habets,et al. On the Noise Reduction Performance of a Spherical Harmonic Domain Tradeoff Beamformer , 2012, IEEE Signal Processing Letters.

[17] Emanuel A. P. Habets,et al. Joint Dereverberation and Residual Echo Suppression of Speech Signals in Noisy Environments , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[18] Harry L. Van Trees,et al. Optimum Array Processing , 2002 .

[19] Alessio Brutti,et al. An environment aware ML estimation of acoustic radiation pattern with distributed microphone pairs , 2013, Signal Process..

[20] Richard Szeliski,et al. The lumigraph , 1996, SIGGRAPH.

[21] Walter Kellermann,et al. An Acoustic Human-Machine Front-End for Multimedia Applications , 2003, EURASIP J. Adv. Signal Process..

[22] Pier Luigi Dragotti,et al. Layer-based sparse representation of multiview images , 2012, EURASIP J. Adv. Signal Process..

[23] Edward H. Adelson,et al. Single Lens Stereo with a Plenoptic Camera , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[24] Augusto Sarti,et al. Plenacoustic Imaging in the Ray Space , 2012, IWAENC.

[25] Augusto Sarti,et al. Two-Dimensional Beam Tracing from Visibility Diagrams for Real-Time Acoustic Rendering , 2010, EURASIP J. Adv. Signal Process..

[26] E. Adelson,et al. The Plenoptic Function and the Elements of Early Vision , 1991 .

[27] Augusto Sarti,et al. Fast Tracing of Acoustic Beams and Paths Through Visibility Lookup , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[28] Vítor H. Nascimento,et al. Fast Transforms for Acoustic Imaging—Part II: Applications , 2011, IEEE Transactions on Image Processing.

[29] Bin Yang,et al. Disambiguation of TDOA Estimation for Multiple Sources in Reverberant Environments , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[30] Boaz Rafaely,et al. Near-Field Spherical Microphone Array Processing With Radial Filtering , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[31] Augusto Sarti,et al. From direction of arrival estimates to localization of planar reflectors in a two dimensional geometry , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[32] Augusto Sarti,et al. Reflection Coefficient Estimation by Pseudospectrum Matching , 2012, IWAENC.

[33] Petre Stoica,et al. Spectral Analysis of Signals , 2009 .

[34] Augusto Sarti,et al. Visibility-based beam tracing for soundfield rendering , 2010, 2010 IEEE International Workshop on Multimedia Signal Processing.