Paper information - Efficient merging of multiple audio streams for spatial sound reproduction in Directional Audio Coding

Directional Audio Coding (DirAC) is an efficient technique for capturing and reproducing spatial sound. The analysis step outputs a mono DirAC stream, comprising an omnidirectional microphone pressure signal and side information, i.e., the direction of arrival and the diffuseness of the sound field, expressed in the time-frequency domain. This contribution proposes a method to merge two or more mono DirAC streams for joint playback at the reproduction side. This problem arises in applications such as immersive spatial audio teleconferencing. Compared with a trivial direct merging, the proposed method is more efficient because it does not require the synthesis step. A further benefit follows: the loudspeaker setup at the reproduction side does not have to be known in advance. Simulations and informal listening tests confirm the absence of artifacts and show that the proposed method is practically indistinguishable from the ideal merging.
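To make the notion of a mono DirAC stream concrete, the following minimal sketch represents one time-frequency tile as a pressure value, a direction of arrival, and a diffuseness, and merges tiles from several streams without any loudspeaker synthesis. The container `DiracTile`, the function `merge_tiles`, and the energy-weighted merging rule are assumptions made for illustration only; the abstract does not specify the paper's actual merging formulas, and this heuristic should not be read as the authors' method.

```python
import math
from dataclasses import dataclass


@dataclass
class DiracTile:
    """One time-frequency tile of a mono DirAC stream (hypothetical container)."""
    pressure: complex   # omnidirectional pressure signal for this tile
    doa: tuple          # unit vector (x, y, z) toward the direction of arrival
    diffuseness: float  # 0.0 = fully directional, 1.0 = fully diffuse


def merge_tiles(tiles):
    """Merge co-located tiles from several streams in the parameter domain.

    Assumed heuristic, not taken from the paper: pressures are summed, and
    each stream contributes its direction weighted by its non-diffuse energy.
    """
    pressure = sum((t.pressure for t in tiles), 0j)
    total_energy = sum(abs(t.pressure) ** 2 for t in tiles)

    # Energy-weighted sum of the directional parts of all streams.
    vec = [0.0, 0.0, 0.0]
    for t in tiles:
        weight = (1.0 - t.diffuseness) * abs(t.pressure) ** 2
        for i in range(3):
            vec[i] += weight * t.doa[i]
    norm = math.sqrt(sum(v * v for v in vec))

    # No net directional energy: report a fully diffuse tile with a
    # placeholder direction.
    if total_energy == 0.0 or norm == 0.0:
        return DiracTile(pressure, (1.0, 0.0, 0.0), 1.0)

    doa = tuple(v / norm for v in vec)
    diffuseness = min(1.0, max(0.0, 1.0 - norm / total_energy))
    return DiracTile(pressure, doa, diffuseness)


# Usage: two talkers arriving from the same direction, one partly diffuse.
a = DiracTile(1 + 0j, (1.0, 0.0, 0.0), 0.0)
b = DiracTile(0 + 1j, (1.0, 0.0, 0.0), 0.5)
merged = merge_tiles([a, b])
```

Under this assumed rule, merging a single stream returns its own parameters unchanged, and two equally strong, fully directional streams from opposite directions yield a fully diffuse tile. The output is again a mono DirAC stream, so the loudspeaker layout only matters at the final synthesis stage.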
