Efficient merging of multiple audio streams for spatial sound reproduction in Directional Audio Coding

Directional Audio Coding (DirAC) is an efficient technique to capture and reproduce spatial sound. The analysis step outputs a mono DirAC stream, comprising an omnidirectional microphone pressure signal and side information, i.e., direction of arrival and diffuseness of the sound field expressed in time-frequency domain. This contribution proposes a method to merge two or more mono DirAC streams for a joint playback at the reproduction side. This problem arises in applications such as immersive spatial audio teleconferencing. With respect to a trivial direct merging, the proposed method is more efficient as it does not require the synthesis step. From this follows the benefit that the loudspeaker setup at the reproduction side does not have to be known in advance. Simulations and informal listening tests confirm the absence of any artifacts and that the proposed method is practically indistinguishable from the ideal merging.