Binaural Source Localization and Spatial Audio Reproduction for Telepresence Applications

Telepresence is generally described as the feeling of being immersed in a remote environment, be it virtual or real. A multimodal telepresence environment, equipped with modalities such as vision, audition, and haptic, improves immersion and augments the overall perceptual presence. The present work focuses on acoustic telepresence at both the teleoperator and operator sites. On the teleoperator side, we build a novel binaural sound source localizer using generic Head Related Transfer Functions (HRTFs). This new localizer provides estimates for the direction of a single sound source given in terms of azimuth and elevation angles in free space by using only two microphones. It also uses an algorithm that is efficient compared to the currently known algorithms used in similar localization processes. On the operator side, the paper addresses the problem of spatially interpolating HRTFs for densely sampled high-fidelity 3D sound synthesis. In our telepresence application scenario the synthesized 3D sound is presented to the operator over headphones and shall achieve a high-fidelity acoustic immersion. Using measured HRTF data, we create interpolated HRTFs between the existing functions using a matrix-valued interpolation function. The comparison with existing interpolation methods reveals that our new method offers superior performance and is capable of achieving high-fidelity reconstructions of HRTFs.

[1]  S Handel Space is to time as vision is to audition: seductive but misleading. , 1988, Journal of experimental psychology. Human perception and performance.

[2]  S. Wermter,et al.  A recurrent neural network for sound-source motion tracking and prediction , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..

[3]  Mikio Toyama,et al.  Effect of Arrival Time Correction on the Accuracy of Binaural Impulse Response Interpolation--Interpolation Methods of Binaural Response , 2004 .

[4]  Parham Aarabi,et al.  Theory and design of multirate sensor arrays , 2005, IEEE Transactions on Signal Processing.

[5]  Athanasios C. Antoulas,et al.  On the Scalar Rational Interpolation Problem , 1986 .

[6]  I. Kale,et al.  Low-order modeling of head-related transfer functions using balanced model truncation , 1997, IEEE Signal Processing Letters.

[7]  F. Wightman,et al.  A model of head-related transfer functions based on principal components analysis and minimum-phase reconstruction. , 1992, The Journal of the Acoustical Society of America.

[8]  Amir A. Handzel High acuity sound-source localization by means of a triangular spherical array , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[9]  Klaus Diepold,et al.  A New Method for Binaural 3-D Localization Based on Hrtfs , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[10]  Toshiharu Mukai,et al.  3D sound source localization system based on learning of binaural hearing , 2005, 2005 IEEE International Conference on Systems, Man and Cybernetics.

[11]  J. Blauert Spatial Hearing: The Psychophysics of Human Sound Localization , 1983 .

[12]  Henrik Møller Fundamentals of binaural technology , 1991 .

[13]  A. A. Handzel,et al.  Biomimetic sound-source localization , 2002 .

[14]  Parham Aarabi,et al.  Acoustic robot navigation using distributed microphone arrays , 2004, Inf. Fusion.

[15]  G.D. Cain,et al.  Approximation of FIR by IIR digital filters: an algorithm based on balanced model reduction , 1992, IEEE Trans. Signal Process..

[16]  S. Robeson Spherical Methods for Spatial Interpolation: Review and Evaluation , 1997 .

[17]  Durand R. Begault,et al.  3-D Sound for Virtual Reality and Multimedia Cambridge , 1994 .

[18]  Martin Vetterli,et al.  Plenacoustic function on the circle with application to HRTF interpolation , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[19]  Flemming Christensen,et al.  Directional resolution of head-related transfer functions required in binaural synthesis , 2005 .

[20]  W. G. Gardner,et al.  HRTF measurements of a KEMAR , 1995 .

[21]  R. Duraiswami,et al.  Plane-wave decomposition analysis for spherical microphone arrays , 2005, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2005..

[22]  Yong Rui,et al.  New direct approaches to robust sound source localization , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[23]  Timothy K. Horiuchi "Seeing" in the dark: neuromorphic VLSI modeling of bat echolocation , 2005 .

[24]  Brian D. O. Anderson,et al.  Rational interpolation and state variable realizations , 1990, 29th IEEE Conference on Decision and Control.