Creation of virtual auditory spaces

High-quality virtual audio scene rendering is a must for emerging virtual/augmented reality applications and for perceptual user interfaces. We describe algorithms for creation of virtual auditory spaces using measured and non-individualized HRTFs and head tracking. Details of algorithms for HRTF interpolation, room impulse response creation, and audio scene presentation are presented. Tests show that individuals externalize well, and find our interface natural. The system runs in real time with latency of less than 30 ms on an office PC without specialized DSP.

[1]  Larry S. Davis,et al.  Efficient evaluation of reverberant sound fields , 2001, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575).

[2]  Ramani Duraiswami,et al.  Creating Virtual Spatial Audio Via Scientific Computing and Computer Vision , 2000 .

[3]  L. Rayleigh,et al.  XII. On our perception of sound direction , 1907 .

[4]  B. Shinn-Cunningham DISTANCE CUES FOR VIRTUAL AUDITORY SPACE , 2000 .

[5]  Jean-Marc Jot,et al.  Real-time spatial processing of sounds for music, multimedia and interactive human-computer interfaces , 1999, Multimedia Systems.

[6]  William G. Gardner,et al.  Efficient Convolution without Input/Output Delay , 1995 .

[7]  R Meddis,et al.  A physical model of sound diffraction and reflections in the human concha. , 1996, The Journal of the Acoustical Society of America.

[8]  W. Hartmann,et al.  Localization of sound in rooms, II: The effects of a single reflecting surface. , 1985, The Journal of the Acoustical Society of America.

[9]  Tomlinson Holman,et al.  Surrounded by sound , 1999 .

[10]  J. Borish Extension of the image model to arbitrary polyhedra , 1984 .

[11]  Michael A. Casey,et al.  Vision-Steered Beam Forming and Transaural Rendering for the Artificial Life Interactive Video Environment (ALIVE) , 1995 .

[12]  V. Ralph Algazi,et al.  Estimation of a Spherical-Head Model from Anthropometry , 2001 .

[13]  Chris Kyriakakis Fundamental and technological limitations of immersive audio systems , 1998 .

[14]  J. Hebrank,et al.  Pinna reflections as cues for localization. , 1974, The Journal of the Acoustical Society of America.

[15]  Richard O. Duda,et al.  Modeling head related transfer functions , 1993, Proceedings of 27th Asilomar Conference on Signals, Systems and Computers.

[16]  Jont B. Allen,et al.  Image method for efficiently simulating small‐room acoustics , 1976 .

[17]  Richard O. Duda,et al.  A structural model for binaural sound synthesis , 1998, IEEE Trans. Speech Audio Process..