Use of personalized binaural audio and interactive distance cues in an auditory goal-reaching task

While the angular spatialization of source sounds through individualized Head-related transfer functions (HRTFs) has been extensively investigated in auditory display research, also leading to effective real-time rendering of these functions, conversely the interactive simulation of egocentric distance information has received less attention. The latter, in fact, suffers from the lack of realtime rendering solutions also due to a too sparse literature on the perception of dynamic distance cues. By adding a virtual environment based on a Digital waveguide mesh (DWM) model simulating a small tubular shape to a binaural rendering system through selection techniques of HRTF, we have come up with an auditory display affording interactive selection of absolute 3D spatial cues of angular spatialization as well as egocentric distance. The tube metaphor in particular minimized loudness changes with distance, hence providing mainly direct-to-reverberant and spectral cues. A goal-reaching experiment assessed the proposed display: participants were asked to explore a virtual map with a pen tablet and reach a sound source (the target) using only auditory information; then, subjective time to reach and traveled distance were analyzed. Results suggest that participants achieved a first level of spatial knowledge, i.e., knowledge about a point in space, by performing comparably to when they relied on more robust, although relative, loudness cues. Further work is needed to add fully physical consistency to the proposed auditory display.

[1]  Pavel Zahorik,et al.  Direct-to-reverberant energy ratio sensitivity. , 2002, The Journal of the Acoustical Society of America.

[2]  Michele Geronazzo,et al.  Personalization support for binaural headphone reproduction in web browsers , 2015 .

[3]  Charlotte Magnusson,et al.  Non Visual Haptic Audio Tools for Virtual Environments , 2006, HAID.

[4]  Thomas Baer,et al.  A model for the prediction of thresholds, loudness, and partial loudness , 1997 .

[5]  Jack M. Loomis,et al.  Auditory distance perception by translating observers , 1993, Proceedings of 1993 IEEE Research Properties in Virtual Reality Symposium.

[6]  Michele Geronazzo Mixed Structural Models for 3D Audio in Virtual Environments , 2014 .

[7]  Simone Spagnol,et al.  Enhancing vertical localization with image-guided selection of non-individual head-related transfer functions , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[8]  Heidi Christensen,et al.  Active binaural distance estimation for dynamic sources , 2007, INTERSPEECH.

[9]  Davide Rocchesso,et al.  Linearizing Auditory Distance Estimates by Means of Virtual Acoustics , 2008 .

[10]  John G. Neuhoff,et al.  An Adaptive Bias in the Perception of Looming Auditory Motion , 2001 .

[11]  Pavel Zahorik,et al.  Assessing auditory distance perception using virtual acoustics. , 2002, The Journal of the Acoustical Society of America.

[12]  Vesa Välimäki,et al.  A Modified Rectangular Waveguide Mesh Structure with Interpolated Input and Output Points , 2001, ICMC.

[13]  Gaëtan Parseihian,et al.  Perceptually based head-related transfer function database optimization. , 2012, The Journal of the Acoustical Society of America.

[14]  Dorte Hammershøi,et al.  Binaural Technique: Do We Need Individual Recordings? , 1996 .

[15]  Christoph Hölscher,et al.  Taxonomy of Human Wayfinding Tasks: A Knowledge-Based Approach , 2009, Spatial Cogn. Comput..

[16]  Federico Avanzini,et al.  Multimodal Exploration of Virtual Objects with a Spatialized Anchor Sound , 2014 .

[17]  C. Avendano,et al.  The CIPIC HRTF database , 2001, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575).

[18]  Tammo Houtgast,et al.  Auditory distance perception in rooms , 1999, Nature.

[19]  Pavel Zahorik,et al.  AUDITORY DISPLAY OF SOUND SOURCE DISTANCE , 2002 .

[20]  D. R. Campbell,et al.  A MATLAB Simulation of “ Shoebox ” Room Acoustics for use in Research and Teaching , 2022 .

[21]  Davide Rocchesso,et al.  A Physics-Based Approach to the Presentation of Acoustic Depth , 2003 .

[22]  Davide Rocchesso,et al.  Auditory distance perception in an acoustic pipe , 2008, TAP.

[23]  Kazuhiro Iida,et al.  Personalization of head-related transfer functions in the median plane based on the anthropometry of the listener's pinnae. , 2014, The Journal of the Acoustical Society of America.

[24]  Barbara G. Shinn-Cunningham,et al.  Learning Reverberation: Considerations for Spatial Auditory Displays , 2000 .

[25]  W. G. Gardner,et al.  HRTF measurements of a KEMAR , 1995 .

[26]  A. Bronkhorst,et al.  Auditory distance perception in humans : A summary of past and present research , 2005 .

[27]  Matti Karjalainen,et al.  Modeling of reflections and air absorption in acoustical spaces a digital filter design approach , 1997, Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics.

[28]  Janina Fels,et al.  Perceptually Robust Headphone Equalization for Binaural Reproduction , 2011 .

[29]  Zoran Cvetkovic,et al.  Scattering Delay Network: An Interactive Reverberator for Computer Games , 2011 .

[30]  Michele Geronazzo,et al.  Interactive spatial sonification for non-visual exploration of virtual maps , 2016, Int. J. Hum. Comput. Stud..

[31]  Simon Conan,et al.  Sound Effect Metaphors for Near Field Distance Sonification , 2012 .

[32]  Simone Spagnol,et al.  On the Relation Between Pinna Reflection Patterns and Head-Related Transfer Function Features , 2013, IEEE Transactions on Audio, Speech, and Language Processing.