Interpolation of Head-Related Transfer Functions

Abstract : Using current techniques it is usually impractical to measure head-related transfer functions (HRTFs) at a spatial resolution that does not exceed the minimum audible angle, i.e. 1-2 degrees for a source directly in front, by a considerable amount. As a result, measured HRTFs must be interpolated to generate a display in which auditory space is rendered smoothly. The spatial resolution at which it is necessary to measure HRTFs for the display to be of high spatial fidelity will depend on the quality of the interpolation technique. This report describes an interpolation technique that involves the application of a novel, inverse-distance-weighted averaging algorithm to HRTFs represented in the frequency domain. The quality of this technique was evaluated by comparing four listeners' abilities to localize virtual sound sources generated using measured or interpolated HRTFs. The measured HRTFs were shown to be of sufficiently high fidelity to allow virtual sources to be localized as accurately as real sources. Localization error measures, i.e. lateral errors, polar errors and proportions of front/back confusions, for HRTFs interpolated across up to 30 degrees of either lateral or polar angle, or 20 degrees of both lateral and polar angle, did not differ noticeably from those for measured HRTFs. On the basis of this finding we recommend that HRTFs be measured at a 20 degrees lateral- and polar-angle resolution.

[1]  R L McKinley,et al.  3-dimensional auditory displays: development, applications, and performance. , 1994, Aviation, space, and environmental medicine.

[2]  D. M. Green,et al.  Characterization of external ear impulse responses using Golay codes. , 1992, The Journal of the Acoustical Society of America.

[3]  Kai Crispien,et al.  Evaluation of the cocktail-party effect for multiple speech stimuli within a spatial auditory display , 1995 .

[4]  Elizabeth M. Wenzel,et al.  Perceptual consequences of interpolating head-related transfer functions during spatial synthesis , 1993, Proceedings of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[5]  A. Bronkhorst,et al.  Multichannel speech intelligibility and talker recognition using monaural, binaural, and three-dimensional auditory presentation. , 2000, The Journal of the Acoustical Society of America.

[6]  Karen Stephan,et al.  Effects of Supplementing Head-Down Displays With 3-D Audio During Visual Target Acquisition , 2004 .

[7]  D R Begault,et al.  Three-dimensional audio versus head-down traffic alert and collision avoidance system displays. , 1996, The International journal of aviation psychology.

[8]  Irwin Pollack,et al.  Cocktail Party Effect , 1957 .

[9]  D R Begault,et al.  Head-Up Auditory Displays for Traffic Collision Avoidance System Advisories: A Preliminary Investigation , 1993, Human factors.

[10]  A. Bronkhorst Localization of real and virtual sound sources , 1995 .

[11]  Richard L. McKinley,et al.  Aurally Aided Visual Search under Virtual and Free-Field Listening Conditions , 1996, Hum. Factors.

[12]  Mark A. Ericson,et al.  Factors That Influence Intelligibility in Multitalker Speech Displays , 2004 .

[13]  B. V. Van Veen,et al.  A spatial feature extraction and regularization model for the head-related transfer function. , 1995, The Journal of the Acoustical Society of America.

[14]  William L. Martens,et al.  Principal Components Analysis and Resynthesis of Spectral Cues to Perceived Direction , 1987, ICMC.

[15]  Russell L. Martin,et al.  Aurally and Visually Guided Visual Search in a Virtual Environment , 1998, Hum. Factors.

[16]  Russell L. Martin,et al.  Free-Field Equivalent Localization of Virtual Audio , 2001 .

[17]  F. K. Lam,et al.  A time domain binaural model based on spatial feature extraction for the head-related transfer function. , 1997, The Journal of the Acoustical Society of America.

[18]  F. Wightman,et al.  A model of head-related transfer functions based on principal components analysis and minimum-phase reconstruction. , 1992, The Journal of the Acoustical Society of America.

[19]  Michael Friis Sørensen,et al.  Head-Related Transfer Functions of Human Subjects , 1995 .

[20]  E. Langendijk,et al.  Fidelity of three-dimensional-sound reproduction using a virtual auditory display. , 2000, The Journal of the Acoustical Society of America.

[21]  G. L. Ricard,et al.  Intelligibility and Localization of Speech from Virtual Directions , 1994, Human factors.

[22]  Klaus Hartung,et al.  Comparison of Different Methods for the Interpolation of Head-Related Transfer Functions , 1999 .

[23]  Adelbert W. Bronkhorst,et al.  Application of a Three-Dimensional Auditory Display in a Flight Task , 1996, Hum. Factors.

[24]  D R Begault,et al.  Multichannel spatial auditory display for speech communications. , 1993, Journal of the Audio Engineering Society. Audio Engineering Society.

[25]  Robert S. Bolia,et al.  Aurally Aided Visual Search in Three-Dimensional Space , 1999, Hum. Factors.