Do We Need Individual Head-Related Transfer Functions for Vertical Localization? The Case Study of a Spectral Notch Distance Metric

This paper addresses the individualization of head-related transfer function (HRTF) rendering for auditory elevation perception. Is it possible to find a nonindividual, personalized HRTF set that allows a listener to localize as accurately as with their own individual HRTFs? We propose a psychoacoustically motivated, anthropometry-based mismatch function between HRTF pairs that exploits the close relation between the listener's pinna geometry and localization cues. The function is evaluated using an auditory model that computes a mapping between HRTF spectra and perceived spatial locations. Results on a large number of subjects in the Center for Image Processing and Integrated Computing (CIPIC) and Acoustics Research Institute (ARI) HRTF databases suggest that there exists a nonindividual HRTF set that allows a listener to achieve vertical localization as accurate as with individual HRTFs. Furthermore, we find the optimal parameterization of the proposed mismatch function, i.e., the one that best reflects the information given by the auditory model. Our findings show that the selection procedure yields statistically significant improvements over dummy-head HRTFs and random HRTF selection, with potentially high impact for practical applications.
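The selection idea described above can be illustrated with a minimal sketch. It is not the paper's actual mismatch function, whose exact form and parameterization are not given in the abstract. The sketch assumes that the listener's expected notch center frequencies (for example, estimated from pinna geometry) and each candidate's HRTF notch frequencies are already available as arrays of shape (tracks, elevations) in Hz. The names `notch_mismatch` and `select_best`, the weighted mean relative difference, and the default equal weights are all hypothetical choices made for illustration.

```python
import numpy as np


def notch_mismatch(listener_notches, candidate_notches, weights=None):
    """Weighted mean relative difference between notch-frequency tracks.

    Both inputs have shape (n_tracks, n_elevations), in Hz.
    NaN marks an elevation where a notch is missing.
    This is an illustrative metric, not the paper's definition.
    """
    listener = np.asarray(listener_notches, dtype=float)
    candidate = np.asarray(candidate_notches, dtype=float)
    n_tracks = listener.shape[0]
    # Assumption: equal weights per notch track unless specified.
    w = np.ones(n_tracks) if weights is None else np.asarray(weights, dtype=float)

    # Relative deviation of each candidate notch from the listener's notch.
    rel = np.abs(listener - candidate) / listener

    total, weight_sum = 0.0, 0.0
    for i in range(n_tracks):
        valid = ~np.isnan(rel[i])
        if w[i] > 0 and valid.any():
            total += w[i] * rel[i, valid].mean()
            weight_sum += w[i]
    # No comparable notch at all: treat as maximally mismatched.
    return total / weight_sum if weight_sum > 0 else float("inf")


def select_best(listener_notches, database, weights=None):
    """Return the database key with the lowest mismatch, plus all scores."""
    scores = {
        subject_id: notch_mismatch(listener_notches, notches, weights)
        for subject_id, notches in database.items()
    }
    return min(scores, key=scores.get), scores


if __name__ == "__main__":
    # Made-up notch frequencies for one track at three elevations.
    listener = [[6000.0, 7000.0, 8000.0]]
    database = {
        "A": [[6600.0, 7700.0, 8800.0]],
        "B": [[6060.0, 7070.0, 8080.0]],
    }
    best, scores = select_best(listener, database)
    print(best, scores)
```

Using a relative rather than absolute frequency difference is one plausible design choice, since notch frequencies span several kilohertz; the weights stand in for whatever parameterization would be tuned against the auditory model.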
