HRTF selection for binaural synthesis from a database using morphological parameters

Auditory virtual environments are becoming increasingly relevant for applications such as teleconferencing, hearing aids, video games, and general immersive listening. To enable high fidelity renderings of the sound scene in such environments, the audio content must be treated with the actual listener's acoustical filters, the so-called head-related transfer functions (HRTFs). The current challenge for general public applications, given the difficulty of measuring HRTFs for a given listener, is to be able to individually generate HRTFs or perform a selection from a database of pre-existing HRTFs, so as to provide the listener an HRTF that enables a listening experience that is as realistic as possible, using for example only data taken from a photo of the listener's ear. A process is described in which a database of 46 measured HRTFs was analysed using various data reduction techniques such as principal component analysis and frequency scaling. A selection of the subjects' most significant morphological parameters was performed using data mining techniques such as support vector machines. This subset of morphological parameters for subjects associated to the HRTF database were then used to perform multiple linear regressions against the reduced dataset of HRTFs in order to predict what might be the listener's preferred HRTFs. The prediction performance was then compared to the results of a perceptual evaluation of the HRTFs from the database using a listening test. The results show that the proposed process was able to predict preferred HRTFs for a listener significantly better than if the HRTFs were chosen at random. The results from the listening test were also used to explore a perceptually relevant frequency range of the HRTF.

[1]  F. Wightman,et al.  A model of head-related transfer functions based on principal components analysis and minimum-phase reconstruction. , 1992, The Journal of the Acoustical Society of America.

[2]  B F Katz,et al.  Boundary element method calculation of individual head-related transfer function. I. Rigid model calculation. , 2001, The Journal of the Acoustical Society of America.

[3]  D. M. Green,et al.  Directional dependence of interaural envelope delays. , 1990, The Journal of the Acoustical Society of America.

[4]  Dorte Hammershøi,et al.  Binaural Technique: Do We Need Individual Recordings? , 1996 .

[5]  Philip A. Nelson,et al.  Boundary element simulations of the transfer function of human heads and baffled pinnae using accurate geometric models , 2007 .

[6]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[7]  Doris J. Kistler,et al.  Measurement and validation of human HRTFs for use in hearing research , 2005 .

[8]  Kazuya Takeda,et al.  Estimation of HRTFs on the horizontal plane using physical features , 2007 .

[9]  P. Guillon Individualisation des indices spectraux pour la synthèse binaurale : recherche et exploitation des similarités inter-individuelles pour l’adaptation ou la reconstruction de HRTF , 2009 .

[10]  Ramani Duraiswami,et al.  Fast head-related transfer function measurement via reciprocity. , 2006, The Journal of the Acoustical Society of America.

[11]  F L Wightman,et al.  Headphone simulation of free-field listening. I: Stimulus synthesis. , 1989, The Journal of the Acoustical Society of America.

[12]  S. Carlile,et al.  Measuring the human head-related transfer functions: a novel method for the construction and calibration of a miniature "in-ear" recording system. , 1994, The Journal of the Acoustical Society of America.

[13]  Yukio Iwaya Individualization of head-related transfer functions with tournament-style listening test: Listening with other's ears , 2006 .

[14]  Gavriel Salvendy,et al.  Improved method to individualize head-related transfer function using anthropometric measurements , 2008 .

[15]  F L Wightman,et al.  Localization using nonindividualized head-related transfer functions. , 1993, The Journal of the Acoustical Society of America.

[16]  R. Brereton,et al.  Support vector machines for classification and regression. , 2010, The Analyst.

[17]  C. Avendano,et al.  The CIPIC HRTF database , 2001, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575).

[18]  J. C. Middlebrooks,et al.  Individual differences in external-ear transfer functions reduced by scaling in frequency. , 1999, The Journal of the Acoustical Society of America.

[19]  M. Gardner,et al.  Problem of localization in the median plane: effect of pinnae cavity occlusion. , 1973, The Journal of the Acoustical Society of America.

[20]  William L. Martens,et al.  PERCEIVED NATURALNESS OF SPEECH SOUNDS PRESENTED USING PERSONALIZED VERSUS NON-PERSONALIZED HRTFS , 2007 .

[21]  Larry S. Davis,et al.  Rendering localized spatial audio in a virtual auditory space , 2004, IEEE Transactions on Multimedia.

[22]  William L. Martens,et al.  Principal Components Analysis and Resynthesis of Spectral Cues to Perceived Direction , 1987, ICMC.

[23]  S. Carlile,et al.  Enabling Individualized Virtual Auditory Space using Morphological Measurments , 2000 .

[24]  J. C. Middlebrooks Virtual localization improved by scaling nonindividualized external-ear transfer functions in frequency. , 1999, The Journal of the Acoustical Society of America.

[25]  Bernhard U. Seeber,et al.  Subjective selection of non-individual head-related transfer functions , 2003 .

[26]  Frederic L. Wightman,et al.  Multidimensional scaling analysis of head-related transfer functions , 1993, Proceedings of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.