Usage of Spectral Distortion for Objective Evalua- tion of Personalized HRTF in the Median Plane

Measuring the head-related transfer functions (HRTFs) for each subject is a complex process. Therefore, it is necessary to develop procedures that allow the estimation of personalized HRTFs. It is common to estimate the weights of the principal component analysis (PCA) of a group of subjects based on some anthropometric parameters using multivariable regression modelling. Moreover, to objectively evaluate the goodness of fit between the original HRTFs and the personalized ones, the spectral distortion (SD) is usually used too. However, its suitability in the median plane, in which the spectral profiles are crucial to localize a sound source, has not yet been demonstrated. This paper analyses the validity of the SD as a measure of the quality of the HRTF personalization in the median plane, from the localization point of view. The HRTFs were modelled from the weights estimated by multiple linear regression and artificial neural networks (ANNs). The SD was used to compare the HRTFs measured with those estimated. Likewise, the level of fitting accuracy of characteristic resonance and notches in the median plane was also compared. Despite the fact that the SD scores of ANNs are lower than those of the multiple linear regression and are similar to those reported by other studies, the errors obtained from analysing both central frequencies and levels for resonance and notches could be discriminated.

[1]  V. Ralph Algazi,et al.  Physical and Filter Pinna Models Based on Anthropometry , 2007 .

[2]  F. Wightman,et al.  A model of head-related transfer functions based on principal components analysis and minimum-phase reconstruction. , 1992, The Journal of the Acoustical Society of America.

[3]  Mendel Kleiner,et al.  Auralization-An Overview , 1993 .

[4]  J. Hebrank,et al.  Spectral cues used in the localization of sound sources on the median plane. , 1974, The Journal of the Acoustical Society of America.

[5]  Gaël Mahé,et al.  Correction of the voice timbre distortions in telephone networks: method and evaluation , 2004, Speech Commun..

[6]  Gavriel Salvendy,et al.  Identification of Anthropometric Measurements for Individualization of Head-Related Transfer Functions , 2009 .

[7]  F L Wightman,et al.  Localization using nonindividualized head-related transfer functions. , 1993, The Journal of the Acoustical Society of America.

[8]  Barry J. Wythoff,et al.  Backpropagation neural networks , 1993 .

[9]  F. Itakura,et al.  Interpolating head related transfer functions in the median plane , 1999, Proceedings of the 1999 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. WASPAA'99 (Cat. No.99TH8452).

[10]  Youngjin Park,et al.  HRIR Customization in the Median Plane via Principal Components Analysis , 2007 .

[11]  Fabián C. Tommasini,et al.  Simplificación de las Funciones de Transferencia de Cabeza Mediante Análisis de Componentes Principales , 2008 .

[12]  Zhenyang Wu,et al.  HRTF personalization based on artificial neural network in individual virtual auditory space , 2008 .

[13]  C. Avendano,et al.  The CIPIC HRTF database , 2001, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575).

[15]  E. Shaw,et al.  External-ear acoustic models with simple geometry. , 1968, The Journal of the Acoustical Society of America.

[16]  Youn-sik Park,et al.  Modeling and Customization of Head-Related Impulse Responses Based on General Basis Functions in Time Domain , 2008 .

[17]  Simon R. Oldfield,et al.  Detection and discrimination of spectral peaks and notches at 1 and 8 kHz. , 1989, The Journal of the Acoustical Society of America.

[18]  Simone Spagnol,et al.  Fitting pinna-related transfer functions to anthropometry for binaural sound rendering , 2010, 2010 IEEE International Workshop on Multimedia Signal Processing.

[19]  Ramani Duraiswami,et al.  Extracting the frequencies of the pinna spectral notches in measured head related impulse responses. , 2004, The Journal of the Acoustical Society of America.

[20]  R. Duda,et al.  Modeling the Contralateral HRTF , 1999 .

[21]  F. Asano,et al.  Role of spectral cues in median plane localization. , 1990, The Journal of the Acoustical Society of America.

[22]  Simone Spagnol,et al.  On the Relation Between Pinna Reflection Patterns and Head-Related Transfer Function Features , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[23]  Michael Vorländer,et al.  Anthropometric Parameters Influencing Head-Related Transfer Functions , 2009 .

[24]  Zhenyang Wu,et al.  Head Related Transfer Function Personalization Based on Multiple Regression Analysis , 2006, 2006 International Conference on Computational Intelligence and Security.

[25]  R Meddis,et al.  A physical model of sound diffraction and reflections in the human concha. , 1996, The Journal of the Acoustical Society of America.

[26]  J. Blauert Spatial Hearing: The Psychophysics of Human Sound Localization , 1983 .

[27]  Andreas Silzle Selection and Tuning of HRTFs , 2002 .

[28]  Gavriel Salvendy,et al.  Individualization of Head-Related Transfer Function for Three-Dimensional Virtual Auditory Display: A Review , 2007, HCI.

[29]  E. Shaw,et al.  Sound pressure generated in an external-ear replica and real human ears by a nearby point source. , 1968, The Journal of the Acoustical Society of America.

[30]  Kazuhiro Iida,et al.  Median plane localization using a parametric model of the head-related transfer function based on spectral cues , 2007 .

[31]  Jonathon Shlens,et al.  A Tutorial on Principal Component Analysis , 2014, ArXiv.

[32]  Liang Chen,et al.  The Estimation of Personalized HRTFs in Individual VAS , 2008, 2008 Fourth International Conference on Natural Computation.