HRTF personalization based on artificial neural network in individual virtual auditory space

The synthesis of individual virtual auditory space (VAS) is an important and challenging task in virtual reality. A key requirement for individual VAS is a set of individual head-related transfer functions (HRTFs). A customization method based on a back-propagation (BP) artificial neural network (ANN) is proposed to obtain individual HRTFs without complex measurement. The inputs of the network are anthropometric parameters chosen by correlation analysis, and the outputs are the characteristic parameters of the HRTFs together with the interaural time difference (ITD). Objective simulation experiments and subjective sound localization experiments are carried out to evaluate the proposed method. The experiments show that the estimated HRTFs have a small mean square error and a perceptual effect similar to that of the corresponding HRTFs obtained from the database. Furthermore, localization accuracy with the personalized HRTFs is higher than with non-individual HRTFs.
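The pipeline described above (correlation-based selection of anthropometric inputs, then a BP network that maps them to HRTF parameters plus ITD) might be sketched as follows. This is only an illustration under stated assumptions, not the paper's implementation: the data are synthetic, and the number of subjects, candidate measurements, hidden units, output parameters, and the selection rule (mean absolute Pearson correlation, top k) are all hypothetical choices.

```python
import numpy as np

def select_by_correlation(A, Y, k):
    """Keep the k candidate inputs with the highest mean |Pearson r| to the outputs.
    The scoring rule is an assumption; the abstract only says 'correlation analysis'."""
    n_a = A.shape[1]
    corr = np.corrcoef(np.hstack([A, Y]), rowvar=False)
    score = np.abs(corr[:n_a, n_a:]).mean(axis=1)
    return np.sort(np.argsort(score)[-k:])

def train_bp_network(X, Y, hidden=8, lr=0.05, epochs=2000, seed=0):
    """Train a one-hidden-layer network with plain batch back-propagation."""
    rng = np.random.default_rng(seed)
    W1 = rng.normal(0.0, 0.5, (X.shape[1], hidden))
    b1 = np.zeros(hidden)
    W2 = rng.normal(0.0, 0.5, (hidden, Y.shape[1]))
    b2 = np.zeros(Y.shape[1])
    losses = []
    for _ in range(epochs):
        H = np.tanh(X @ W1 + b1)            # hidden-layer activations
        err = (H @ W2 + b2) - Y             # linear output layer minus target
        losses.append(float(np.mean(err ** 2)))
        g_out = 2.0 * err / err.size        # gradient of the MSE w.r.t. the outputs
        g_hid = (g_out @ W2.T) * (1.0 - H ** 2)  # back-propagate through tanh
        W2 -= lr * (H.T @ g_out)
        b2 -= lr * g_out.sum(axis=0)
        W1 -= lr * (X.T @ g_hid)
        b1 -= lr * g_hid.sum(axis=0)
    return (W1, b1, W2, b2), losses

def predict(params, X):
    W1, b1, W2, b2 = params
    return np.tanh(X @ W1 + b1) @ W2 + b2

# Synthetic stand-in data (hypothetical sizes): 40 subjects, 10 candidate
# anthropometric measurements, 4 targets (3 HRTF parameters + ITD).
rng = np.random.default_rng(1)
A = rng.normal(size=(40, 10))
M = rng.normal(size=(5, 4))
Y = np.tanh(A[:, :5] @ M) + 0.01 * rng.normal(size=(40, 4))

idx = select_by_correlation(A, Y, k=5)
X = A[:, idx]
X = (X - X.mean(axis=0)) / X.std(axis=0)    # standardize the selected inputs

params, losses = train_bp_network(X, Y)
pred = predict(params, X)
print(f"training MSE: {losses[0]:.3f} -> {losses[-1]:.3f}")
```

In a real setting the rows of `A` and `Y` would come from a measured database, and the error would be evaluated on held-out subjects rather than on the training set as done here.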
