Individualized HRTFs from few measurements: a statistical learning approach

Virtual auditory space (VAS) refers to the synthesis and simulation of spatial hearing using earphones and/or a speaker system. High-fidelity VAS requires the use of individualized head-related transfer functions (HRTFs) which describe the acoustic filtering properties of the listener's external auditory periphery. HRTFs serve the increasingly dominant role of implementation 3-D audio systems, which have been realized in some commercial applications. However, the cost of a 3-D audio system cannot be brought down because the efficiency of computation, the size of memory, and the synthesis of unmeasured HRTFs remain to be made better. Because HRTFs are unique for each user depending on his morphology, the economically realist synthesis of individualized HRTFs has to rely on some measurements. This paper presents a way to reduce the cost of a 3-D audio system using a statistical modeling which allows to use only few measurements for each user.

[1]  Gary S. Kendall,et al.  A 3-D Sound Primer: Directional Hearing and Stereo Reproduction , 1995 .

[2]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[3]  F L Wightman,et al.  Headphone simulation of free-field listening. I: Stimulus synthesis. , 1989, The Journal of the Acoustical Society of America.

[4]  Heinrich Hertz,et al.  On the differences between localization and lateralization. , 1974, The Journal of the Acoustical Society of America.

[5]  Simon Carlile,et al.  Methods for spherical data analysis and visualization , 1998, Journal of Neuroscience Methods.

[6]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[7]  Véronique Larcher,et al.  Techniques de spatialisation des sons pour la réalité virtuelle , 2001 .

[8]  B. Moore,et al.  A revision of Zwicker's loudness model , 1996 .

[9]  Vincent Lemaire,et al.  SOM-Based Clustering for On-Line Fraud Behavior Classification: A Case study , 2002, FSKD.

[10]  Anthony N. Burkitt,et al.  Refined Pruning Techniques for Feed-forward Neural Networks , 1992, Complex Syst..

[11]  S. Carlile,et al.  Enabling Individualized Virtual Auditory Space using Morphological Measurments , 2000 .

[12]  F L Wightman,et al.  Localization using nonindividualized head-related transfer functions. , 1993, The Journal of the Acoustical Society of America.

[13]  F L Wightman,et al.  Headphone simulation of free-field listening. II: Psychophysical validation. , 1989, The Journal of the Acoustical Society of America.

[14]  M. V. Velzen,et al.  Self-organizing maps , 2007 .

[15]  Rick L. Jenison,et al.  A Spherical Basis Function Neural Network for Modeling Auditory Space , 1996, Neural Computation.

[16]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[17]  C. Avendano,et al.  The CIPIC HRTF database , 2001, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575).

[18]  F. Wightman,et al.  A model of head-related transfer functions based on principal components analysis and minimum-phase reconstruction. , 1992, The Journal of the Acoustical Society of America.

[19]  Mendel Kleiner,et al.  Auditory Presence, Individualized Head-Related Transfer Functions, and Illusory Ego-Motion in Virtual Environments , 2004 .