Fitting pinna-related transfer functions to anthropometry for binaural sound rendering

This paper faces the general problem of modeling pinna-related transfer functions (PRTFs) for 3-D sound rendering. Following a structural approach, we aim at constructing a model for PRTF synthesis which allows to control separately the evolution of ear resonances and spectral notches through the design of two distinct filter blocks. Taking such model as endpoint, we propose a method based on the McAulay-Quatieri partial tracking algorithm to extract the frequencies of the most important spectral notches. Ray-tracing analysis performed on the so obtained tracks reveals a convincing correspondence between extracted frequencies and pinna geometry of a bunch of subjects.

[1]  Thomas F. Quatieri,et al.  Speech analysis/Synthesis based on a sinusoidal representation , 1986, IEEE Trans. Acoust. Speech Signal Process..

[2]  C. Avendano,et al.  The CIPIC HRTF database , 2001, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575).

[3]  Ramani Duraiswami,et al.  Extracting the frequencies of the pinna spectral notches in measured head related impulse responses. , 2004, The Journal of the Acoustical Society of America.

[4]  V. Ralph Algazi,et al.  The Use of Head-and-Torso Models for Improved Spatial Sound Synthesis , 2002 .

[5]  L. Rayleigh,et al.  XII. On our perception of sound direction , 1907 .

[6]  V. Ralph Algazi,et al.  Physical and Filter Pinna Models Based on Anthropometry , 2007 .

[7]  ARMANDO BARRETO,et al.  Time and Frequency Decomposition of Head-Related Impulse Responses for the Development of Customizable Spatial Audio Models , 2007 .

[8]  Simone Spagnol,et al.  Estimation and modeling of pinna-related transfer functions. , 2010 .

[9]  Matti Karjalainen,et al.  Frequency-Zooming ARMA Modeling for Analysis of Noisy String Instrument Tones , 2003, EURASIP J. Adv. Signal Process..

[10]  Gregory H. Wakefield,et al.  Efficient model fitting using a genetic algorithm: pole-zero approximations of HRTFs , 2002, IEEE Trans. Speech Audio Process..

[11]  Richard O. Duda,et al.  A structural model for binaural sound synthesis , 1998, IEEE Trans. Speech Audio Process..

[12]  Xavier Serra,et al.  Digital Audio Effects , 2011 .

[13]  A. J. Watkins,et al.  Psychoacoustical aspects of synthesized vertical locale cues. , 1978, The Journal of the Acoustical Society of America.

[14]  F. Wightman,et al.  A model of head-related transfer functions based on principal components analysis and minimum-phase reconstruction. , 1992, The Journal of the Acoustical Society of America.

[15]  Armando Barreto,et al.  Modeling Head-Related Transfer Functions Based on Pinna Anthropometry , 2004 .