Mixed structural modeling of head-related transfer functions for customized binaural audio delivery

A novel approach to the modeling of head-related transfer functions (HRTFs) for binaural audio rendering is formalized and described in this paper. Mixed structural modeling (MSM) can be seen as the generalization and extension of the structural modeling approach first defined by Brown and Duda back in 1998. Possible solutions for building partial HRTFs (pHRTFs) of the head, torso, and pinna of a specific listener are first described and then used in the construction of two possible mixed structural models of a KEMAR mannequin. Thanks to the flexibility of the MSM approach, an exponential number of solutions for building custom binaural audio displays can be considered and evaluated, the final aim of the process being the achievement of a HRTF model fully customizable by the listener.

[1]  Gregory H. Wakefield,et al.  Efficient model fitting using a genetic algorithm: pole-zero approximations of HRTFs , 2002, IEEE Trans. Speech Audio Process..

[2]  F. Wightman,et al.  A model of head-related transfer functions based on principal components analysis and minimum-phase reconstruction. , 1992, The Journal of the Acoustical Society of America.

[3]  Sjsu ScholarWorks,et al.  Characterizing elevation effects of a prolate spheroidal HRTF model , 2014 .

[4]  R. Duda,et al.  Approximating the head-related transfer function using simple geometric models of the head and torso. , 2002, The Journal of the Acoustical Society of America.

[5]  John William Strutt Scientific Papers: On the Acoustic Shadow of a Sphere , 2009 .

[6]  E. Shaw,et al.  Sound pressure generated in an external-ear replica and real human ears by a nearby point source. , 1968, The Journal of the Acoustical Society of America.

[7]  W. M. Rabinowitz,et al.  Auditory localization of nearby sources. Head-related transfer functions. , 1999, The Journal of the Acoustical Society of America.

[8]  Richard O. Duda,et al.  A structural model for binaural sound synthesis , 1998, IEEE Trans. Speech Audio Process..

[9]  Youn-sik Park,et al.  APROXIMATION OF HEAD RELATED TRANSFER FUNCTION USING PROLATE SPHEROIDAL HEAD MODEL , 2008 .

[10]  Youngjin Park,et al.  Enhanced Vertical Perception through Head-Related Impulse Response Customization Based on Pinna Response Tuning in the Median Plane , 2008, IEICE Trans. Fundam. Electron. Commun. Comput. Sci..

[11]  V. Ralph Algazi,et al.  An adaptable ellipsoidal head model for the interaural time difference , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[12]  Simone Spagnol,et al.  Hearing distance: A low-cost model for near-field binaural effects , 2012, 2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO).

[13]  Simone Spagnol,et al.  Fitting pinna-related transfer functions to anthropometry for binaural sound rendering , 2010, 2010 IEEE International Workshop on Multimedia Signal Processing.

[14]  R. M. Sachs,et al.  Anthropometric manikin for acoustic research. , 1975, The Journal of the Acoustical Society of America.

[15]  J. Blauert Spatial Hearing: The Psychophysics of Human Sound Localization , 1983 .

[16]  V. Ralph Algazi,et al.  Estimation of a Spherical-Head Model from Anthropometry , 2001 .

[17]  Simone Spagnol,et al.  On the Relation Between Pinna Reflection Patterns and Head-Related Transfer Function Features , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[18]  V R Algazi,et al.  Elevation localization and head-related transfer function analysis at low frequencies. , 2001, The Journal of the Acoustical Society of America.

[19]  R Meddis,et al.  A physical model of sound diffraction and reflections in the human concha. , 1996, The Journal of the Acoustical Society of America.

[20]  Simone Spagnol,et al.  Estimation and modeling of pinna-related transfer functions. , 2010 .

[21]  V. Ralph Algazi,et al.  Physical and Filter Pinna Models Based on Anthropometry , 2007 .

[22]  Richard O. Duda,et al.  Structural composition and decomposition of HRTFs , 2001, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575).

[23]  Y. Shao,et al.  Sound Localization Cues for a Magnified Head: Implications from Sound Diffraction about a Rigid Sphere , 1993, Presence: Teleoperators & Virtual Environments.

[24]  Youn-sik Park,et al.  Modeling and Customization of Head-Related Impulse Responses Based on General Basis Functions in Time Domain , 2008 .

[25]  R H Y So,et al.  Toward orthogonal non-individualised head-related transfer functions for forward and backward directional sound: cluster analysis and an experimental study , 2010, Ergonomics.

[26]  Ramani Duraiswami,et al.  Extracting the frequencies of the pinna spectral notches in measured head related impulse responses. , 2004, The Journal of the Acoustical Society of America.

[27]  V. Ralph Algazi,et al.  The Use of Head-and-Torso Models for Improved Spatial Sound Synthesis , 2002 .

[28]  Simone Spagnol,et al.  A Modular Framework for the Analysis and Synthesis of Head-Related Transfer Functions , 2013 .

[29]  F. Asano,et al.  Role of spectral cues in median plane localization. , 1990, The Journal of the Acoustical Society of America.

[30]  Gaëtan Parseihian,et al.  Perceptually based head-related transfer function database optimization. , 2012, The Journal of the Acoustical Society of America.

[31]  Bernhard U. Seeber,et al.  Subjective selection of non-individual head-related transfer functions , 2003 .

[32]  D. W. Batteau,et al.  The role of the pinna in human localization , 1967, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[33]  Simone Spagnol,et al.  A Head-Related Transfer Function Model for Real-Time Customized 3-D Sound Rendering , 2011, 2011 Seventh International Conference on Signal Image Technology & Internet-Based Systems.

[34]  G. F. Kuhn Model for the interaural time differences in the azimuthal plane , 1977 .

[35]  M. Gardner,et al.  Problem of localization in the median plane: effect of pinnae cavity occlusion. , 1973, The Journal of the Acoustical Society of America.

[36]  Tomi Huttunen,et al.  Some Effects of the Torso on Head-Related Transfer Functions , 2007 .

[37]  Youngjin Park,et al.  Optimization of spherical and spheroidal head model for Head Related Transfer Function customization: Magnitude comparison , 2008, 2008 International Conference on Control, Automation and Systems.

[38]  B F Katz,et al.  Boundary element method calculation of individual head-related transfer function. I. Rigid model calculation. , 2001, The Journal of the Acoustical Society of America.

[39]  Ville Pulkki,et al.  A single-azimuth pinna-related transfer function database , 2011 .

[40]  Gregory H. Wakefield,et al.  Introduction to Head-Related Transfer Functions (HRTFs): Representations of HRTFs in Time, Frequency, and Space , 2001 .