Investigating head-related transfer function smoothing using a sagittal-plane localization model

A new head-related transfer function (HRTF) smoothing algorithm is presented. HRTF magnitude responses are expressed on an equivalent rectangular bandwidth (ERB) frequency scale, and the degree of smoothing is increased by progressively discarding the higher-frequency Fourier coefficients. A sagittal-plane localization model was used to assess how much spectral smoothing can be applied without a significant increase in localization error. The results of the localization model simulation were compared with those of a previous perceptual investigation that used an algorithm discarding coefficients on a linear frequency scale. Our findings suggest that a perceptually motivated frequency scale yields similar localization performance with fewer than half the number of coefficients.
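The smoothing step described above can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the ERB-number mapping of Glasberg and Moore (1990), linear interpolation onto a uniform ERB grid, and a cosine series (the Fourier series of the even-symmetric extension) in place of whatever transform the paper uses. The names `smooth_on_erb_scale`, `n_keep`, and `n_points` are hypothetical.

```python
import math


def hz_to_erb_number(f_hz):
    """Map frequency in Hz to ERB-number (Glasberg and Moore, 1990)."""
    return 21.4 * math.log10(1.0 + 0.00437 * f_hz)


def erb_number_to_hz(e):
    """Inverse of hz_to_erb_number."""
    return (10.0 ** (e / 21.4) - 1.0) / 0.00437


def interp(x, xs, ys):
    """Linear interpolation at x; xs must be increasing. Clamps at the ends."""
    if x <= xs[0]:
        return ys[0]
    if x >= xs[-1]:
        return ys[-1]
    lo, hi = 0, len(xs) - 1
    while hi - lo > 1:  # binary search for the bracketing interval
        mid = (lo + hi) // 2
        if xs[mid] <= x:
            lo = mid
        else:
            hi = mid
    t = (x - xs[lo]) / (xs[hi] - xs[lo])
    return ys[lo] + t * (ys[hi] - ys[lo])


def smooth_on_erb_scale(freqs_hz, mag_db, n_keep, n_points=128):
    """Smooth a magnitude response by truncating its cosine series
    on a uniform ERB-number grid (assumed procedure, see lead-in).

    Returns (grid_hz, smoothed_db), both of length n_points.
    """
    e_lo = hz_to_erb_number(freqs_hz[0])
    e_hi = hz_to_erb_number(freqs_hz[-1])
    erb_grid = [e_lo + (e_hi - e_lo) * i / (n_points - 1)
                for i in range(n_points)]
    grid_hz = [erb_number_to_hz(e) for e in erb_grid]

    # Resample the response so that samples are equally spaced in ERB-number.
    warped = [interp(f, freqs_hz, mag_db) for f in grid_hz]

    # DCT-II coefficients, keeping only the n_keep lowest-order terms.
    coeffs = [
        sum(warped[n] * math.cos(math.pi * k * (n + 0.5) / n_points)
            for n in range(n_points))
        for k in range(n_keep)
    ]

    # Inverse transform using only the retained coefficients.
    smoothed = [
        coeffs[0] / n_points
        + (2.0 / n_points) * sum(
            coeffs[k] * math.cos(math.pi * k * (n + 0.5) / n_points)
            for k in range(1, n_keep))
        for n in range(n_points)
    ]
    return grid_hz, smoothed
```

Lowering `n_keep` increases the smoothing; with `n_keep` equal to `n_points` the resampled response is reproduced exactly. Because the grid is uniform in ERB-number rather than in Hz, a given number of coefficients preserves more detail at low frequencies, where auditory filters are narrower.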
