Binaural sound source localization using the frequency diversity of the head-related transfer function.

The spectral localization cues contained in the head-related transfer function are known to play a contributory role in the sound source localization abilities of humans. However, existing localization techniques are unable to fully exploit this diversity to accurately localize a sound source. The availability of just two measured signals complicates matters further, and results in front to back confusions and poor performance distinguishing between the source locations in a vertical plane. This study evaluates the performance of a source location estimator that retains the frequency domain diversity of the head-related transfer function. First, a method for extracting the directional information in the subbands of a broadband signal is described, and a composite estimator based on signal subspace decomposition is introduced. The localization performance is experimentally evaluated for single and multiple source scenarios in the horizontal and vertical planes. The proposed estimator's ability to successfully localize a sound source and resolve the ambiguities in the vertical plane is demonstrated, and the impact of the source location, knowledge of the source and the effect of reverberation is discussed.

[1]  S. Shamma,et al.  Segregation of complex acoustic scenes based on temporal coherence , 2013, eLife.

[2]  E D Young,et al.  Neural network models of sound localization based on directional filtering by the pinna. , 1992, The Journal of the Acoustical Society of America.

[3]  M. Cynader,et al.  A computational theory of spectral cue localization , 1993 .

[4]  Kazuhiro Iida,et al.  Upper hemisphere sound localization using head-related transfer functions in the median plane and interaural differences , 2003 .

[5]  Volker Hohmann,et al.  Sound source localization in real sound fields based on empirical statistics of interaural parameters. , 2006, The Journal of the Acoustical Society of America.

[6]  Ramani Duraiswami,et al.  Numerical study of the influence of the torso on the HRTF , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7]  Mostafa Kaveh,et al.  Coherent wide-band ESPRIT method for directions-of-arrival estimation of multiple wide-band sources , 1990, IEEE Trans. Acoust. Speech Signal Process..

[8]  Paul M. Hofman,et al.  Relearning sound localization with new ears , 1998, Nature Neuroscience.

[9]  H. Colburn,et al.  Models of Sound Localization , 2005 .

[10]  Simon Carlile,et al.  The nature and distribution of errors in sound localization by human listeners , 1997, Hearing Research.

[11]  Harald Viste,et al.  Binaural Source Localization by Joint Estimation of ILD and ITD , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[12]  F. Wightman,et al.  The dominant role of low-frequency interaural time differences in sound localization. , 1992, The Journal of the Acoustical Society of America.

[13]  E. Macpherson,et al.  Binaural weighting of monaural spectral cues for sound localization. , 2007, The Journal of the Acoustical Society of America.

[14]  Zhi Ding,et al.  Broadband DOA estimation using frequency invariant beamforming , 1998, IEEE Trans. Signal Process..

[15]  Virginia Best,et al.  The role of high frequencies in speech localization. , 2005, The Journal of the Acoustical Society of America.

[16]  R. O. Schmidt,et al.  Multiple emitter location and signal Parameter estimation , 1986 .

[17]  Thushara D. Abhayapala,et al.  Broadband DOA Estimation Using Sensor Arrays on Complex-Shaped Rigid Bodies , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[18]  Juan Liang,et al.  Robust and low complexity localization algorithm based on head-related impulse responses and interaural time difference. , 2013, The Journal of the Acoustical Society of America.

[19]  Thushara D. Abhayapala,et al.  HRTF aided broadband doa estimation using two microphones , 2012, 2012 International Symposium on Communications and Information Technologies (ISCIT).

[20]  B. Shinn-Cunningham,et al.  Tori of confusion: binaural localization cues for sources within reach of a listener. , 2000, The Journal of the Acoustical Society of America.

[21]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .

[22]  Rodney A. Kennedy,et al.  HRTF Measurement on KEMAR Manikin , 2009 .

[23]  Hong Wang,et al.  Coherent signal-subspace processing for the detection and estimation of angles of arrival of multiple wide-band sources , 1985, IEEE Trans. Acoust. Speech Signal Process..

[24]  D. M. Green,et al.  Sound localization by human listeners. , 1991, Annual review of psychology.

[25]  V. Ralph Algazi,et al.  Estimation of a Spherical-Head Model from Anthropometry , 2001 .

[26]  V. Mellert,et al.  Transformation characteristics of the external human ear. , 1977, The Journal of the Acoustical Society of America.

[27]  W M Hartmann,et al.  Identification and localization of sound sources in the median sagittal plane. , 1999, The Journal of the Acoustical Society of America.

[28]  C. Lim,et al.  Estimating the azimuth and elevation of a sound source from the output of a cochlear model , 1994, Proceedings of 1994 28th Asilomar Conference on Signals, Systems and Computers.

[29]  Jinyu Qian,et al.  The role of spectral modulation cues in virtual sound localization. , 2008, The Journal of the Acoustical Society of America.

[30]  M. Morimoto,et al.  Localization cues of sound sources in the upper hemisphere. , 1984 .

[31]  Densil Cabrera,et al.  Influence of fundamental frequency and source elevation on the vertical localization of complex tones and complex tone pairs. , 2007, The Journal of the Acoustical Society of America.

[32]  Richard M. Stern,et al.  Interaural Correlation as the Basis of a Working Model of Binaural Processing: An Introduction , 2005 .

[33]  Gerhard Lakemeyer,et al.  Azimuthal sound localization using coincidence of timing across frequency on a robotic platform. , 2007, The Journal of the Acoustical Society of America.

[34]  J. MacDonald A localization algorithm based on head-related transfer functions. , 2008, The Journal of the Acoustical Society of America.

[35]  V R Algazi,et al.  Elevation localization and head-related transfer function analysis at low frequencies. , 2001, The Journal of the Acoustical Society of America.