Optimal positioning of a binaural sensor on a humanoid head for sound source localization

A generic approach to the placement of a binaural sensor on a humanoid robot head is proposed in order to improve sound localization. After a brief description of binaural and spectral cues, the method is first illustrated on the well-known head spherical approximation, which response can be described analytically. Then, a simplified CAD model of a mannequin is considered. It is argued that recent developments on analytical expansions of HRTFs can help to get a solution even from a limited set of simulated head responses. The obtained conclusions noticeably comply with some antropomorphic statistics.

[1]  Abhijit Kulkarni,et al.  Infinite-impulse-response models of the head-related transfer function. , 1995, The Journal of the Acoustical Society of America.

[2]  R. Duda,et al.  Range dependence of the response of a spherical head model , 1998 .

[3]  Ramani Duraiswami,et al.  Extracting the frequencies of the pinna spectral notches in measured head related impulse responses. , 2004, The Journal of the Acoustical Society of America.

[4]  Hiroaki Kitano,et al.  Applying scattering theory to robot audition system: robust sound source localization and extraction , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[5]  L. W.,et al.  The Theory of Sound , 1898, Nature.

[6]  R. Duraiswami,et al.  Insights into head-related transfer function: Spatial dimensionality and continuous representation. , 2010, The Journal of the Acoustical Society of America.

[7]  Fumio Kanehiro,et al.  Robust speech interface based on audio and video information fusion for humanoid HRP-2 , 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566).

[8]  Tetsuya Ogata,et al.  Auditory and visual integration based localization and tracking of humans in daily-life environments , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[9]  Henrik Møller,et al.  The interaural time difference in binaural synthesis , 2000 .

[10]  J. Blauert Spatial Hearing: The Psychophysics of Human Sound Localization , 1983 .

[11]  William A. Yost,et al.  Spatial hearing: The psychophysics of human sound localization, revised edition , 1998 .

[12]  Ramani Duraiswami,et al.  INTERPOLATION AND RANGE EXTRAPOLATION OF HRTFS , 2004 .

[13]  C. Avendano,et al.  The CIPIC HRTF database , 2001, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575).

[14]  Christopher A. Feuchter Numerical integration over a sphere , 1968 .

[15]  V. Ralph Algazi,et al.  An adaptable ellipsoidal head model for the interaural time difference , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[16]  R. Woodworth,et al.  Experimental Psychology, 3rd ed , 1957 .

[17]  F. Wightman,et al.  A model of head-related transfer functions based on principal components analysis and minimum-phase reconstruction. , 1992, The Journal of the Acoustical Society of America.

[18]  Anthony I. Tew,et al.  Analyzing head-related transfer function measurements using surface spherical harmonics , 1998 .

[19]  V R Algazi,et al.  Elevation localization and head-related transfer function analysis at low frequencies. , 2001, The Journal of the Acoustical Society of America.

[20]  Stefan Weinzierl,et al.  Individualization of Dynamic Binaural Synthesis by Real Time Manipulation of ITD , 2010 .