Artificial Robot Ear Design for Sound Direction Estimation

We propose a novel design of artificial robot ear for sound direction estimation using measured two outputs only. The spectral features in head-related transfer functions and in interaural transfer functions (ITFs) are distinctive in the voice frequency band. Thus, these features provide effective sound cues to estimate the sound direction using two measured ear outputs only without input information. Especially, the direction of median source can be estimated by the bilateral asymmetry of microphone positions, whereas many previous studies, which depend on the interaural differences, cannot achieve. We propose a localization method to estimate the lateral and vertical angles simultaneously. The lateral angle is estimated using interaural time delay and Woodworth and Schlosberg's formula, and the front-back discrimination is achieved by the presence of the spectral features in ITF estimated from two measured outputs. The vertical angle of the frontal source is estimated by comparing the spectral features in the estimated ITF with those in the database built in an anechoic chamber.

[1]  J. Bendat,et al.  Random Data: Analysis and Measurement Procedures , 1987 .

[2]  Gregory H. Wakefield,et al.  Introduction to Head-Related Transfer Functions (HRTFs): Representations of HRTFs in Time, Frequency, and Space , 2001 .

[3]  J. Hebrank,et al.  Spectral cues used in the localization of sound sources on the median plane. , 1974, The Journal of the Acoustical Society of America.

[4]  Hiroaki Kitano,et al.  Sound and Visual Tracking for Humanoid Robot , 2004, Applied Intelligence.

[5]  E. Langendijk,et al.  Contribution of spectral cues to human sound localization. , 1999, The Journal of the Acoustical Society of America.

[6]  Jie Huang,et al.  Robotic spatial sound localization and its 3D sound human interface , 2002, First International Symposium on Cyber Worlds, 2002. Proceedings..

[7]  Kazuhiro Iida,et al.  Role of pinna cavities in median plane localization , 1998 .

[8]  Hiroshi G. Okuno,et al.  Improvement of recognition of simultaneous speech signals using AV integration and scattering theory for humanoid robots , 2004, Speech Commun..

[9]  E. Shaw,et al.  Sound pressure generated in an external-ear replica and real human ears by a nearby point source. , 1968, The Journal of the Acoustical Society of America.

[10]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .

[11]  R Meddis,et al.  A physical model of sound diffraction and reflections in the human concha. , 1996, The Journal of the Acoustical Society of America.

[12]  E. Shaw,et al.  External-ear acoustic models with simple geometry. , 1968, The Journal of the Acoustical Society of America.

[13]  M. Morimoto,et al.  Localization cues of sound sources in the upper hemisphere. , 1984 .

[14]  Youngjin Park,et al.  Artificial Ear for Robots , 2006, 2006 5th IEEE Conference on Sensors.