A Robust Real-Time Sound Source Localization System for Olivia Robot

In this paper, we present a robust real-time sound source localization system implemented on a social robot platform developed in Institute for Infocomm Research, Singapore. The audio localization system provides the robot with auditory senses and enables the robot to direct its face to a speaker outside its frontal vision system. As the localization system exploits time difference of arrival (TDOA), the placement of the 8 microphone system array is crucial. This paper discusses the configuration and implementation of our system for the Olivia robot platform for accurate 3D localization under high babble noise condition.

[1]  Jacob Benesty,et al.  Robust time delay estimation exploiting redundancy among multiple microphones , 2003, IEEE Trans. Speech Audio Process..

[2]  Masakiyo Fujimoto,et al.  Noise robust voice activity detection based on periodic to aperiodic component ratio , 2010, Speech Commun..

[3]  E. J. Hannan,et al.  Estimating group delay , 1973 .

[4]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .

[5]  Michael S. Brandstein,et al.  A practical methodology for speech source localization with microphone arrays , 1997, Comput. Speech Lang..

[6]  Huakang Li,et al.  A Spatial Sound Localization System for Mobile Robots , 2007, 2007 IEEE Instrumentation & Measurement Technology Conference IMTC 2007.

[7]  Fumio Kanehiro,et al.  Robust speech interface based on audio and video information fusion for humanoid HRP-2 , 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566).

[8]  M S Brandstein Time-delay estimation of reverberated speech exploiting harmonic structure. , 1999, The Journal of the Acoustical Society of America.

[9]  Carl Eckart Optimal Rectifier Systems for the Detection of Steady Signals , 1952 .

[10]  G. C. Carter,et al.  The smoothed coherence transform , 1973 .

[11]  Tatsuya Hirahara,et al.  An acoustical tele-presence robot: TeleHead II , 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566).

[12]  Youngjin Park,et al.  Sound Source Localization Methods with Considering of Microphone Placement in Robot Platform , 2007, RO-MAN 2007 - The 16th IEEE International Symposium on Robot and Human Interactive Communication.

[13]  Tetsunori Kobayashi,et al.  Multi-person conversation via multi-modal interface - a robot who communicate with multi-user - , 1999, EUROSPEECH.

[14]  Jean Rouat,et al.  Robust sound source localization using a microphone array on a mobile robot , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[15]  Hiroaki Kitano,et al.  Real-time sound source localization and separation for robot audition , 2002, INTERSPEECH.

[16]  Hiroaki Kitano,et al.  Social Interaction of Humanoid RobotBased on Audio-Visual Tracking , 2002, IEA/AIE.