First applications of sound-based control on a mobile robot equipped with two microphones

This paper validates experimentally a novel approach to robot audition, sound-based control, which consists in introducing auditory features directly as inputs of a closed-loop control scheme, that is, without any explicit localization process. The applications we present rely on the implicit bearings of the sound sources computed from the time difference of arrival (TDOA) between two microphones. By linking the motion of the robot to the aural perception of the environment, this approach has the benefit of being more robust to reverberation and noise. Therefore neither complex tracking method such as Kalman filtering nor TDOA enhancement with denoising or dereverberation methods are needed to track the correct TDOA measurements. The experiments conducted on a mobile robot instrumented with a pair of microphones show the validity of our approach. In a reverberating and noisy room, this approach is able to orient the robot to a mobile sound source in real time. A positioning task with respect to two sound sources is also performed while the robot perception is disturbed by altered and spurious TDOA measurements.

[1]  François Chaumette,et al.  Sound-based control with two microphones , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[2]  Makoto Kumon,et al.  Adaptive audio servo for multirate robot systems , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[3]  S. Wermter,et al.  Robotic Sound-Source Localization and Tracking Using Interaural Time Difference and Cross-Correlation , 2004 .

[4]  Hiroaki Kitano,et al.  Active Audition for Humanoid , 2000, AAAI/IAAI.

[5]  Jean Rouat,et al.  Robust localization and tracking of simultaneous moving sound sources using beamforming and particle filtering , 2007, Robotics Auton. Syst..

[6]  Alban Portello,et al.  Active binaural localization of intermittent moving sources in the presence of false measurements , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[7]  Radu Horaud,et al.  Variational EM for binaural sound-source separation and localization , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[8]  DeLiang Wang,et al.  Binaural Localization of Multiple Sources in Reverberant and Noisy Environments , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[9]  François Chaumette,et al.  Visual Servoing and Visual Tracking , 2008, Springer Handbook of Robotics.

[10]  Stefan Wermter,et al.  Robotic sound-source localisation architecture using cross-correlation and recurrent neural networks , 2009, Neural Networks.

[11]  Emmanuel Vincent,et al.  Multi-source TDOA estimation in reverberant audio using angular spectra and clustering , 2012, Signal Process..

[12]  Parham Aarabi,et al.  Acoustic robot navigation using distributed microphone arrays , 2004, Inf. Fusion.

[13]  Giuseppe Oriolo,et al.  Feature Depth Observation for Image-based Visual Servoing: Theory and Experiments , 2008, Int. J. Robotics Res..

[14]  Ivan Markovic,et al.  Speaker localization and tracking with a microphone array on a mobile robot using von Mises distribution and particle filtering , 2010, Robotics Auton. Syst..

[15]  Kazuhiro Nakadai,et al.  Real-time sound source orientation estimation using a 96 channel microphone array , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[16]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .