Monte Carlo exploration for active binaural localization

This study introduces a machine hearing system for robot audition, which enables a robotic agent to pro-actively minimize the uncertainty of sound source location estimates through motion. The proposed system is based on an active exploration approach, providing a means to model and predict effects of the agent's future motions on localization uncertainty in a probabilistic manner. Particle filtering is used to estimate the posterior probability density function of the source position from binaural measurements, enabling to jointly assess azimuth and distance of the source. The framework allows to infer and refine a policy to select appropriate actions via a Monte Carlo exploration approach. Experiments in simulated reverberant conditions are conducted, showing that active exploration and the incorporation of distance estimation significantly improve localization performance.

[1]  Alban Portello,et al.  A three-stage framework to active source localization from a binaural head , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[2]  Alexei Makarenko,et al.  Information based adaptive robotic exploration , 2002, IEEE/RSJ International Conference on Intelligent Robots and Systems.

[3]  Jont B. Allen,et al.  Image method for efficiently simulating small‐room acoustics , 1976 .

[4]  Steven van de Par,et al.  A Probabilistic Model for Robust Localization Based on a Binaural Auditory Front-End , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[5]  Guy J. Brown,et al.  Exploiting deep neural networks and head movements for binaural localisation of multiple speakers in reverberant conditions , 2015, INTERSPEECH.

[6]  John Mourjopoulos,et al.  Sound Source Distance Estimation in Rooms based on Statistical Properties of Binaural Signals , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[7]  Rudolph van der Merwe,et al.  Gaussian mixture sigma-point particle filters for sequential probabilistic inference in dynamic state-space models , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[8]  Alastair H. Moore,et al.  Acoustic simultaneous localization and mapping (A-SLAM) of a moving microphone array and its surrounding speakers , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[9]  Martin Müller,et al.  Monte-Carlo Exploration for Deterministic Planning , 2009, IJCAI.

[10]  Yoshitaka Nakajima,et al.  Auditory Scene Analysis: The Perceptual Organization of Sound Albert S. Bregman , 1992 .

[11]  Guy J. Brown,et al.  A machine-hearing system exploiting head movements for binaural sound localisation in reverberant conditions , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[12]  Dorothea Kolossa,et al.  Binaural sound source localisation and tracking using a dynamic spherical head model , 2015, INTERSPEECH.

[13]  Martin Cooke,et al.  Binaural Estimation of Sound Source Distance via the Direct-to-Reverberant Energy Ratio for Static and Moving Sources , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[14]  C. Avendano,et al.  The CIPIC HRTF database , 2001, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575).

[15]  Gabriel Bustamante,et al.  Towards information-based feedback control for binaural active localization , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[16]  Wolfram Burgard,et al.  Exploration with active loop-closing for FastSLAM , 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566).

[17]  Christopher M. Bishop,et al.  Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[18]  J. Blauert Spatial Hearing: The Psychophysics of Human Sound Localization , 1983 .

[19]  H. Wallach,et al.  The role of head movements and vestibular and visual cues in sound localization. , 1940 .

[20]  Sampo Vesa,et al.  Binaural Sound Source Distance Learning in Rooms , 2009, IEEE Transactions on Audio, Speech, and Language Processing.