Reflection-Aware Sound Source Localization

We present a novel, reflection-aware method for 3D sound localization in indoor environments. Unlike prior approaches, which are mainly based on continuous sound signals from a stationary source, our formulation is designed to localize the position instantaneously from signals within a single frame. We consider direct sound and indirect sound signals that reach the microphones after reflecting off surfaces such as ceilings or walls. We then generate and trace direct and reflected acoustic paths using inverse acoustic ray tracing and utilize these paths with Monte Carlo localization to estimate a 3D sound source position. We have implemented our method on a robot with a cube-shaped microphone array and tested it against different settings with continuous and intermittent sound signals with a stationary or a mobile source. Across different settings, our approach can localize the sound with an average distance error of 0.8 m tested in a room of 7 m by 7 m area with 3 m height, including a mobile and non-line-of-sight sound source. We also reveal that the modeling of indirect rays increases the localization accuracy by 40% compared to only using direct acoustic rays.

[1]  Heinrich Kuttruff,et al.  Acoustics: An Introduction , 2006 .

[2]  Jean Rouat,et al.  Robust localization and tracking of simultaneous moving sound sources using beamforming and particle filtering , 2007, Robotics Auton. Syst..

[3]  Wolfram Burgard,et al.  OctoMap: an efficient probabilistic 3D mapping framework based on octrees , 2013, Autonomous Robots.

[4]  Gene H. Golub,et al.  Singular value decomposition and least squares solutions , 1970, Milestones in Matrix Computation.

[5]  Michael S. Brandstein,et al.  Microphone Arrays - Signal Processing Techniques and Applications , 2001, Microphone Arrays.

[6]  Lauri Savioja,et al.  Overview of geometrical room acoustic modeling techniques. , 2015, The Journal of the Acoustical Society of America.

[7]  Tapio Lokki,et al.  The room acoustic rendering equation. , 2007, The Journal of the Acoustical Society of America.

[8]  Yoko Sasaki,et al.  Probabilistic 3D sound source mapping using moving microphone array , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[9]  Tetsuya Ogata,et al.  Bayesian Extension of MUSIC for Sound Source Localization and Tracking , 2011, INTERSPEECH.

[10]  Patrick Danès,et al.  Broadband variations of the MUSIC high-resolution method for Sound Source Localization in Robotics , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[11]  Keisuke Nakamura,et al.  Intelligent sound source localization for dynamic environments , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[12]  Dinesh Manocha,et al.  Interactive Sound Propagation and Rendering for Large Multi-Source Scenes , 2016, ACM Trans. Graph..

[13]  François Michaud,et al.  Online global loop closure detection for large-scale multi-session graph-based SLAM , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[14]  M. Vorländer Simulation of the transient and steady‐state sound propagation in rooms using a new combined ray‐tracing/image‐source algorithm , 1989 .

[15]  Dinesh Manocha,et al.  Acoustic Classification and Optimization for Multi-Modal Rendering of Real-World Scenes , 2018, IEEE Transactions on Visualization and Computer Graphics.

[16]  James J. Little,et al.  A Boosted Particle Filter: Multitarget Detection and Tracking , 2004, ECCV.

[17]  Teresa A. Vidal-Calleja,et al.  Towards real-time 3D sound sources mapping with linear microphone arrays , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[18]  T. W. Anderson An Introduction to Multivariate Statistical Analysis , 1959 .

[19]  Sebastian Thrun,et al.  Probabilistic robotics , 2002, CACM.

[20]  M. Vorländer Computer simulations in room acoustics: concepts and uncertainties. , 2013, The Journal of the Acoustical Society of America.

[21]  François Michaud,et al.  The ManyEars open framework , 2013, Autonomous Robots.

[22]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .

[23]  Gautam Narang,et al.  Auditory-aware navigation for mobile robots based on reflection-robust sound source localization and visual SLAM , 2014, 2014 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[24]  Norihiro Hagita,et al.  Using multiple microphone arrays and reflections for 3D localization of sound sources , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[25]  François Michaud,et al.  Embedded auditory system for small mobile robots , 2008, 2008 IEEE International Conference on Robotics and Automation.

[26]  R. O. Schmidt,et al.  Multiple emitter location and signal Parameter estimation , 1986 .