DREGON: Dataset and Methods for UAV-Embedded Sound Source Localization

This paper introduces DREGON, a novel publicly-available dataset that aims at pushing research in sound source localization using a microphone array embedded in an unmanned aerial vehicle (UAV). The dataset contains both clean and noisy in-flight audio recordings continuously annotated with the 3D position of the target sound source using an accurate motion capture system. In addition, various signals of interests are available such as the rotational speed of individual rotors and inertial measurements at all time. Besides introducing the dataset, this paper sheds light on the specific properties, challenges and opportunities brought by the emerging task of UAV-embedded sound source localization. Several baseline methods are evaluated and compared on the dataset, with real-time applicability in mind. Very promising results are obtained for the localization of a broad-band source in loud noise conditions, while speech localization remains a challenge under extreme noise levels.

[1]  Andrea Cavallaro,et al.  Ear in the sky: Ego-noise reduction for auditory micro aerial vehicles , 2016, 2016 13th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[2]  Kazuhiro Nakadai,et al.  Partially Shared Deep Neural Network in sound source separation and identification using a UAV-embedded microphone array , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[3]  Jonathan G. Fiscus,et al.  Darpa Timit Acoustic-Phonetic Continuous Speech Corpus CD-ROM {TIMIT} | NIST , 1993 .

[4]  Joost van de Weijer,et al.  Review on computer vision techniques in emergency situations , 2017, Multimedia Tools and Applications.

[5]  Walter Kellermann,et al.  Ego-noise reduction using a motor data-guided multichannel dictionary , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[6]  Keisuke Nakamura,et al.  Outdoor auditory scene analysis using a moving microphone array embedded in a quadrocopter , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[7]  Parham Aarabi,et al.  Self-localizing dynamic microphone arrays , 2002 .

[8]  Akinori Ito,et al.  Internal noise suppression for speech recognition by small robots , 2005, INTERSPEECH.

[9]  R. O. Schmidt,et al.  Multiple emitter location and signal Parameter estimation , 1986 .

[10]  Michael S. Brandstein,et al.  Robust Localization in Reverberant Rooms , 2001, Microphone Arrays.

[11]  François Michaud,et al.  The ManyEars open framework , 2013, Autonomous Robots.

[12]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .

[13]  Kazuhiro Nakadai,et al.  Ego-motion noise suppression for robots based on Semi-Blind Infinite Non-negative Matrix Factorization , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[14]  Carla Teixeira Lopes,et al.  TIMIT Acoustic-Phonetic Continuous Speech Corpus , 2012 .

[15]  Raffaello D'Andrea,et al.  Guest Editorial Can Drones Deliver? , 2014, IEEE Trans Autom. Sci. Eng..

[16]  Hiroshi G. Okuno,et al.  Design of UAV-Embedded Microphone Array System for Sound Source Localization in Outdoor Environments † , 2017, Sensors.

[17]  Antonio Franchi,et al.  The TeleKyb framework for a modular and extendible ROS-based quadrotor control , 2013, 2013 European Conference on Mobile Robots.

[18]  J. Larsen,et al.  Wind Noise Reduction using Non-Negative Sparse Coding , 2007, 2007 IEEE Workshop on Machine Learning for Signal Processing.

[19]  Keisuke Nakamura,et al.  Improvement in outdoor sound source detection using a quadrotor-embedded microphone array , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[20]  Patrick Danès,et al.  Broadband variations of the MUSIC high-resolution method for Sound Source Localization in Robotics , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[21]  Emmanuel Vincent,et al.  Multi-source TDOA estimation in reverberant audio using angular spectra and clustering , 2012, Signal Process..

[22]  Katsutoshi Itoyama,et al.  Noise correlation matrix estimation for improving sound source localization by multirotor UAV , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[23]  Keisuke Nakamura,et al.  Intelligent sound source localization for dynamic environments , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[24]  Dario Floreano,et al.  Robust acoustic source localization of emergency signals from Micro Air Vehicles , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[25]  Walter Kellermann,et al.  Challenges in Acoustic Signal Enhancement for Human-Robot Communication , 2014, ITG Symposium on Speech Communication.

[26]  Andrea Cavallaro,et al.  Microphone-Array Ego-Noise Reduction Algorithms for Auditory Micro Aerial Vehicles , 2017, IEEE Sensors Journal.

[27]  Iván V. Meza,et al.  Localization of sound sources in robotics: A review , 2017, Robotics Auton. Syst..