Development of Robot Audition under Severe Conditions

The ability of robots to listen to several things at once with their own “ears”, i.e., robot audition, is critical in improving the performance of search and rescue activities under severe conditions. This paper introduces “HARK” robot audition open-source software and its capabilities of suppressing ego-noise that is caused by robot’s own movements such as motor, propeller and/or flying noise. Then it describes three main applications of robot audition: 1) Unmanned Aerial Vehicle (UAV) with a microphone array to capture sounds can localize a sound source by suppressing ego-noise with either hovering, slow gliding or fast gliding. It can also recognize a sound source by CNN. 2) A serpentine robot with a microphone array can estimate its posture by sound. It can also enhance a voice by Online Robust PCA. 3) A robot with a LiDAR and 32-channel microphone can visualize a sound map by superimposing sound source directions on point clouds.

[1]  Hiroshi G. Okuno,et al.  Improved sound source localization in horizontal plane for binaural robot audition , 2014, Applied Intelligence.

[2]  Katsutoshi Itoyama,et al.  Noise correlation matrix estimation for improving sound source localization by multirotor UAV , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[3]  Hiroaki Kitano,et al.  Active Audition for Humanoid , 2000, AAAI/IAAI.

[4]  François Michaud,et al.  Code reusability tools for programming mobile robots , 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566).

[5]  Satoshi Tadokoro,et al.  Microphone-accelerometer based 3D posture estimation for a hose-shaped rescue robot , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[6]  Satoshi Uemura,et al.  Outdoor Acoustic Event Identification using Sound Source Separation and Deep Learning with a Quadrotor-Embedded Microphone Array , 2015 .

[7]  Keisuke Nakamura,et al.  Intelligent sound source localization for dynamic environments , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[8]  Satoshi Tadokoro,et al.  Human-voice enhancement based on online RPCA for a hose-shaped rescue robot with a microphone array , 2015, 2015 IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR).

[9]  Katsutoshi Itoyama,et al.  Challenges in deploying a microphone array to localize and separate sound sources in real auditory scenes , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[10]  Keisuke Nakamura,et al.  Assessment of general applicability of ego noise estimation , 2011, 2011 IEEE International Conference on Robotics and Automation.

[11]  Hiroshi Sawada,et al.  Bayesian Nonparametrics for Microphone Array Processing , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[12]  Hiroshi G. Okuno,et al.  Robot audition: Its rise and perspectives , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[13]  Makoto Kumon,et al.  Design model of microphone arrays for multirotor helicopters , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[14]  Keisuke Nakamura,et al.  Improvement in outdoor sound source detection using a quadrotor-embedded microphone array , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[15]  Katsutoshi Itoyama,et al.  Posture estimation of hose-shaped robot using microphone array localization , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[16]  Hiroshi G. Okuno,et al.  A real-time super-resolution robot audition system that improves the robustness of simultaneous speech recognition , 2013, Adv. Robotics.

[17]  智晴 長尾,et al.  Deep Neural Network を用いた株式売買戦略の構築 , 2016 .