论文信息 - A Fully-Autonomous Aerial Robot for Search and Rescue Applications in Indoor Environments using Learning-Based Techniques

A Fully-Autonomous Aerial Robot for Search and Rescue Applications in Indoor Environments using Learning-Based Techniques

Search and Rescue (SAR) missions represent an important challenge in the robotics research field as they usually involve exceedingly variable-nature scenarios which require a high-level of autonomy and versatile decision-making capabilities. This challenge becomes even more relevant in the case of aerial robotic platforms owing to their limited payload and computational capabilities. In this paper, we present a fully-autonomous aerial robotic solution, for executing complex SAR missions in unstructured indoor environments. The proposed system is based on the combination of a complete hardware configuration and a flexible system architecture which allows the execution of high-level missions in a fully unsupervised manner (i.e. without human intervention). In order to obtain flexible and versatile behaviors from the proposed aerial robot, several learning-based capabilities have been integrated for target recognition and interaction. The target recognition capability includes a supervised learning classifier based on a computationally-efficient Convolutional Neural Network (CNN) model trained for target/background classification, while the capability to interact with the target for rescue operations introduces a novel Image-Based Visual Servoing (IBVS) algorithm which integrates a recent deep reinforcement learning method named Deep Deterministic Policy Gradients (DDPG). In order to train the aerial robot for performing IBVS tasks, a reinforcement learning framework has been developed, which integrates a deep reinforcement learning agent (e.g. DDPG) with a Gazebo-based simulator for aerial robotics. The proposed system has been validated in a wide range of simulation flights, using Gazebo and PX4 Software-In-The-Loop, and real flights in cluttered indoor environments, demonstrating the versatility of the proposed system in complex SAR missions.

[1] Stefan Kohlbrecher,et al. A flexible and scalable SLAM system with full 3D motion estimation , 2011, 2011 IEEE International Symposium on Safety, Security, and Rescue Robotics.

[2] Pascual Campoy Cervera,et al. A Multi-Layered Component-Based Approach for the Development of Aerial Robotic Systems: The Aerostack Framework , 2017, J. Intell. Robotic Syst..

[3] Chih-Jen Lin,et al. LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[4] Gang Xiang,et al. Design of the life-ring drone delivery system for rip current rescue , 2016, 2016 IEEE Systems and Information Engineering Design Symposium (SIEDS).

[5] Marc Pollefeys,et al. PIXHAWK: A system for autonomous flight using onboard computer vision , 2011, 2011 IEEE International Conference on Robotics and Automation.

[6] Ali Farhadi,et al. Target-driven visual navigation in indoor scenes using deep reinforcement learning , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[7] Patrick Doherty,et al. A UAV Search and Rescue Scenario with Human Body Detection and Geolocalization , 2007, Australian Conference on Artificial Intelligence.

[8] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.

[9] François Chaumette,et al. 2 1/2 D visual servoing: a possible solution to improve image-based and position-based visual servoings , 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065).

[10] Wojciech Zaremba,et al. OpenAI Gym , 2016, ArXiv.

[11] Kaiming He,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12] Hriday Bavle,et al. A fully-autonomous aerial robotic solution for the 2016 International Micro Air Vehicle competition , 2017, 2017 International Conference on Unmanned Aircraft Systems (ICUAS).

[13] Martin Molina,et al. A flexible and dynamic mission planning architecture for UAV swarm coordination , 2016, 2016 International Conference on Unmanned Aircraft Systems (ICUAS).

[14] Geoffrey E. Hinton,et al. Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[15] Farid Melgani,et al. A Convolutional Neural Network Approach for Assisting Avalanche Search and Rescue Operations with UAV Imagery , 2017, Remote. Sens..

[16] Nicholas Roy,et al. Autonomous Flight in Unknown Indoor Environments , 2009 .

[17] W.D. Smart,et al. What does shaping mean for computational reinforcement learning? , 2008, 2008 7th IEEE International Conference on Development and Learning.

[18] Hriday Bavle,et al. A flight altitude estimator for multirotor UAVs in dynamic and unstructured indoor environments , 2017, 2017 International Conference on Unmanned Aircraft Systems (ICUAS).

[19] Rich Caruana,et al. An empirical comparison of supervised learning algorithms , 2006, ICML.

[20] Wolfram Burgard,et al. A Fully Autonomous Indoor Quadrotor , 2012, IEEE Transactions on Robotics.

[21] Sergey Levine,et al. (CAD)$^2$RL: Real Single-Image Flight without a Single Real Image , 2016, Robotics: Science and Systems.

[22] Ezio Malis,et al. Improving vision-based control using efficient second-order minimization techniques , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[23] Éric Marchand,et al. ViSP for visual servoing: a generic software platform with a wide class of robot control skills , 2005, IEEE Robotics & Automation Magazine.

[24] Thomas Moore,et al. A Generalized Extended Kalman Filter Implementation for the Robot Operating System , 2014, IAS.

[25] Bill Triggs,et al. Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[26] Roland Siegwart,et al. RotorS—A Modular Gazebo MAV Simulator Framework , 2016 .

[27] Mathias Fleck,et al. Usability of Lightweight Defibrillators for UAV Delivery , 2016, CHI Extended Abstracts.

[28] François Chaumette,et al. Visual servo control. I. Basic approaches , 2006, IEEE Robotics & Automation Magazine.

[29] Angelo Cangelosi,et al. Autonomous Quadrotor Landing using Deep Reinforcement Learning , 2017, ArXiv.

[30] Trevor Darrell,et al. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[31] D. Erdos,et al. An experimental UAV system for search and rescue challenge , 2013, IEEE Aerospace and Electronic Systems Magazine.

[32] Kurt Konolige,et al. The Office Marathon: Robust navigation in an indoor office environment , 2010, 2010 IEEE International Conference on Robotics and Automation.

[33] Pascual Campoy Cervera,et al. A Review of Deep Learning Methods and Applications for Unmanned Aerial Vehicles , 2017, J. Sensors.

[34] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[35] Bernhard Rinner,et al. An Autonomous Multi-UAV System for Search and Rescue , 2015, DroNet@MobiSys.

[36] Marco Colombetti,et al. Robot Shaping: An Experiment in Behavior Engineering , 1997 .

[37] Jingxuan Sun,et al. A Camera-Based Target Detection and Positioning UAV System for Search and Rescue (SAR) Purposes , 2016, Sensors.

[38] Roland Siegwart,et al. MAV navigation through indoor corridors using optical flow , 2010, 2010 IEEE International Conference on Robotics and Automation.

[39] Andrea Sanna,et al. New Frontiers of Delivery Services Using Drones: A Prototype System Exploiting a Quadcopter for Autonomous Drug Shipments , 2015, 2015 IEEE 39th Annual Computer Software and Applications Conference.

[40] Morgan Quigley,et al. ROS: an open-source Robot Operating System , 2009, ICRA 2009.

[41] Chih-Jen Lin,et al. LIBSVM: A library for support vector machines , 2011, TIST.

[42] P. Rudol,et al. Human Body Detection and Geolocalization for UAV Search and Rescue Missions Using Color and Thermal Imagery , 2008, 2008 IEEE Aerospace Conference.

[43] Francois Chaumette,et al. Potential problems of unstability and divergence in image-based and position-based visual servoing , 1999, 1999 European Control Conference (ECC).

[44] Wei Liu,et al. SSD: Single Shot MultiBox Detector , 2015, ECCV.

[45] H. Abrahamsen. A remotely piloted aircraft system in major incident management: concept and pilot, feasibility study , 2015, BMC Emergency Medicine.

[46] G. DeJong,et al. Theory and Application of Reward Shaping in Reinforcement Learning , 2004 .

[47] Pascual Campoy Cervera,et al. A General Purpose Configurable Controller for Indoors and Outdoors GPS-Denied Navigation for Multirotor Unmanned Aerial Vehicles , 2014, J. Intell. Robotic Syst..

[48] Darius Burschka,et al. Toward a Fully Autonomous UAV: Research Platform for Indoor and Outdoor Urban Search and Rescue , 2012, IEEE Robotics & Automation Magazine.

[49] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[50] Nitish Srivastava,et al. Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[51] François Chaumette,et al. Potential problems of stability and convergence in image-based and position-based visual servoing , 1997 .