A Fully-Autonomous Aerial Robot for Search and Rescue Applications in Indoor Environments using Learning-Based Techniques

Search and Rescue (SAR) missions represent an important challenge in the robotics research field as they usually involve exceedingly variable-nature scenarios which require a high-level of autonomy and versatile decision-making capabilities. This challenge becomes even more relevant in the case of aerial robotic platforms owing to their limited payload and computational capabilities. In this paper, we present a fully-autonomous aerial robotic solution, for executing complex SAR missions in unstructured indoor environments. The proposed system is based on the combination of a complete hardware configuration and a flexible system architecture which allows the execution of high-level missions in a fully unsupervised manner (i.e. without human intervention). In order to obtain flexible and versatile behaviors from the proposed aerial robot, several learning-based capabilities have been integrated for target recognition and interaction. The target recognition capability includes a supervised learning classifier based on a computationally-efficient Convolutional Neural Network (CNN) model trained for target/background classification, while the capability to interact with the target for rescue operations introduces a novel Image-Based Visual Servoing (IBVS) algorithm which integrates a recent deep reinforcement learning method named Deep Deterministic Policy Gradients (DDPG). In order to train the aerial robot for performing IBVS tasks, a reinforcement learning framework has been developed, which integrates a deep reinforcement learning agent (e.g. DDPG) with a Gazebo-based simulator for aerial robotics. The proposed system has been validated in a wide range of simulation flights, using Gazebo and PX4 Software-In-The-Loop, and real flights in cluttered indoor environments, demonstrating the versatility of the proposed system in complex SAR missions.

[1]  Stefan Kohlbrecher,et al.  A flexible and scalable SLAM system with full 3D motion estimation , 2011, 2011 IEEE International Symposium on Safety, Security, and Rescue Robotics.

[2]  Pascual Campoy Cervera,et al.  A Multi-Layered Component-Based Approach for the Development of Aerial Robotic Systems: The Aerostack Framework , 2017, J. Intell. Robotic Syst..

[3]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[4]  Gang Xiang,et al.  Design of the life-ring drone delivery system for rip current rescue , 2016, 2016 IEEE Systems and Information Engineering Design Symposium (SIEDS).

[5]  Marc Pollefeys,et al.  PIXHAWK: A system for autonomous flight using onboard computer vision , 2011, 2011 IEEE International Conference on Robotics and Automation.

[6]  Ali Farhadi,et al.  Target-driven visual navigation in indoor scenes using deep reinforcement learning , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[7]  Patrick Doherty,et al.  A UAV Search and Rescue Scenario with Human Body Detection and Geolocalization , 2007, Australian Conference on Artificial Intelligence.

[8]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[9]  François Chaumette,et al.  2 1/2 D visual servoing: a possible solution to improve image-based and position-based visual servoings , 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065).

[10]  Wojciech Zaremba,et al.  OpenAI Gym , 2016, ArXiv.

[11]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Hriday Bavle,et al.  A fully-autonomous aerial robotic solution for the 2016 International Micro Air Vehicle competition , 2017, 2017 International Conference on Unmanned Aircraft Systems (ICUAS).

[13]  Martin Molina,et al.  A flexible and dynamic mission planning architecture for UAV swarm coordination , 2016, 2016 International Conference on Unmanned Aircraft Systems (ICUAS).

[14]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[15]  Farid Melgani,et al.  A Convolutional Neural Network Approach for Assisting Avalanche Search and Rescue Operations with UAV Imagery , 2017, Remote. Sens..

[16]  Nicholas Roy,et al.  Autonomous Flight in Unknown Indoor Environments , 2009 .

[17]  W.D. Smart,et al.  What does shaping mean for computational reinforcement learning? , 2008, 2008 7th IEEE International Conference on Development and Learning.

[18]  Hriday Bavle,et al.  A flight altitude estimator for multirotor UAVs in dynamic and unstructured indoor environments , 2017, 2017 International Conference on Unmanned Aircraft Systems (ICUAS).

[19]  Rich Caruana,et al.  An empirical comparison of supervised learning algorithms , 2006, ICML.

[20]  Wolfram Burgard,et al.  A Fully Autonomous Indoor Quadrotor , 2012, IEEE Transactions on Robotics.

[21]  Sergey Levine,et al.  (CAD)$^2$RL: Real Single-Image Flight without a Single Real Image , 2016, Robotics: Science and Systems.

[22]  Ezio Malis,et al.  Improving vision-based control using efficient second-order minimization techniques , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[23]  Éric Marchand,et al.  ViSP for visual servoing: a generic software platform with a wide class of robot control skills , 2005, IEEE Robotics & Automation Magazine.

[24]  Thomas Moore,et al.  A Generalized Extended Kalman Filter Implementation for the Robot Operating System , 2014, IAS.

[25]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[26]  Roland Siegwart,et al.  RotorS—A Modular Gazebo MAV Simulator Framework , 2016 .

[27]  Mathias Fleck,et al.  Usability of Lightweight Defibrillators for UAV Delivery , 2016, CHI Extended Abstracts.

[28]  François Chaumette,et al.  Visual servo control. I. Basic approaches , 2006, IEEE Robotics & Automation Magazine.

[29]  Angelo Cangelosi,et al.  Autonomous Quadrotor Landing using Deep Reinforcement Learning , 2017, ArXiv.

[30]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[31]  D. Erdos,et al.  An experimental UAV system for search and rescue challenge , 2013, IEEE Aerospace and Electronic Systems Magazine.

[32]  Kurt Konolige,et al.  The Office Marathon: Robust navigation in an indoor office environment , 2010, 2010 IEEE International Conference on Robotics and Automation.

[33]  Pascual Campoy Cervera,et al.  A Review of Deep Learning Methods and Applications for Unmanned Aerial Vehicles , 2017, J. Sensors.

[34]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[35]  Bernhard Rinner,et al.  An Autonomous Multi-UAV System for Search and Rescue , 2015, DroNet@MobiSys.

[36]  Marco Colombetti,et al.  Robot Shaping: An Experiment in Behavior Engineering , 1997 .

[37]  Jingxuan Sun,et al.  A Camera-Based Target Detection and Positioning UAV System for Search and Rescue (SAR) Purposes , 2016, Sensors.

[38]  Roland Siegwart,et al.  MAV navigation through indoor corridors using optical flow , 2010, 2010 IEEE International Conference on Robotics and Automation.

[39]  Andrea Sanna,et al.  New Frontiers of Delivery Services Using Drones: A Prototype System Exploiting a Quadcopter for Autonomous Drug Shipments , 2015, 2015 IEEE 39th Annual Computer Software and Applications Conference.

[40]  Morgan Quigley,et al.  ROS: an open-source Robot Operating System , 2009, ICRA 2009.

[41]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[42]  P. Rudol,et al.  Human Body Detection and Geolocalization for UAV Search and Rescue Missions Using Color and Thermal Imagery , 2008, 2008 IEEE Aerospace Conference.

[43]  Francois Chaumette,et al.  Potential problems of unstability and divergence in image-based and position-based visual servoing , 1999, 1999 European Control Conference (ECC).

[44]  Wei Liu,et al.  SSD: Single Shot MultiBox Detector , 2015, ECCV.

[45]  H. Abrahamsen A remotely piloted aircraft system in major incident management: concept and pilot, feasibility study , 2015, BMC Emergency Medicine.

[46]  G. DeJong,et al.  Theory and Application of Reward Shaping in Reinforcement Learning , 2004 .

[47]  Pascual Campoy Cervera,et al.  A General Purpose Configurable Controller for Indoors and Outdoors GPS-Denied Navigation for Multirotor Unmanned Aerial Vehicles , 2014, J. Intell. Robotic Syst..

[48]  Darius Burschka,et al.  Toward a Fully Autonomous UAV: Research Platform for Indoor and Outdoor Urban Search and Rescue , 2012, IEEE Robotics & Automation Magazine.

[49]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[50]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[51]  François Chaumette,et al.  Potential problems of stability and convergence in image-based and position-based visual servoing , 1997 .