Learning to fly by crashing

How do you learn to navigate an Unmanned Aerial Vehicle (UAV) and avoid obstacles? One approach is to use a small dataset collected by human experts; however, high-capacity learning algorithms tend to overfit when trained with so little data. An alternative is to use simulation, but the gap between simulation and the real world remains large, especially for perception problems. The reason most research avoids using large-scale real data is the fear of crashes! In this paper, we propose to bite the bullet and collect a dataset of crashes itself! We build a drone whose sole purpose is to crash into objects: it samples naive trajectories and crashes into random objects. We crash our drone 11,500 times to create one of the largest UAV crash datasets. This dataset captures the different ways in which a UAV can crash. We use all of this negative flying data, in conjunction with positive data sampled from the same trajectories, to learn a simple yet powerful policy for UAV navigation. We show that this simple self-supervised model is quite effective at navigating the UAV, even in extremely cluttered environments with dynamic obstacles, including humans. For supplementary video see:
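The abstract describes the approach only at a high level; the sketch below illustrates one plausible reading of it: frames recorded near the moment of impact become negative examples, earlier frames on the same trajectory become positive ones, and a binary CNN scores crops of the current view to pick a steering direction. This is a minimal PyTorch sketch, not the authors' released code; the AlexNet backbone, the left/center/right crop geometry, the `crash_margin` hyperparameter, and the 0.5 safety threshold are all illustrative assumptions.

```python
# Hedged sketch of the self-supervised labeling and reactive policy
# suggested by the abstract. All names and hyperparameters are assumptions.
import torch
import torch.nn as nn
import torchvision.models as models
import torchvision.transforms as T
from PIL import Image


def label_trajectory(frames, crash_margin=10):
    """Split one crash trajectory into (frame, label) pairs.

    Frames within `crash_margin` of the impact are labeled 0 (unsafe);
    earlier frames are labeled 1 (safe). `crash_margin` is an assumed
    hyperparameter, not a value from the paper's abstract.
    """
    n = len(frames)
    return [(f, 0 if i >= n - crash_margin else 1)
            for i, f in enumerate(frames)]


class SafetyNet(nn.Module):
    """Binary classifier: does the image region ahead look flyable?"""

    def __init__(self):
        super().__init__()
        backbone = models.alexnet(weights=None)  # assumed backbone choice
        backbone.classifier[-1] = nn.Linear(4096, 2)  # crash vs. safe
        self.net = backbone

    def forward(self, x):
        return self.net(x)


@torch.no_grad()
def policy(model, image, preprocess):
    """Reactive control: score left/center/right crops, fly toward the safest."""
    w, h = image.width, image.height
    crops = {
        "left": image.crop((0, 0, w // 2, h)),
        "straight": image.crop((w // 4, 0, 3 * w // 4, h)),
        "right": image.crop((w // 2, 0, w, h)),
    }
    scores = {
        name: torch.softmax(model(preprocess(crop).unsqueeze(0)), dim=1)[0, 1].item()
        for name, crop in crops.items()
    }
    if scores["straight"] > 0.5:  # assumed safety threshold
        return "straight"
    return max(("left", "right"), key=scores.get)


# Usage example on a dummy frame.
model = SafetyNet().eval()
preprocess = T.Compose([T.Resize((224, 224)), T.ToTensor()])
frame = Image.new("RGB", (640, 360))
print(policy(model, frame, preprocess))
```

In this reading, no human annotation is needed: the crash itself supplies the labels, which is what makes collecting 11,500 trajectories feasible.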
