论文信息 - Deep Visual MPC-Policy Learning for Navigation

Deep Visual MPC-Policy Learning for Navigation

Humans can routinely follow a trajectory defined by a list of images/landmarks. However, traditional robot navigation methods require accurate mapping of the environment, localization, and planning. Moreover, these methods are sensitive to subtle changes in the environment. In this letter, we propose PoliNet, a deep visual model predictive control-policy learning method that can perform visual navigation while avoiding collisions with unseen objects on the navigation path. PoliNet takes in as input a visual trajectory and 360$^{\circ }$ images from robot's current view and outputs velocity commands for a planning horizon of $N$ steps that optimally balance between trajectory following and obstacle avoidance. PoliNet is trained using a differentiable neural image predictive model and a traversability estimation model in an model predictive control setup, with minimal human supervision. PoliNet can be applied to visual trajectory in new scenes without retraining. We show experimentally that the robot can follow a visual trajectory even if it does not start from the exact same position and in the presence of previously unseen obstacles. We validated our algorithm with tests both in a realistic simulation environment and in the real world outperforming state-of-the-art baselines under similar conditions in success rate, coverage rate of the trajectory, and with lower computational load. We also show that we can generate visual trajectory in simulation and execute the corresponding path in the real environment.

[1] Patrick Rives,et al. A new approach to visual servoing in robotics , 1992, IEEE Trans. Robotics Autom..

[2] Peter I. Corke,et al. A tutorial on visual servo control , 1996, IEEE Trans. Robotics Autom..

[3] Wolfram Burgard,et al. The dynamic window approach to collision avoidance , 1997, IEEE Robotics Autom. Mag..

[4] Wolfram Burgard,et al. A Probabilistic Approach to Concurrent Mapping and Localization for Mobile Robots , 1998, Auton. Robots.

[5] E. Malis,et al. 2 1/2 D Visual Servoing , 1999 .

[6] Wolfram Burgard,et al. A real-time algorithm for mobile robot mapping with applications to multi-robot and 3D mapping , 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065).

[7] Daniel E. Koditschek,et al. Visual servoing via navigation functions , 2002, IEEE Trans. Robotics Autom..

[8] P. Poignet,et al. Image Based Visual Servoing through Nonlinear Model Predictive Control , 2006, Proceedings of the 45th IEEE Conference on Decision and Control.

[9] Hugh F. Durrant-Whyte,et al. Simultaneous localization and mapping: part I , 2006, IEEE Robotics & Automation Magazine.

[10] François Chaumette,et al. Visual servo control. I. Basic approaches , 2006, IEEE Robotics & Automation Magazine.

[11] François Chaumette,et al. Visual servo control. II. Advanced approaches [Tutorial] , 2007, IEEE Robotics & Automation Magazine.

[12] Guillaume Allibert,et al. Real-time visual predictive controller for image-based trajectory tracking of a mobile robot , 2008 .

[13] James B. Rawlings,et al. Postface to “ Model Predictive Control : Theory and Design ” , 2012 .

[14] Oussama Khatib,et al. A depth space approach to human-robot collision avoidance , 2012, 2012 IEEE International Conference on Robotics and Automation.

[15] Minoru Tanaka,et al. Personal robot assisting transportation to support active human life — Reference generation based on model predictive control for robust quick turning , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[16] Oliver Brock,et al. Deterioration of depth measurements due to interference of multiple RGB-D sensors , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[17] Thomas B. Schön,et al. From Pixels to Torques: Policy Learning with Deep Dynamical Models , 2015, ICML 2015.

[18] Ross A. Knepper,et al. DeepMPC: Learning Deep Latent Features for Model Predictive Control , 2015, Robotics: Science and Systems.

[19] Sergey Levine,et al. Learning deep control policies for autonomous aerial vehicles with MPC-guided policy search , 2015, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[20] Jitendra Malik,et al. View Synthesis by Appearance Flow , 2016, ECCV.

[21] Emilio Frazzoli,et al. A Survey of Motion Planning and Control Techniques for Self-Driving Urban Vehicles , 2016, IEEE Transactions on Intelligent Vehicles.

[22] Chun-Yi Su,et al. Vision-Based Model Predictive Control for Steering of a Nonholonomic Mobile Robot , 2016, IEEE Transactions on Control Systems Technology.

[23] Aaron M. Dollar,et al. Vision-based model predictive control for within-hand precision manipulation with underactuated grippers , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[24] Silvio Savarese,et al. Joint 2D-3D-Semantic Data for Indoor Scene Understanding , 2017, ArXiv.

[25] Ali Farhadi,et al. Target-driven visual navigation in indoor scenes using deep reinforcement learning , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[26] Matthias Nießner,et al. Matterport3D: Learning from RGB-D Data in Indoor Environments , 2017, 2017 International Conference on 3D Vision (3DV).

[27] Sergey Levine,et al. Deep visual foresight for planning robot motion , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[28] Samarth Brahmbhatt,et al. DeepNav: Learning to Navigate Large Cities , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[29] Jitendra Malik,et al. Unifying Map and Landmark Based Representations for Visual Navigation , 2017, ArXiv.

[30] Vladlen Koltun,et al. Semi-parametric Topological Memory for Navigation , 2018, ICLR.

[31] Silvio Savarese,et al. GONet++: Traversability Estimation via Dynamic Scene View Synthesis , 2018, ArXiv.

[32] Jitendra Malik,et al. On Evaluation of Embodied Navigation Agents , 2018, ArXiv.

[33] Qi Wu,et al. Vision-and-Language Navigation: Interpreting Visually-Grounded Navigation Instructions in Real Environments , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[34] Alexey Dosovitskiy,et al. End-to-End Driving Via Conditional Imitation Learning , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[35] Jitendra Malik,et al. Zero-Shot Visual Imitation , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[36] Noriaki Hirose,et al. MPC policy learning using DNN for human following control without collision , 2018, Adv. Robotics.

[37] Silvio Savarese,et al. GONet: A Semi-Supervised Deep Learning Approach For Traversability Estimation , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[38] Sergey Levine,et al. Self-Supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[39] Jitendra Malik,et al. Visual Memory for Robust Path Following , 2018, NeurIPS.

[40] Jitendra Malik,et al. Gibson Env: Real-World Perception for Embodied Agents , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[41] Varun Tolani. Visual Model Predictive Control , 2018 .

[42] Stefano Rovetta,et al. A Neural-Network-Based Model Predictive Control of Three-Phase Inverter With an Output $LC$ Filter , 2019, IEEE Access.

[43] Silvio Savarese,et al. A Behavioral Approach to Visual Navigation with Graph Localization Networks , 2019, Robotics: Science and Systems.

[44] Joonho Lee,et al. Learning agile and dynamic motor skills for legged robots , 2019, Science Robotics.

[45] Rahul Sukthankar,et al. Cognitive Mapping and Planning for Visual Navigation , 2017, International Journal of Computer Vision.

[46] Silvio Savarese,et al. VUNet: Dynamic Scene View Synthesis for Traversability Estimation Using an RGB Camera , 2018, IEEE Robotics and Automation Letters.