Paper information - Error-Aware Imitation Learning from Teleoperation Data for Mobile Manipulation

In mobile manipulation (MM), robots can both navigate within and interact with their environment and are thus able to complete many more tasks than robots only capable of navigation or manipulation. In this work, we explore how to apply imitation learning (IL) to learn continuous visuo-motor policies for MM tasks. Much prior work has shown that IL can train visuo-motor policies for either manipulation or navigation domains, but few works have applied IL to the MM domain. Doing this is challenging for two reasons: on the data side, current interfaces make collecting high-quality human demonstrations difficult, and on the learning side, policies trained on limited data can suffer from covariate shift when deployed. To address these problems, we first propose Mobile Manipulation RoboTurk (MoMaRT), a novel teleoperation framework allowing simultaneous navigation and manipulation of mobile manipulators, and collect a first-of-its-kind large-scale dataset in a realistic simulated kitchen setting. We then propose a learned error detection system to address covariate shift by detecting when an agent is in a potential failure state. We train performant IL policies and error detectors from this data, and achieve over 45% task success rate and 85% error detection success rate across multiple multi-stage tasks when trained on expert data. Additional results and video at https://sites.google.com/view/il-for-mm/home.
