论文信息 - Legs as Manipulator: Pushing Quadrupedal Agility Beyond Locomotion

Legs as Manipulator: Pushing Quadrupedal Agility Beyond Locomotion

Locomotion has seen dramatic progress for walking or running across challenging terrains. However, robotic quadrupeds are still far behind their biological counterparts, such as dogs, which display a variety of agile skills and can use the legs beyond locomotion to perform several basic manipulation tasks like interacting with objects and climbing. In this paper, we take a step towards bridging this gap by training quadruped robots not only to walk but also to use the front legs to climb walls, press buttons, and perform object interaction in the real world. To handle this challenging optimization, we decouple the skill learning broadly into locomotion, which involves anything that involves movement whether via walking or climbing a wall, and manipulation, which involves using one leg to interact while balancing on the other three legs. These skills are trained in simulation using curriculum and transferred to the real world using our proposed sim2real variant that builds upon recent locomotion success. Finally, we combine these skills into a robust long-term plan by learning a behavior tree that encodes a high-level task hierarchy from one clean expert demonstration. We evaluate our method in both simulation and real-world showing successful executions of both short as well as long-range tasks and how robustness helps confront external perturbations. Videos at https://robot-skills.github.io

Deepak Pathak | Ashish Kumar | Xuxin Cheng

[1] S. Levine,et al. Robust and Versatile Bipedal Jumping Control through Multi-Task Reinforcement Learning , 2023, ArXiv.

[2] Joseph J. Lim,et al. Cross-Domain Transfer via Semantic Skill Imitation , 2022, CoRL.

[3] Jitendra Malik,et al. Legged Locomotion in Challenging Terrains using Egocentric Vision , 2022, CoRL.

[4] Zipeng Fu,et al. Deep Whole-Body Control: Learning a Unified Policy for Manipulation and Locomotion , 2022, CoRL.

[5] S. Levine,et al. Hierarchical Reinforcement Learning for Precise Soccer Shooting Skills using a Quadrupedal Robot , 2022, 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[6] Jitendra Malik,et al. Adapting Rapid Motor Adaptation for Bipedal Robots , 2022, 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[7] S. Levine,et al. ASE , 2022, ACM Trans. Graph..

[8] Jemin Hwangbo,et al. Concurrent Training of a Control Policy and a State Estimator for Dynamic and Robust Legged Locomotion , 2022, IEEE Robotics and Automation Letters.

[9] Lorenz Wellhausen,et al. Learning robust perceptive locomotion for quadrupedal robots in the wild , 2022, Science Robotics.

[10] Joseph J. Lim,et al. Adversarial Skill Chaining for Long-Horizon Robot Manipulation via Terminal State Regularization , 2021, CoRL.

[11] Jitendra Malik,et al. Minimizing Energy Consumption Leads to the Emergence of Gaits in Legged Robots , 2021, CoRL.

[12] Philipp Reist,et al. Learning to Walk in Minutes Using Massively Parallel Deep Reinforcement Learning , 2021, CoRL.

[13] Miles Macklin,et al. Isaac Gym: High Performance GPU-Based Physics Simulation For Robot Learning , 2021, NeurIPS Datasets and Benchmarks.

[14] Jitendra Malik,et al. RMA: Rapid Motor Adaptation for Legged Robots , 2021, Robotics: Science and Systems.

[15] Rutav Shah,et al. RRL: Resnet as representation for Reinforcement Learning , 2021, ICML.

[16] Byron Boots,et al. Fast and Efficient Locomotion via Learned Gait Transitions , 2021, CoRL.

[17] S. Levine,et al. AMP , 2021, ACM Trans. Graph..

[18] Koushil Sreenath,et al. Reinforcement Learning for Robust Parameterized Locomotion Control of Bipedal Robots , 2021, 2021 IEEE International Conference on Robotics and Automation (ICRA).

[19] Joonho Lee,et al. Circus ANYmal: A Quadruped Learning Dexterous Manipulation with Its Limbs , 2020, 2021 IEEE International Conference on Robotics and Automation (ICRA).

[20] Lorenz Wellhausen,et al. Learning quadrupedal locomotion over challenging terrain , 2020, Science Robotics.

[21] Koushil Sreenath,et al. Dynamic Legged Manipulation of a Ball Through Multi-Contact Optimization , 2020, 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[22] S. Levine,et al. Learning Agile Robotic Locomotion Skills by Imitating Animals , 2020, Robotics: Science and Systems.

[23] Vikash Kumar,et al. Multi-Agent Manipulation via Locomotion using Hierarchical Sim2Real , 2019, CoRL.

[24] Odest Chadwicke Jenkins,et al. Learning Behavior Trees From Demonstration , 2019, 2019 International Conference on Robotics and Automation (ICRA).

[25] Joseph J. Lim,et al. Composing Complex Skills by Learning Transition Policies , 2018, ICLR.

[26] Alec Radford,et al. Proximal Policy Optimization Algorithms , 2017, ArXiv.

[27] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[28] Petter Ögren,et al. Towards a unified behavior trees framework for robot control , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[29] Fan Yang,et al. Study on hexapod robot manipulation using legs , 2014, Robotica.

[30] Daniel E. Koditschek,et al. Legged Self-Manipulation , 2013, IEEE Access.

[31] Edwin Olson,et al. AprilTag: A robust and flexible visual fiducial system , 2011, 2011 IEEE International Conference on Robotics and Automation.

[32] Kevin M. Lynch,et al. Nonprehensile robotic manipulation: controllability and planning , 1996 .

[33] A. B. Loyall,et al. Integrating Reactivity, Goals, and Emotion in a Broad Agent , 1992 .