Learning to Fold Real Garments with One Arm: A Case Study in Cloud-Based Robotics Research

—Autonomous fabric manipulation is a longstanding challenge in robotics, but evaluating progress is difficult due to the cost and diversity of robot hardware. Using Reach, a cloud robotics platform that enables low-latency remote execution of control policies on physical robots, we present the first systematic benchmarking of fabric manipulation algorithms on physical hardware. We develop 4 novel learning- based algorithms that model expert actions, keypoints, reward functions, and dynamic motions, and we compare these against 4 learning-free and inverse dynamics algorithms on the task of folding a crumpled T-shirt with a single robot arm. The entire lifecycle of data collection, model training, and policy evaluation is performed remotely without physical access to the robot workcell. Results suggest a new algorithm combining imitation learning with analytic methods achieves 84% of human- level performance on the folding task. See https://sites. google.com/berkeley.edu/cloudfolding for all data, code, models, and supplemental material.

[1]  Siddhartha S. Srinivasa,et al.  Benchmarking Structured Policies and Policy Optimization for Real-World Dexterous Object Manipulation , 2021, IEEE Robotics and Automation Letters.

[2]  David Held,et al.  FabricFlowNet: Bimanual Cloth Manipulation with a Flow-based Policy , 2021, Conference on Robot Learning.

[3]  Francisco Roldan Sanchez,et al.  Solving the Real Robot Challenge Using Deep Reinforcement Learning , 2021, AICS.

[4]  Siddhartha S. Srinivasa,et al.  A Robot Cluster for Reproducible Research in Dexterous Manipulation , 2021, ArXiv.

[5]  Jonathan Tompson,et al.  Implicit Behavioral Cloning , 2021, CoRL.

[6]  Manuel Wüthrich,et al.  Transferring Dexterous Manipulation from GPU Simulation to a Remote Real-World TriFinger , 2021, ArXiv.

[7]  Shuran Song,et al.  FlingBot: The Unreasonable Effectiveness of Dynamic Manipulation for Cloth Unfolding , 2021, ArXiv.

[8]  Sergey Levine,et al.  Reset-Free Reinforcement Learning via Multi-Task Learning: Learning Dexterous Manipulation Behaviors without Human Intervention , 2021, 2021 IEEE International Conference on Robotics and Automation (ICRA).

[9]  Brijen Thananjeyan,et al.  LazyDAgger: Reducing Context Switching in Interactive Imitation Learning , 2021, 2021 IEEE 17th International Conference on Automation Science and Engineering (CASE).

[10]  Jeffrey Zhang,et al.  Dexterous Manipulation Primitives for the Real Robot Challenge , 2021, ArXiv.

[11]  Matthew R. Walter,et al.  Grasp and Motion Planning for Dexterous Manipulation for the Real Robot Challenge , 2021, ArXiv.

[12]  Isabella Huang,et al.  Simulation of Parallel-Jaw Grasping using Incremental Potential Contact Models , 2021, ArXiv.

[13]  Yufei Wang,et al.  SoftGym: Benchmarking Deep Reinforcement Learning for Deformable Object Manipulation , 2020, CoRL.

[14]  Peter R. Florence,et al.  Transporter Networks: Rearranging the Visual World for Robotic Manipulation , 2020, CoRL.

[15]  Ludovic Righetti,et al.  TriFinger: An Open-Source Robot for Learning Dexterity , 2020, CoRL.

[16]  Katsu Yamane,et al.  Learning to Smooth and Fold Real Fabric Using Dense Object Descriptors Trained on Synthetic Color Images , 2020, ArXiv.

[17]  Ken Goldberg,et al.  VisuoSpatial Foresight for Multi-Step, Multi-Task Fabric Manipulation , 2020, Robotics: Science and Systems.

[18]  Peter Corke,et al.  EGAD! An Evolved Grasping Analysis Dataset for Diversity and Reproducibility in Robotic Manipulation , 2020, IEEE Robotics and Automation Letters.

[19]  Danica Kragic,et al.  Benchmarking Bimanual Cloth Manipulation , 2020, IEEE Robotics and Automation Letters.

[20]  Luke Zettlemoyer,et al.  ALFRED: A Benchmark for Interpreting Grounded Instructions for Everyday Tasks , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  P. Abbeel,et al.  Learning to Manipulate Deformable Objects without Demonstrations , 2019, Robotics: Science and Systems.

[22]  Ken Goldberg,et al.  Deep Imitation Learning of Sequential Fabric Smoothing From an Algorithmic Supervisor , 2019, 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[23]  S. Levine,et al.  RoboNet: Large-Scale Multi-Robot Learning , 2019, Conference on Robot Learning.

[24]  S. Levine,et al.  ROBEL: Robotics Benchmarks for Learning with Low-Cost Robots , 2019, CoRL.

[25]  Sergey Levine,et al.  REPLAB: A Reproducible Low-Cost Arm Benchmark Platform for Robotic Learning , 2019, ArXiv.

[26]  Jürgen Leitner,et al.  Quantifying the Reality Gap in Robotic Manipulation Tasks , 2018, 2019 International Conference on Robotics and Automation (ICRA).

[27]  Herke van Hoof,et al.  Addressing Function Approximation Error in Actor-Critic Methods , 2018, ICML.

[28]  Sergey Levine,et al.  Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor , 2018, ICML.

[29]  Jonathan P. How,et al.  Duckietown: An open, inexpensive and flexible platform for autonomy education and research , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[30]  Xinyu Liu,et al.  Dex-Net 2.0: Deep Learning to Plan Robust Grasps with Synthetic Point Clouds and Analytic Grasp Metrics , 2017, Robotics: Science and Systems.

[31]  Jitendra Malik,et al.  Combining self-supervised learning and imitation for vision-based rope manipulation , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[32]  Li Wang,et al.  The Robotarium: A remotely accessible swarm robotics research testbed , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[33]  Vladimír Petrík,et al.  Folding Clothes Autonomously: A Complete Pipeline , 2016, IEEE Transactions on Robotics.

[34]  Wojciech Zaremba,et al.  OpenAI Gym , 2016, ArXiv.

[35]  Siddhartha S. Srinivasa,et al.  Benchmarking in Manipulation Research: Using the Yale-CMU-Berkeley Object and Model Set , 2015, IEEE Robotics & Automation Magazine.

[36]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[37]  Geoffrey J. Gordon,et al.  A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning , 2010, AISTATS.

[38]  Pieter Abbeel,et al.  Cloth grasp point detection based on multiple-view geometric cues with application to robotic towel folding , 2010, 2010 IEEE International Conference on Robotics and Automation.

[39]  Fei-Fei Li,et al.  ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[40]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.