Planar Robot Casting with Real2Sim2Real Self-Supervised Learning

Manipulation of deformable objects using a single parameterized dynamic action can be useful for tasks such as fly fishing, lofting a blanket, and playing shuffleboard. Such tasks take as input a desired final state and output one parameterized open-loop dynamic robot action which produces a trajectory toward the final state. This is especially challenging for long-horizon trajectories with complex dynamics involving friction. This paper explores the task of Planar Robot Casting (PRC): where one planar motion of a robot wrist holding one end of a cable causes the other end to slide across the plane toward a desired target. PRC allows the cable to reach points beyond the robot’s workspace and has applications for cable management in homes, warehouses, and factories. To efficiently learn a PRC policy for a given cable, we propose Real2Sim2Real, a self-supervised framework that automatically collects physical trajectory examples to tune parameters of a dynamics simulator using Differential Evolution, generates many simulated examples, and then learns a policy using a weighted combination of simulated and physical data. We evaluate Real2Sim2Real with three simulators, Isaac Gymsegmented, Isaac Gym-hybrid, and PyBullet, two function approximators, Gaussian Processes and Neural Networks (NNs), and three cables with differing stiffness, torsion, and friction. Results on 16 held-out test targets for each cable suggest that the NN PRC policies using Isaac Gym-segmented attain median error distance (as % of cable length) ranging from 8% to 14%, outperforming baselines and policies trained on only real or only simulated examples. Code, data, and videos are available at https://tinyurl.com/robotcast.

[1]  Priya Sundaresan,et al.  Untangling Dense Knots by Learning Task-Relevant Keypoints , 2020, CoRL.

[2]  M.T. Mason,et al.  Dynamic manipulation , 1993, Proceedings of 1993 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS '93).

[3]  Peter I. Frazier,et al.  A Tutorial on Bayesian Optimization , 2018, ArXiv.

[4]  Masatoshi Ishikawa,et al.  One-handed knotting of a flexible rope with a high-speed multifingered hand having tactile sensors , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[5]  Jitendra Malik,et al.  Zero-Shot Visual Imitation , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[6]  Byron Boots,et al.  Continuous-time Gaussian process motion planning via probabilistic inference , 2017, Int. J. Robotics Res..

[7]  Trevor Darrell,et al.  Auto-Tuned Sim-to-Real Transfer , 2021, 2021 IEEE International Conference on Robotics and Automation (ICRA).

[8]  Pieter Abbeel,et al.  Learning to Manipulate Deformable Objects without Demonstrations , 2019, Robotics: Science and Systems.

[9]  Joseph K. Kearney,et al.  A Case Study of Flexible Object Manipulation , 1991, Int. J. Robotics Res..

[10]  Nando de Freitas,et al.  Taking the Human Out of the Loop: A Review of Bayesian Optimization , 2016, Proceedings of the IEEE.

[11]  M. Gutmann,et al.  Stir to Pour: Efficient Calibration of Liquid Properties for Pouring Actions , 2020, 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[12]  Andrew J. Davison,et al.  Sim-to-Real Reinforcement Learning for Deformable Object Manipulation , 2018, CoRL.

[13]  Ken Goldberg,et al.  Deep learning can accelerate grasp-optimized motion planning , 2020, Science Robotics.

[14]  Jürgen Schmidhuber,et al.  A System for Robotic Heart Surgery that Learns to Tie Knots Using Recurrent Neural Networks , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[15]  E. Adelson,et al.  Cable manipulation with a tactile-reactive gripper , 2019, Robotics: Science and Systems.

[16]  Ashutosh Saxena,et al.  Tangled: Learning to untangle ropes with RGB-D perception , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[17]  Roi Poranne,et al.  Dynamic Manipulation of Deformable Objects With Implicit Integration , 2021, IEEE Robotics and Automation Letters.

[18]  Young-Ho Kim,et al.  Using a Compliant, Unactuated Tail to Manipulate Objects , 2017, IEEE Robotics and Automation Letters.

[19]  Steven M. LaValle,et al.  Randomized Kinodynamic Planning , 1999, Proceedings 1999 IEEE International Conference on Robotics and Automation (Cat. No.99CH36288C).

[20]  Gang Wang,et al.  Analysis of fly fishing rod casting dynamics , 2011 .

[21]  Yevgen Chebotar,et al.  Closing the Sim-to-Real Loop: Adapting Simulation Randomization with Real World Experience , 2018, 2019 International Conference on Robotics and Automation (ICRA).

[22]  Dieter Fox,et al.  BayesSim: adaptive domain randomization via probabilistic inference for robotics simulators , 2019, Robotics: Science and Systems.

[23]  Joel Nothman,et al.  SciPy 1.0-Fundamental Algorithms for Scientific Computing in Python , 2019, ArXiv.

[24]  Inman Harvey,et al.  Noise and the Reality Gap: The Use of Simulation in Evolutionary Robotics , 1995, ECAL.

[25]  Tae-Yong Kim,et al.  Unified particle physics for real-time applications , 2014, ACM Trans. Graph..

[26]  Hirofumi Nakagaki,et al.  Study of deformation and insertion tasks of a flexible wire , 1997, Proceedings of International Conference on Robotics and Automation.

[27]  Kenneth Y. Goldberg,et al.  Automating multi-throw multilateral surgical suturing with a mechanical needle guide and sequential convex optimization , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[28]  David Held,et al.  SoftGym: Benchmarking Deep Reinforcement Learning for Deformable Object Manipulation , 2020, CoRL.

[29]  Andrea L. Thomaz,et al.  TuneNet: One-Shot Residual Tuning for System Identification and Sim-to-Real Robot Task Transfer , 2019, CoRL.

[30]  Wojciech Zaremba,et al.  Domain randomization for transferring deep neural networks from simulation to the real world , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[31]  Pieter Abbeel,et al.  Learning Predictive Representations for Deformable Objects Using Contrastive Estimation , 2020, CoRL.

[32]  Daniel Seita,et al.  Robots of the Lost Arc: Learning to Dynamically Manipulate Fixed-Endpoint Ropes and Cables , 2020, ArXiv.

[33]  Kazutoshi Tanaka,et al.  An analytical diabolo model for robotic learning and control , 2020, ArXiv.

[34]  Aude Billard,et al.  Catching Objects in Flight , 2014, IEEE Transactions on Robotics.

[35]  Jun Takamatsu,et al.  Knot planning from observation , 2003, 2003 IEEE International Conference on Robotics and Automation (Cat. No.03CH37422).

[36]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[37]  Jitendra Malik,et al.  Combining self-supervised learning and imitation for vision-based rope manipulation , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[38]  K. Bathe Finite Element Procedures , 1995 .

[39]  Rainer Storn,et al.  Differential Evolution – A Simple and Efficient Heuristic for global Optimization over Continuous Spaces , 1997, J. Glob. Optim..

[40]  Masatoshi Ishikawa,et al.  Dynamic High-Speed Knotting of a Rope by a Manipulator , 2013 .

[41]  Sergey Levine,et al.  (CAD)$^2$RL: Real Single-Image Flight without a Single Real Image , 2016, Robotics: Science and Systems.

[42]  Kevin M. Lynch,et al.  Dynamic Nonprehensile Manipulation: Controllability, Planning, and Experiments , 1999, Int. J. Robotics Res..

[43]  Alberto Rodriguez,et al.  TossingBot: Learning to Throw Arbitrary Objects With Residual Physics , 2019, IEEE Transactions on Robotics.

[44]  Masatoshi Ishikawa,et al.  Motion planning for dynamic knotting of a flexible rope with a high-speed robot arm , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[45]  Jonathan Tompson,et al.  Learning to Rearrange Deformable Cables, Fabrics, and Bags with Goal-Conditioned Transporter Networks , 2020, ArXiv.

[46]  Peter R. Florence,et al.  Transporter Networks: Rearranging the Visual World for Robotic Manipulation , 2020, CoRL.

[47]  Noel C. Perkins,et al.  Numerical model for the dynamics of a coupled fly line/fly rod system and experimental validation , 2004 .

[48]  Marcin Andrychowicz,et al.  Sim-to-Real Transfer of Robotic Control with Dynamics Randomization , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[49]  David Howard,et al.  Traversing the Reality Gap via Simulator Tuning , 2020, ArXiv.

[50]  Tao Wang,et al.  Automatic Gait Optimization with Gaussian Process Regression , 2007, IJCAI.

[51]  Masatoshi Ishikawa,et al.  Simple Model and Deformation Control of a Flexible Rope using Constant, High-Speed Motion of a Robot Arm , 2012, 2012 IEEE International Conference on Robotics and Automation.

[52]  Sergey Levine,et al.  Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.

[53]  Sergey Levine,et al.  How to train your robot with deep reinforcement learning: lessons we have learned , 2021, Int. J. Robotics Res..

[54]  Dmitry Berenson,et al.  An online method for tight-tolerance insertion tasks for string and rope , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[55]  Pieter Abbeel,et al.  Superhuman performance of surgical tasks by robots using iterative learning from human-guided demonstrations , 2010, 2010 IEEE International Conference on Robotics and Automation.

[56]  Taskin Padir,et al.  Sim2Real2Sim: Bridging the Gap Between Simulation and Real-World in Flexible Object Manipulation , 2020, 2020 Fourth IEEE International Conference on Robotic Computing (IRC).

[57]  Belhassen-Chedli Bouzgarrou,et al.  Robotic manipulation and sensing of deformable objects in domestic and industrial applications: a survey , 2018, Int. J. Robotics Res..

[58]  Xinyu Liu,et al.  Dex-Net 2.0: Deep Learning to Plan Robust Grasps with Synthetic Point Clouds and Analytic Grasp Metrics , 2017, Robotics: Science and Systems.

[59]  Lennart Ljung,et al.  System Identification: Theory for the User , 1987 .

[60]  E. Adelson,et al.  SwingBot: Learning Physical Features from In-hand Tactile Exploration for Dynamic Swing-up Manipulation , 2020, 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[61]  Devin J. Balkcom,et al.  Tying knot precisely , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).