Learning Preconditions of Hybrid Force-Velocity Controllers for Contact-Rich Manipulation

Robots need to manipulate objects in constrained environments like shelves and cabinets when assisting humans in everyday settings like homes and offices. These constraints make manipulation difficult by reducing grasp accessibility, so robots need to use non-prehensile strategies that leverage object-environment contacts to perform manipulation tasks. To tackle the challenge of planning and controlling contact-rich behaviors in such settings, this work uses Hybrid Force-Velocity Controllers (HFVCs) as the skill representation and plans skill sequences with learned preconditions. While HFVCs naturally enable robust and compliant contact-rich behaviors, solvers that synthesize them have traditionally relied on precise object models and closed-loop feedback on object pose, which are difficult to obtain in constrained environments due to occlusions. We first relax HFVCs' need for precise models and feedback with our HFVC synthesis framework, then learn a point-cloud-based precondition function to classify where HFVC executions will still be successful despite modeling inaccuracies. Finally, we use the learned precondition in a search-based task planner to complete contact-rich manipulation tasks in a shelf domain. Our method achieves a task success rate of $73.2\%$, outperforming the $51.5\%$ achieved by a baseline without the learned precondition. While the precondition function is trained in simulation, it can also transfer to a real-world setup without further fine-tuning. See supplementary materials and videos at https://sites.google.com/view/constrained-manipulation/

[1]  David Held,et al.  Learning to Grasp the Ungraspable with Emergent Extrinsic Dexterity , 2022, CoRL.

[2]  Orion Taylor,et al.  Manipulation of unknown objects via contact configuration regulation , 2022, 2022 International Conference on Robotics and Automation (ICRA).

[3]  Mohit Sharma,et al.  Search-Based Task Planning with Learned Skill Effect Models for Lifelong Robotic Manipulation , 2021, 2022 International Conference on Robotics and Automation (ICRA).

[4]  M. T. Mason,et al.  Contact Mode Guided Motion Planning for Quasidynamic Dexterous Manipulation in 3D , 2021, 2022 International Conference on Robotics and Automation (ICRA).

[5]  Dieter Fox,et al.  Contact-GraspNet: Efficient 6-DoF Grasp Generation in Cluttered Scenes , 2021, 2021 IEEE International Conference on Robotics and Automation (ICRA).

[6]  Muhammad Suhail Saleem,et al.  Manipulation Planning Among Movable Obstacles Using Physics-Based Adaptive Motion Primitives , 2021, 2021 IEEE International Conference on Robotics and Automation (ICRA).

[7]  Kim Doang Nguyen,et al.  Nonprehensile Manipulation:A Trajectory-Planning Perspective , 2021, IEEE/ASME Transactions on Mechatronics.

[8]  Silvio Savarese,et al.  Deep Affordance Foresight: Planning Through What Can Be Done in the Future , 2020, 2021 IEEE International Conference on Robotics and Automation (ICRA).

[9]  Pulkit Agrawal,et al.  A Long Horizon Planning Framework for Manipulating Rigid Pointcloud Objects , 2020, CoRL.

[10]  M. T. Mason,et al.  An Efficient Closed-Form Method for Optimal Hybrid Force-Velocity Control , 2020, 2021 IEEE International Conference on Robotics and Automation (ICRA).

[11]  Oliver Kroemer,et al.  Learning to Compose Hierarchical Object-Centric Controllers for Robotic Manipulation , 2020, CoRL.

[12]  Mohit Sharma,et al.  A Modular Robotic Arm Control Stack for Research: Franka-Interface and FrankaPy , 2020, ArXiv.

[13]  Xianyi Cheng,et al.  Contact Mode Guided Sampling-Based Planning for Quasistatic Dexterous Manipulation in 2D , 2020, 2021 IEEE International Conference on Robotics and Automation (ICRA).

[14]  Zherong Pan,et al.  Decision Making in Joint Push-Grasp Action Space for Large-Scale Object Sorting , 2020, 2021 IEEE International Conference on Robotics and Automation (ICRA).

[15]  Pierre Sermanet,et al.  Broadly-Exploring, Local-Policy Trees for Long-Horizon Task Planning , 2020, CoRL.

[16]  Abdeslam Boularias,et al.  A Probabilistic Model for Planar Sliding of Objects with Unknown Material Properties: Identification and Robust Planning , 2020, 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[17]  Alberto Rodriguez,et al.  A Global Quasi-Dynamic Model for Contact-Trajectory Optimization in Manipulation , 2020, Robotics: Science and Systems.

[18]  Caelan Reed Garrett,et al.  Learning compositional models of robot skills for task and motion planning , 2020, Int. J. Robotics Res..

[19]  Matthew T. Mason,et al.  Manipulation with Shared Grasping , 2020, Robotics: Science and Systems.

[20]  Abdeslam Boularias,et al.  Learning to Slide Unknown Objects with Differentiable Physics Simulations , 2020, Robotics: Science and Systems.

[21]  Alberto Rodriguez,et al.  Reactive planar non-prehensile manipulation with hybrid model predictive control , 2020, Int. J. Robotics Res..

[22]  Haoruo Zhang,et al.  Learning efficient push and grasp policy in a totebox from simulation , 2020, Adv. Robotics.

[23]  Zhibin Li,et al.  Learning Pregrasp Manipulation of Objects from Ungraspable Poses , 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[24]  Alberto Rodriguez,et al.  Hybrid Differential Dynamic Programming for Planar Manipulation Primitives , 2019, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[25]  Ken Goldberg,et al.  Robust Toppling for Vacuum Suction Grasping , 2019, 2019 IEEE 15th International Conference on Automation Science and Engineering (CASE).

[26]  Siddhartha S. Srinivasa,et al.  Sample-Efficient Learning of Nonprehensile Manipulation Policies via Physics-Based Informed State Distributions , 2018, ArXiv.

[27]  Dieter Fox,et al.  GPU-Accelerated Robotic Simulation for Distributed Reinforcement Learning , 2018, CoRL.

[28]  Kenneth Y. Goldberg,et al.  Linear Push Policies to Increase Grasp Access for Robot Bin Picking , 2018, 2018 IEEE 14th International Conference on Automation Science and Engineering (CASE).

[29]  Toshiaki Tsuji,et al.  Sequence-to-Sequence Model for Trajectory Planning of Nonprehensile Manipulation Including Contact Model , 2018, IEEE Robotics and Automation Letters.

[30]  Marc Toussaint,et al.  Differentiable Physics and Stable Modes for Tool-Use and Manipulation Planning , 2018, Robotics: Science and Systems.

[31]  Emanuel Todorov,et al.  Reinforcement learning for non-prehensile manipulation: Transfer from simulation to physical system , 2018, 2018 IEEE International Conference on Simulation, Modeling, and Programming for Autonomous Robots (SIMPAR).

[32]  Alberto Rodriguez,et al.  Learning Synergies Between Pushing and Grasping with Self-Supervised Deep Reinforcement Learning , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[33]  Danica Kragic,et al.  Rearrangement with Nonprehensile Manipulation Using Deep Reinforcement Learning , 2018, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[34]  Vincenzo Lippiello,et al.  Nonprehensile Dynamic Manipulation: A Survey , 2018, IEEE Robotics and Automation Letters.

[35]  Leslie Pack Kaelbling,et al.  From Skills to Symbols: Learning Symbolic Representations for Abstract High-Level Planning , 2018, J. Artif. Intell. Res..

[36]  Vladlen Koltun,et al.  Open3D: A Modern Library for 3D Data Processing , 2018, ArXiv.

[37]  Leonidas J. Guibas,et al.  PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space , 2017, NIPS.

[38]  Kevin M. Lynch,et al.  Planning and control for dynamic, nonprehensile, and hybrid manipulation tasks , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[39]  Oliver Brock,et al.  Planning grasp strategies That Exploit Environmental Constraints , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[40]  P. Abbeel,et al.  Benchmarking in Manipulation Research: The YCB Object and Model Set and Benchmarking Protocols , 2015, ArXiv.

[41]  Siddhartha S. Srinivasa,et al.  Pregrasp Manipulation as Trajectory Optimization , 2013, Robotics: Science and Systems.

[42]  James M. Rehg,et al.  Guided pushing for object singulation , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[43]  Zoran Popovic,et al.  Discovery of complex behaviors through contact-invariant optimization , 2012, ACM Trans. Graph..

[44]  Tamim Asfour,et al.  Templates for pre-grasp sliding interactions , 2012, Robotics Auton. Syst..

[45]  S. Srinivasa,et al.  Push-grasping with dexterous hands: Mechanics and a method , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[46]  Siddhartha S. Srinivasa,et al.  Planning pre-grasp manipulation for transport tasks , 2010, 2010 IEEE International Conference on Robotics and Automation.

[47]  Ales Ude,et al.  Autonomous acquisition of pushing actions to support object grasping with a humanoid robot , 2009, 2009 9th IEEE-RAS International Conference on Humanoid Robots.

[48]  Doina Precup,et al.  Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..

[49]  Kevin M. Lynch,et al.  Dynamic Nonprehensile Manipulation: Controllability, Planning, and Experiments , 1999, Int. J. Robotics Res..

[50]  Kevin M. Lynch,et al.  Dynamic underactuated nonprehensile manipulation , 1996, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS '96.

[51]  Richard E. Korf,et al.  Real-Time Heuristic Search , 1990, Artif. Intell..

[52]  Matthew T. Mason,et al.  Compliance and Force Control for Computer Controlled Manipulators , 1981, IEEE Transactions on Systems, Man, and Cybernetics.