Learning Geometric Reasoning and Control for Long-Horizon Tasks from Visual Input

Long-horizon manipulation tasks require joint reasoning over a sequence of discrete actions and their associated continuous control parameters. While Task and Motion Planning (TAMP) approaches are capable of generating motion plans that account for this joint reasoning, they usually assume full knowledge about the environment (e.g. in terms of shapes, poses of objects) and often require computation times not suitable for real-time control.To overcome this, we propose a learning framework where a high-level reasoning network predicts, based on an image of the scene, a sequence of discrete actions and the parameter values of their associated low-level controllers. These controllers are parameterized in terms of a learned energy function, leading to time-invariant controllers for each phase. We train the whole framework end-to-end using a dataset of TAMP solutions computed using Logic Geometric Programming. A key feature is that the reasoning network determines the parameters of the controllers jointly, such that the overall task can be solved. Despite having no explicit representation of the geometry nor pose of the objects in the scene, our network is still able to accomplish geometrically precise manipulation tasks, including handovers and an accurate pointing task where the parameters of early actions are tightly coupled with those of later actions. Video: https://youtu.be/AcPWRTkr3_g

[1]  Gert Kootstra,et al.  International Conference on Robotics and Automation (ICRA) , 2008, ICRA 2008.

[2]  Leslie Pack Kaelbling,et al.  Hierarchical Planning in the Now , 2010, Bridging the Gap Between Task and Motion Planning.

[3]  Alessandro Saffiotti,et al.  Constraint propagation on interval bounds for dealing with geometric backtracking , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[4]  Rachid Alami,et al.  Towards Combining HTN Planning and Geometric Task Planning , 2013, ArXiv.

[5]  Emanuel Todorov,et al.  Combining the benefits of function approximation and trajectory optimization , 2014, Robotics: Science and Systems.

[6]  Leslie Pack Kaelbling,et al.  A constraint-based method for solving sequential manipulation planning problems , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[7]  Pieter Abbeel,et al.  Combined task and motion planning through an extensible planner-independent interface layer , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[8]  Marc Toussaint,et al.  Logic-Geometric Programming: An Optimization-Based Approach to Combined Task and Motion Planning , 2015, IJCAI.

[9]  Oliver Kroemer,et al.  Towards learning hierarchical skills for multi-phase manipulation tasks , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[10]  Pieter Abbeel,et al.  Value Iteration Networks , 2016, NIPS.

[11]  Sergey Levine,et al.  End-to-End Training of Deep Visuomotor Policies , 2015, J. Mach. Learn. Res..

[12]  Dylan Hadfield-Menell,et al.  Guided search for task and motion plans using learned heuristics , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[13]  Leslie Pack Kaelbling,et al.  Learning to Rank for Synthesizing Planning Heuristics , 2016, IJCAI.

[14]  Vladlen Koltun,et al.  Learning to Act by Predicting the Future , 2016, ICLR.

[15]  Luca Rigazio,et al.  Path Integral Networks: End-to-End Differentiable Optimal Control , 2017, ArXiv.

[16]  Sergey Levine,et al.  Self-Supervised Visual Planning with Temporal Skip Connections , 2017, CoRL.

[17]  Leslie Pack Kaelbling,et al.  Learning to guide task and motion planning using score-space representation , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[18]  Marc Toussaint,et al.  Differentiable Physics and Stable Modes for Tool-Use and Manipulation Planning , 2018, Robotics: Science and Systems.

[19]  Daniel Kappler,et al.  Riemannian Motion Policies , 2018, ArXiv.

[20]  Byron Boots,et al.  Differentiable MPC for End-to-end Planning and Control , 2018, NeurIPS.

[21]  Alberto Rodriguez,et al.  Reactive Planar Manipulation with Convex Hybrid MPC , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[22]  Swarat Chaudhuri,et al.  An incremental constraint-based framework for task and motion planning , 2018, Int. J. Robotics Res..

[23]  Leslie Pack Kaelbling,et al.  Active Model Learning and Diverse Action Sampling for Task and Motion Planning , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[24]  Han-Lim Choi,et al.  Adaptive path-integral autoencoder: representation learning and planning for dynamical systems , 2018, NeurIPS.

[25]  Marco Pavone,et al.  Learning Sampling Distributions for Robot Motion Planning , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[26]  Allan Jabri,et al.  Universal Planning Networks: Learning Generalizable Representations for Visuomotor Control , 2018, ICML.

[27]  Li Fei-Fei,et al.  Dynamics Learning with Cascaded Variational Inference for Multi-Step Manipulation , 2019, CoRL.

[28]  Lydia E. Kavraki,et al.  Learning Feasibility for Task and Motion Planning in Tabletop Environments , 2019, IEEE Robotics and Automation Letters.

[29]  Daniel Leidner,et al.  Iteratively Refined Feasibility Checks in Robotic Assembly Sequence Planning , 2019, IEEE Robotics and Automation Letters.

[30]  Dieter Fox,et al.  Representing Robot Task Plans as Robust Logical-Dynamical Systems , 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[31]  Sergey Levine,et al.  Relay Policy Learning: Solving Long-Horizon Tasks via Imitation and Reinforcement Learning , 2019, CoRL.

[32]  Learning Latent Plans from Play , 2019, CoRL.

[33]  Sergey Levine,et al.  Improvisation through Physical Understanding: Using Novel Objects as Tools with Visual Foresight , 2019, Robotics: Science and Systems.

[34]  Gregory D. Hager,et al.  Visual Robot Task Planning , 2018, 2019 International Conference on Robotics and Automation (ICRA).

[35]  Alberto Rodriguez,et al.  Hybrid Differential Dynamic Programming for Planar Manipulation Primitives , 2019, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[36]  Lawson L. S. Wong,et al.  Deep Imitation Learning for Bimanual Robotic Manipulation , 2020, NeurIPS.

[37]  Karol Hausman,et al.  Modeling Long-horizon Tasks as Sequential Interaction Landscapes , 2020, CoRL.

[38]  Jung-Su Ha,et al.  Deep Visual Reasoning: Learning to Predict Action Sequences for Task and Motion Planning from an Initial Scene Image , 2020, Robotics: Science and Systems.

[39]  Chelsea Finn,et al.  Hierarchical Foresight: Self-Supervised Learning of Long-Horizon Tasks via Visual Subgoal Generation , 2019, ICLR.

[40]  A. Gupta,et al.  Efficient Bimanual Manipulation Using Learned Task Schemas , 2019, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[41]  Pulkit Agrawal,et al.  A Long Horizon Planning Framework for Manipulating Rigid Pointcloud Objects , 2020, CoRL.

[42]  Leslie Pack Kaelbling,et al.  Integrated Task and Motion Planning , 2020, Annu. Rev. Control. Robotics Auton. Syst..

[43]  Chelsea Finn,et al.  Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors , 2020, NeurIPS.

[44]  Russ Tedrake,et al.  Keypoints into the Future: Self-Supervised Correspondence in Model-Based Reinforcement Learning , 2020, CoRL.

[45]  Hammad Mazhar,et al.  Transferable Task Execution from Pixels through Deep Planning Domain Learning , 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[46]  Jung-Su Ha,et al.  Describing Physics For Physical Reasoning: Force-Based Sequential Manipulation Planning , 2020, IEEE Robotics and Automation Letters.

[47]  Jung-Su Ha,et al.  A Probabilistic Framework for Constrained Manipulations and Task and Motion Planning under Uncertainty , 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[48]  Toki Migimatsu,et al.  Object-Centric Task and Motion Planning in Dynamic Environments , 2019, IEEE Robotics and Automation Letters.

[49]  Jung-Su Ha,et al.  Deep Visual Heuristics: Learning Feasibility of Mixed-Integer Programs for Manipulation Planning , 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[50]  George Konidaris,et al.  Option Discovery using Deep Skill Chaining , 2020, ICLR.

[51]  M. Hutter,et al.  MPC-Net: A First Principles Guided Policy Search , 2019, IEEE Robotics and Automation Letters.

[52]  Danny Driess,et al.  Deep 6-DoF Tracking of Unknown Objects for Reactive Grasping , 2021, 2021 IEEE International Conference on Robotics and Automation (ICRA).

[53]  Russ Tedrake,et al.  The Surprising Effectiveness of Linear Models for Visual Foresight in Object Pile Manipulation , 2020, WAFR.

[54]  Han-Lim Choi,et al.  Distilling a Hierarchical Policy for Planning and Control via Representation and Reinforcement Learning , 2020, 2021 IEEE International Conference on Robotics and Automation (ICRA).

[55]  Kartic Subr,et al.  Action Sequencing Using Visual Permutations , 2020, IEEE Robotics and Automation Letters.

[56]  Leslie Pack Kaelbling,et al.  Planning with Learned Object Importance in Large Problem Instances using Graph Neural Networks , 2020, AAAI.

[57]  Silvio Savarese,et al.  Deep Affordance Foresight: Planning Through What Can Be Done in the Future , 2020, 2021 IEEE International Conference on Robotics and Automation (ICRA).

[58]  Pierre Sermanet,et al.  Broadly-Exploring, Local-Policy Trees for Long-Horizon Task Planning , 2020, CoRL.

[59]  Marc Toussaint,et al.  Learning Efficient Constraint Graph Sampling for Robotic Sequential Manipulation , 2020, 2021 IEEE International Conference on Robotics and Automation (ICRA).

[60]  Ye Zhao,et al.  SyDeBO: Symbolic-Decision-Embedded Bilevel Optimization for Long-Horizon Manipulation in Dynamic Environments , 2020, IEEE Access.

[61]  Russ Tedrake,et al.  Warm Start of Mixed-Integer Programs for Model Predictive Control of Hybrid Systems , 2019, IEEE Transactions on Automatic Control.