Learning Efficient Constraint Graph Sampling for Robotic Sequential Manipulation

Efficient sampling from constraint manifolds, and thereby generating a diverse set of solutions of feasibility problems, is a fundamental challenge. We consider the case where a problem is factored, that is, the underlying nonlinear mathematical program is decomposed into differentiable equality and inequality constraints, each of which depends only on some variables. Such problems are at the core of efficient and robust sequential robot manipulation planning. Naive sequential conditional sampling of individual variables, as well as fully joint sampling of all variables at once (e.g., leveraging optimization methods), can be highly inefficient and non-robust. We propose a novel framework to learn how to break the overall problem into smaller sequential sampling problems. Specifically, we leverage Monte-Carlo Tree Search to learn which variable subsets should be assigned in which sequential order, in order to minimize the computation time to generate full samples. This strategy allows us to efficiently compute a set of diverse valid robot configurations for mode-switches within sequential manipulation tasks, which are waypoints for subsequent trajectory optimization or sampling-based motion planning algorithms. We show that the learning method quickly converges to the best sampling strategy for a given problem, and outperforms user-defined orderings and joint optimization, while also providing a higher sample diversity. Video: https://youtu.be/xWAjBGACZhs

[1]  Malek Mouhoub,et al.  Heuristic techniques for variable and value ordering in CSPs , 2011, GECCO '11.

[2]  Toby Walsh,et al.  Handbook of Constraint Programming , 2006, Handbook of Constraint Programming.

[3]  Leslie Pack Kaelbling,et al.  Sampling-based methods for factored task and motion planning , 2018, Int. J. Robotics Res..

[4]  Erez Karpas,et al.  Temporal Planning while the Clock Ticks , 2018, ICAPS.

[5]  Marc Toussaint,et al.  Newton methods for k-order Markov Constrained Motion Problems , 2014, ArXiv.

[6]  Siddhartha S. Srinivasa,et al.  Generalized Lazy Search for Robot Motion Planning: Interleaving Search and Edge Evaluation via Event-based Toggles , 2019, ICAPS.

[7]  George B. Dantzig,et al.  Decomposition Principle for Linear Programs , 1960 .

[8]  Manuel Lopes,et al.  Multi-bound tree search for logic-geometric programming in cooperative manipulation domains , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[9]  Brendan J. Frey,et al.  Factor Graphs and Algorithms , 2008 .

[10]  Eiichi Yoshida,et al.  Quotient-Space Motion Planning , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[11]  Mark S. Boddy,et al.  Solving Time-Dependent Planning Problems , 1989, IJCAI.

[12]  Simon M. Lucas,et al.  A Survey of Monte Carlo Tree Search Methods , 2012, IEEE Transactions on Computational Intelligence and AI in Games.

[13]  Csaba Szepesvári,et al.  Bandit Based Monte-Carlo Planning , 2006, ECML.

[14]  Leslie Pack Kaelbling,et al.  Integrated Task and Motion Planning , 2020, Annu. Rev. Control. Robotics Auton. Syst..

[15]  Bart Selman,et al.  Uniform Solution Sampling Using a Constraint Solver As an Oracle , 2012, UAI.

[16]  Zonghao Gu,et al.  Generating Multiple Solutions for Mixed Integer Programming Problems , 2007, IPCO.

[17]  Razvan Pascanu,et al.  Learning model-based planning from scratch , 2017, ArXiv.

[18]  Jean-Claude Latombe,et al.  Multi-modal Motion Planning in Non-expansive Spaces , 2010, Int. J. Robotics Res..

[19]  Lydia E. Kavraki,et al.  Informing Multi-Modal Planning with Synergistic Discrete Leads , 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[20]  Thomas L. Griffiths,et al.  Algorithm selection by rational metareasoning as a model of human strategy selection , 2014, NIPS.

[21]  Jung-Su Ha,et al.  Deep Visual Reasoning: Learning to Predict Action Sequences for Task and Motion Planning from an Initial Scene Image , 2020, Robotics: Science and Systems.

[22]  M. Johnston,et al.  S PIKE : Intelligent Scheduling of Hubble Space Telescope Observations , 1994 .

[23]  Rina Dechter,et al.  Generating random solutions for constraint satisfaction problems , 2002, AAAI/IAAI.

[24]  Erez Karpas,et al.  Rational deployment of multiple heuristics in optimal state-space search , 2018, Artif. Intell..

[25]  Chonhyon Park,et al.  An Efficient Acyclic Contact Planner for Multiped Robots , 2018, IEEE Transactions on Robotics.

[26]  Swarat Chaudhuri,et al.  Incremental Task and Motion Planning: A Constraint-Based Approach , 2016, Robotics: Science and Systems.

[27]  Marc Toussaint,et al.  Hierarchical Task and Motion Planning using Logic-Geometric Programming ( HLGP ) , 2010 .

[28]  Vibhav Gogate,et al.  A New Algorithm for Sampling CSP Solutions Uniformly at Random , 2006, CP.

[29]  Wheeler Ruml,et al.  Metareasoning in Real-Time Heuristic Search , 2015, SOCS.

[30]  Marc Toussaint,et al.  Differentiable Physics and Stable Modes for Tool-Use and Manipulation Planning , 2018, Robotics: Science and Systems.

[31]  Lydia E. Kavraki,et al.  On the advantages of task motion multigraphs for efficient mobile manipulation , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[32]  David Silver,et al.  Combining online and offline knowledge in UCT , 2007, ICML '07.

[33]  Jung-Su Ha,et al.  Learning Geometric Reasoning and Control for Long-Horizon Tasks from Visual Input , 2021, 2021 IEEE International Conference on Robotics and Automation (ICRA).

[34]  Jacques F. Benders,et al.  Partitioning procedures for solving mixed-variables programming problems , 2005, Comput. Manag. Sci..

[35]  D. Bertsekas Convexification procedures and decomposition methods for nonconvex optimization problems , 1979 .

[36]  Peter Auer,et al.  Finite-time Analysis of the Multiarmed Bandit Problem , 2002, Machine Learning.

[37]  Lydia E. Kavraki,et al.  Sampling-Based Methods for Motion Planning with Constraints , 2018, Annu. Rev. Control. Robotics Auton. Syst..

[38]  Alessandro Saffiotti,et al.  Efficiently combining task and motion planning using geometric constraints , 2014, Int. J. Robotics Res..

[39]  Achim Menges,et al.  Robust Task and Motion Planning for Long-Horizon Architectural Construction Planning , 2020, 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[40]  Stuart J. Russell,et al.  Principles of Metareasoning , 1989, Artif. Intell..

[41]  Stephen P. Boyd,et al.  Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers , 2011, Found. Trends Mach. Learn..

[42]  Shlomo Zilberstein Metareasoning and Bounded Rationality , 2011, Metareasoning.