Integrated motor control, planning, grasping and high-level reasoning in a blocks world using probabilistic inference

A new approach to planning and goal-directed behavior has recently been proposed using probabilistic inference in a graphical model that represents states, actions, constraints and goals of the future to infer appropriate actions and controls. The approach has led to new algorithms on the control and trajectory optimization level as well as for high-level rule-based planning in relational domains. In this paper we integrate these methods to a coherent control, trajectory optimization, and action planning architecture, using the principle of planning by inference across all levels of abstractions. Our scenario is a real blocks world: using a 14DoF Schunk arm and hand with tactile sensors and a stereo camera, the goal is to manipulate a set of objects on the table in a goal-oriented way. For high-level reasoning, we learn relational rule-based models from experience in simulation.

[1]  M. Ciletti,et al.  The computation and theory of optimal control , 1972 .

[2]  S. Yakowitz,et al.  Differential dynamic programming and Newton's method for discrete optimal control problems , 1984 .

[3]  Yoshihiko Nakamura,et al.  Inverse kinematic solutions with singularity robustness for robot manipulator control , 1986 .

[4]  T. Yoshikawa,et al.  Task-Priority Based Redundancy Control of Robot Manipulators , 1987 .

[5]  Christopher G. Atkeson,et al.  Using Local Trajectory Optimizers to Speed Up Global Optimization in Dynamic Programming , 1993, NIPS.

[6]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[7]  John K. Slaney,et al.  Blocks World revisited , 2001, Artif. Intell..

[8]  Henrik I. Christensen,et al.  Automatic grasp planning using shape primitives , 2003, 2003 IEEE International Conference on Robotics and Automation (Cat. No.03CH37422).

[9]  Peter Norvig,et al.  Artificial intelligence - a modern approach, 2nd Edition , 2003, Prentice Hall series in artificial intelligence.

[10]  Jun Nakanishi,et al.  A unifying methodology for the control of robotic systems , 2005, 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[11]  Jun Nakanishi,et al.  A Unifying Framework for the Control of Robotic Systems , 2005 .

[12]  Helge J. Ritter,et al.  Task-oriented quality measures for dextrous grasping , 2005, 2005 International Symposium on Computational Intelligence in Robotics and Automation.

[13]  Michael Gienger,et al.  Task-oriented whole body motion for humanoid robots , 2005, 5th IEEE-RAS International Conference on Humanoid Robots, 2005..

[14]  Raúl Suárez Feijóo,et al.  Grasp quality measures , 2006 .

[15]  Dmitry Berenson,et al.  Grasp planning in complex scenes , 2007, 2007 7th IEEE-RAS International Conference on Humanoid Robots.

[16]  William D. Smart,et al.  Receding Horizon Differential Dynamic Programming , 2007, NIPS.

[17]  Marc Toussaint,et al.  Probabilistic inference for structured planning in robotics , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[18]  L. P. Kaelbling,et al.  Learning Symbolic Models of Stochastic Domains , 2007, J. Artif. Intell. Res..

[19]  Marc Toussaint,et al.  Optimization of fluent approach and grasp motions , 2008, Humanoids 2008 - 8th IEEE-RAS International Conference on Humanoid Robots.

[20]  Emanuel Todorov,et al.  General duality between optimal control and estimation , 2008, 2008 47th IEEE Conference on Decision and Control.

[21]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[22]  Marc Toussaint,et al.  Approximate inference for planning in stochastic relational worlds , 2009, ICML '09.

[23]  Christopher Hunt,et al.  Notes on the OpenSURF Library , 2009 .

[24]  Marc Toussaint,et al.  Robot trajectory optimization using approximate inference , 2009, ICML '09.

[25]  Marc Toussaint,et al.  Learning model-free robot control by a Monte Carlo EM algorithm , 2009, Auton. Robots.

[26]  Marc Toussaint,et al.  Relevance Grounding for Planning in Relational Domains , 2009, ECML/PKDD.

[27]  Jan Peters,et al.  Noname manuscript No. (will be inserted by the editor) Policy Search for Motor Primitives in Robotics , 2022 .

[28]  Marc Toussaint,et al.  A Bayesian View on Motor Control and Planning , 2010, From Motor Learning to Interaction Learning in Robots.

[29]  Vicenç Gómez,et al.  Optimal control as a graphical model inference problem , 2009, Machine Learning.

[30]  De,et al.  Relational Reinforcement Learning , 2022 .