Probabilistic inference for structured planning in robotics

Real-world robotic environments are highly structured. The scalability of planning and reasoning methods to cope with complex problems in such environments crucially depends on exploiting this structure. We propose a new approach to planning in robotics based on probabilistic inference. The method uses structured Dynamic Bayesian Networks to represent the scenario and efficient inference techniques (loopy belief propagation) to solve planning problems. In principle, any kind of factored or hierarchical state representations can be accounted for. We demonstrate the approach on reaching tasks under collision avoidance constraints with a humanoid upper body.

[1]  Craig Boutilier,et al.  Exploiting Structure in Policy Construction , 1995, IJCAI.

[2]  B. Faverjon,et al.  Probabilistic Roadmaps for Path Planning in High-Dimensional Con(cid:12)guration Spaces , 1996 .

[3]  Daphne Koller,et al.  Computing Factored Value Functions for Policies in Structured MDPs , 1999, IJCAI.

[4]  Steven M. LaValle,et al.  RRT-connect: An efficient approach to single-query path planning , 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065).

[5]  Brendan J. Frey,et al.  Factor graphs and the sum-product algorithm , 2001, IEEE Trans. Inf. Theory.

[6]  Dale Schuurmans,et al.  Direct value-approximation for factored MDPs , 2001, NIPS.

[7]  Tom Minka,et al.  A family of algorithms for approximate Bayesian inference , 2001 .

[8]  Stuart J. Russell,et al.  Dynamic bayesian networks: representation, inference and learning , 2002 .

[9]  Shobha Venkataraman,et al.  Efficient Solution Algorithms for Factored MDPs , 2003, J. Artif. Intell. Res..

[10]  Leslie Pack Kaelbling,et al.  Representing hierarchical POMDPs as DBNs for multi-scale robot localization , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[11]  Michael I. Jordan,et al.  Factorial Hidden Markov Models , 1995, Machine Learning.

[12]  Yoram Singer,et al.  The Hierarchical Hidden Markov Model: Analysis and Applications , 1998, Machine Learning.

[13]  Weiwei Li,et al.  Hierarchical optimal control of redundant biomechanical systems , 2004, The 26th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[14]  Jun Nakanishi,et al.  A unifying methodology for the control of robotic systems , 2005, 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[15]  Jun Nakanishi,et al.  A Unifying Framework for the Control of Robotic Systems , 2005 .

[16]  Milos Hauskrecht,et al.  Solving Factored MDPs with Exponential-Family Transition Models , 2006, ICAPS.

[17]  Marc Toussaint,et al.  Probabilistic inference for solving discrete and continuous state Markov Decision Processes , 2006, ICML.

[18]  Marc Toussaint,et al.  Probabilistic inference for solving (PO) MDPs , 2006 .