Optimal and Efficient Stochastic Motion Planning in Partially-Known Environments

A framework capable of computing optimal control policies for a continuous system in the presence of both action and environment uncertainty is presented in this work. The framework decomposes the planning problem into two stages: an offline phase that reasons only over action uncertainty and an online phase that quickly reacts to the uncertain environment. Offline, a bounded-parameter Markov decision process (BMDP) is employed to model the evolution of the stochastic system over a discretization of the environment. Online, an optimal control policy over the BMDP is computed. Upon the discovery of an unknown environment feature during policy execution, the BMDP is updated and the optimal control policy is efficiently recomputed. Depending on the desired quality of the control policy, a suite of methods is presented to incorporate new information into the BMDP with varying degrees of detail online. Experiments confirm that the framework recomputes high-quality policies in seconds and is orders of magnitude faster than existing methods.

[1]  Sven Koenig,et al.  Fast replanning for navigation in unknown terrain , 2005, IEEE Transactions on Robotics.

[2]  Robert Givan,et al.  Bounded-parameter Markov decision processes , 2000, Artif. Intell..

[3]  Nancy M. Amato,et al.  FIRM: Sampling-based feedback motion-planning under motion uncertainty and imperfect measurements , 2014, Int. J. Robotics Res..

[4]  H. Kushner Numerical Methods for Stochastic Control Problems in Continuous Time , 2000 .

[5]  Reid G. Simmons,et al.  Probabilistic Robot Navigation in Partially Observable Environments , 1995, IJCAI.

[6]  Emilio Frazzoli,et al.  An incremental sampling-based algorithm for stochastic optimal control , 2012, 2012 IEEE International Conference on Robotics and Automation.

[7]  Leslie Pack Kaelbling,et al.  Planning under Time Constraints in Stochastic Domains , 1993, Artif. Intell..

[8]  John N. Tsitsiklis,et al.  The Complexity of Markov Decision Processes , 1987, Math. Oper. Res..

[9]  Jonathan Richard Shewchuk,et al.  Delaunay refinement algorithms for triangular mesh generation , 2002, Comput. Geom..

[10]  Nicholas Roy,et al.  Efficient Planning under Uncertainty with Macro-actions , 2014, J. Artif. Intell. Res..

[11]  Han-Pang Huang,et al.  Robot Motion Planning in Dynamic Uncertain Environments , 2011, Adv. Robotics.

[12]  John G. Kemeny,et al.  Finite Markov chains , 1960 .

[13]  Nicholas Roy,et al.  Robust Navigation Execution by Planning in Belief Space , 2013 .

[14]  Anthony Stentz,et al.  The Focussed D* Algorithm for Real-Time Replanning , 1995, IJCAI.

[15]  Thierry Fraichard,et al.  Robust motion planning using Markov decision processes and quadtree decomposition , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[16]  Sebastian Thrun,et al.  Anytime search in dynamic graphs , 2008, Artif. Intell..

[17]  Thierry Siméon,et al.  The Stochastic Motion Roadmap: A Sampling Framework for Planning with Markov Motion Uncertainty , 2007, Robotics: Science and Systems.

[18]  Lydia E. Kavraki,et al.  Fast stochastic motion planning with optimality guarantees using local policy reconfiguration , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[19]  Anthony Stentz,et al.  Optimal and efficient path planning for partially-known environments , 1994, Proceedings of the 1994 IEEE International Conference on Robotics and Automation.

[20]  Wolfram Burgard,et al.  Robotics: Science and Systems XV , 2010 .

[21]  N. Roy,et al.  The Belief Roadmap: Efficient Planning in Belief Space by Factoring the Covariance , 2009, Int. J. Robotics Res..

[22]  Karl Iagnemma,et al.  Stochastic mobility-based path planning in uncertain environments , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[23]  Wolfram Burgard,et al.  The dynamic window approach to collision avoidance , 1997, IEEE Robotics Autom. Mag..

[24]  Oliver Brock,et al.  Planning Long Dynamically-Feasible Maneuvers for Autonomous Vehicles , 2009 .

[25]  Rajeev Sharma,et al.  On Motion Planning in Changing, Partially Predictable Environments , 1997, Int. J. Robotics Res..

[26]  David Hsu,et al.  Planning under Uncertainty for Robotic Tasks with Mixed Observability , 2010, Int. J. Robotics Res..