Topology-guided path integral approach for stochastic optimal control in cluttered environment

This paper addresses planning and control of robot motion under uncertainty that is formulated as a continuous-time, continuous-space stochastic optimal control problem, by developing a topology-guided path integral control method. The path integral control framework, which forms the backbone of the proposed method, re-writes the Hamilton-Jacobi-Bellman equation as a statistical inference problem; the resulting inference problem is solved by a sampling procedure that computes the distribution of controlled trajectories around the trajectory by the passive dynamics. For motion control of robots in a highly cluttered environment, however, this sampling can easily be trapped in a local minimum unless the sample size is very large, since the global optimality of local minima depends on the degree of uncertainty. Thus, a homology-embedded sampling-based planner that identifies many (potentially) local-minimum trajectories in different homology classes is developed to aid the sampling process. In combination with a receding-horizon fashion of the optimal control the proposed method produces a dynamically feasible and collision-free motion plans without being trapped in a local minimum. Numerical examples on a synthetic toy problem and on quadrotor control in a complex obstacle field demonstrate the validity of the proposed method.

[1]  Han-Lim Choi,et al.  A successive approximation-based approach for optimal kinodynamic motion planning with nonlinear differential constraints , 2013, 52nd IEEE Conference on Decision and Control.

[2]  Rahul Kala,et al.  Homotopic Roadmap Generation for Robot Motion Planning , 2016, J. Intell. Robotic Syst..

[3]  Emilio Frazzoli,et al.  Optimal kinodynamic motion planning using incremental sampling-based methods , 2010, 49th IEEE Conference on Decision and Control (CDC).

[4]  Nicholas Roy,et al.  Rapidly-exploring Random Belief Trees for motion planning under uncertainty , 2011, 2011 IEEE International Conference on Robotics and Automation.

[5]  Vijay Kumar,et al.  The GRASP Multiple Micro-UAV Testbed , 2010, IEEE Robotics & Automation Magazine.

[6]  Nolan Wagener,et al.  Information theoretic MPC for model-based reinforcement learning , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[7]  Marco Pavone,et al.  Fast marching tree: A fast marching sampling-based method for optimal motion planning in many dimensions , 2013, ISRR.

[8]  H. Kappen,et al.  Path integral control and state-dependent feedback. , 2014, Physical review. E, Statistical, nonlinear, and soft matter physics.

[9]  Pere Ridao,et al.  A comparison of homotopic path planning algorithms for robotic applications , 2015, Robotics Auton. Syst..

[10]  Emilio Frazzoli,et al.  Optimal sampling-based Feedback Motion Trees among obstacles for controllable linear systems with linear constraints , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[11]  Han-Lim Choi,et al.  Adaptive path-integral autoencoder: representation learning and planning for dynamical systems , 2018, NeurIPS.

[12]  Steven M. LaValle,et al.  Planning algorithms , 2006 .

[13]  C. Gardiner Handbook of Stochastic Methods , 1983 .

[14]  Han-Lim Choi,et al.  A topology-guided path integral approach for stochastic optimal control , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[15]  Marco Pavone,et al.  Toward a real-time framework for solving the kinodynamic motion planning problem , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[16]  Evangelos Theodorou,et al.  Nonlinear Stochastic Control and Information Theoretic Dualities: Connections, Interdependencies and Thermodynamic Interpretations , 2015, Entropy.

[17]  Jur P. van den Berg,et al.  Kinodynamic RRT*: Asymptotically optimal motion planning for robots with linear dynamics , 2013, 2013 IEEE International Conference on Robotics and Automation.

[18]  Evangelos Theodorou,et al.  Model Predictive Path Integral Control using Covariance Variable Importance Sampling , 2015, ArXiv.

[19]  Torsten Bertram,et al.  Integrated online trajectory planning and optimization in distinctive topologies , 2017, Robotics Auton. Syst..

[20]  Stefan Schaal,et al.  A Generalized Path Integral Control Approach to Reinforcement Learning , 2010, J. Mach. Learn. Res..

[21]  Alexei Makarenko,et al.  Information based adaptive robotic exploration , 2002, IEEE/RSJ International Conference on Intelligent Robots and Systems.

[22]  Vijay Kumar,et al.  Topological exploration of unknown and partially known environments , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[23]  Hilbert J. Kappen,et al.  Adaptive Importance Sampling for Control and Inference , 2015, ArXiv.

[24]  Hilbert J. Kappen,et al.  Particle Smoothing for Hidden Diffusion Processes: Adaptive Path Integral Smoother , 2016, IEEE Transactions on Signal Processing.

[25]  Stefan Schaal,et al.  Reinforcement learning of motor skills in high dimensions: A path integral approach , 2010, 2010 IEEE International Conference on Robotics and Automation.

[26]  Gaurav S. Sukhatme,et al.  A topological approach to using cables to separate and manipulate sets of objects , 2013, Int. J. Robotics Res..

[27]  Danica Kragic,et al.  Topological trajectory clustering with relative persistent homology , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[28]  Vijay Kumar,et al.  Invariants for homology classes with application to optimal search and planning problem in robotics , 2012, Annals of Mathematics and Artificial Intelligence.

[29]  Emanuel Todorov,et al.  Efficient computation of optimal actions , 2009, Proceedings of the National Academy of Sciences.

[30]  Evangelos Theodorou,et al.  Information-theoretic stochastic optimal control via incremental sampling-based algorithms , 2014, 2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL).

[31]  Ioannis Karatzas,et al.  Brownian Motion and Stochastic Calculus , 1987 .

[32]  Florian T. Pokorny,et al.  Topological trajectory classification with filtrations of simplicial complexes and persistent homology , 2016, Int. J. Robotics Res..

[33]  H. Kappen Path integrals and symmetry breaking for optimal control theory , 2005, physics/0505066.

[34]  Vijay Kumar,et al.  Topological constraints in search-based robot path planning , 2012, Auton. Robots.

[35]  Vijay Kumar,et al.  Persistent Homology for Path Planning in Uncertain Environments , 2015, IEEE Transactions on Robotics.

[36]  Emilio Frazzoli,et al.  Sampling-based algorithms for optimal motion planning , 2011, Int. J. Robotics Res..

[37]  Danica Kragic,et al.  High-dimensional Winding-Augmented Motion Planning with 2D topological task projections and persistent homology , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).