论文信息 - Topology-guided path integral approach for stochastic optimal control in cluttered environment

Topology-guided path integral approach for stochastic optimal control in cluttered environment

This paper addresses planning and control of robot motion under uncertainty that is formulated as a continuous-time, continuous-space stochastic optimal control problem, by developing a topology-guided path integral control method. The path integral control framework, which forms the backbone of the proposed method, re-writes the Hamilton-Jacobi-Bellman equation as a statistical inference problem; the resulting inference problem is solved by a sampling procedure that computes the distribution of controlled trajectories around the trajectory by the passive dynamics. For motion control of robots in a highly cluttered environment, however, this sampling can easily be trapped in a local minimum unless the sample size is very large, since the global optimality of local minima depends on the degree of uncertainty. Thus, a homology-embedded sampling-based planner that identifies many (potentially) local-minimum trajectories in different homology classes is developed to aid the sampling process. In combination with a receding-horizon fashion of the optimal control the proposed method produces a dynamically feasible and collision-free motion plans without being trapped in a local minimum. Numerical examples on a synthetic toy problem and on quadrotor control in a complex obstacle field demonstrate the validity of the proposed method.

Han-Lim Choi | Jung-Su Ha | Soon-Seo Park

[1] Han-Lim Choi,et al. A successive approximation-based approach for optimal kinodynamic motion planning with nonlinear differential constraints , 2013, 52nd IEEE Conference on Decision and Control.

[2] Rahul Kala,et al. Homotopic Roadmap Generation for Robot Motion Planning , 2016, J. Intell. Robotic Syst..

[3] Emilio Frazzoli,et al. Optimal kinodynamic motion planning using incremental sampling-based methods , 2010, 49th IEEE Conference on Decision and Control (CDC).

[4] Nicholas Roy,et al. Rapidly-exploring Random Belief Trees for motion planning under uncertainty , 2011, 2011 IEEE International Conference on Robotics and Automation.

[5] Vijay Kumar,et al. The GRASP Multiple Micro-UAV Testbed , 2010, IEEE Robotics & Automation Magazine.

[6] Nolan Wagener,et al. Information theoretic MPC for model-based reinforcement learning , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[7] Marco Pavone,et al. Fast marching tree: A fast marching sampling-based method for optimal motion planning in many dimensions , 2013, ISRR.

[8] H. Kappen,et al. Path integral control and state-dependent feedback. , 2014, Physical review. E, Statistical, nonlinear, and soft matter physics.

[9] Pere Ridao,et al. A comparison of homotopic path planning algorithms for robotic applications , 2015, Robotics Auton. Syst..

[10] Emilio Frazzoli,et al. Optimal sampling-based Feedback Motion Trees among obstacles for controllable linear systems with linear constraints , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[11] Han-Lim Choi,et al. Adaptive path-integral autoencoder: representation learning and planning for dynamical systems , 2018, NeurIPS.

[12] Steven M. LaValle,et al. Planning algorithms , 2006 .

[13] C. Gardiner. Handbook of Stochastic Methods , 1983 .

[14] Han-Lim Choi,et al. A topology-guided path integral approach for stochastic optimal control , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[15] Marco Pavone,et al. Toward a real-time framework for solving the kinodynamic motion planning problem , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[16] Evangelos Theodorou,et al. Nonlinear Stochastic Control and Information Theoretic Dualities: Connections, Interdependencies and Thermodynamic Interpretations , 2015, Entropy.

[17] Jur P. van den Berg,et al. Kinodynamic RRT*: Asymptotically optimal motion planning for robots with linear dynamics , 2013, 2013 IEEE International Conference on Robotics and Automation.

[18] Evangelos Theodorou,et al. Model Predictive Path Integral Control using Covariance Variable Importance Sampling , 2015, ArXiv.

[19] Torsten Bertram,et al. Integrated online trajectory planning and optimization in distinctive topologies , 2017, Robotics Auton. Syst..

[20] Stefan Schaal,et al. A Generalized Path Integral Control Approach to Reinforcement Learning , 2010, J. Mach. Learn. Res..

[21] Alexei Makarenko,et al. Information based adaptive robotic exploration , 2002, IEEE/RSJ International Conference on Intelligent Robots and Systems.

[22] Vijay Kumar,et al. Topological exploration of unknown and partially known environments , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[23] Hilbert J. Kappen,et al. Adaptive Importance Sampling for Control and Inference , 2015, ArXiv.

[24] Hilbert J. Kappen,et al. Particle Smoothing for Hidden Diffusion Processes: Adaptive Path Integral Smoother , 2016, IEEE Transactions on Signal Processing.

[25] Stefan Schaal,et al. Reinforcement learning of motor skills in high dimensions: A path integral approach , 2010, 2010 IEEE International Conference on Robotics and Automation.

[26] Gaurav S. Sukhatme,et al. A topological approach to using cables to separate and manipulate sets of objects , 2013, Int. J. Robotics Res..

[27] Danica Kragic,et al. Topological trajectory clustering with relative persistent homology , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[28] Vijay Kumar,et al. Invariants for homology classes with application to optimal search and planning problem in robotics , 2012, Annals of Mathematics and Artificial Intelligence.

[29] Emanuel Todorov,et al. Efficient computation of optimal actions , 2009, Proceedings of the National Academy of Sciences.

[30] Evangelos Theodorou,et al. Information-theoretic stochastic optimal control via incremental sampling-based algorithms , 2014, 2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL).

[31] Ioannis Karatzas,et al. Brownian Motion and Stochastic Calculus , 1987 .

[32] Florian T. Pokorny,et al. Topological trajectory classification with filtrations of simplicial complexes and persistent homology , 2016, Int. J. Robotics Res..

[33] H. Kappen. Path integrals and symmetry breaking for optimal control theory , 2005, physics/0505066.

[34] Vijay Kumar,et al. Topological constraints in search-based robot path planning , 2012, Auton. Robots.

[35] Vijay Kumar,et al. Persistent Homology for Path Planning in Uncertain Environments , 2015, IEEE Transactions on Robotics.

[36] Emilio Frazzoli,et al. Sampling-based algorithms for optimal motion planning , 2011, Int. J. Robotics Res..

[37] Danica Kragic,et al. High-dimensional Winding-Augmented Motion Planning with 2D topological task projections and persistent homology , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).