论文信息 - High-dimensional Motion Planning using Latent Variable Models via Approximate Inference

High-dimensional Motion Planning using Latent Variable Models via Approximate Inference

In this work, we present an efficient framework to generate a motion trajectory of a robot that has a high degree of freedom (e.g., a humanoid robot). High-dimensionality of the robot configuration space often leads to difficulties in utilizing the widely-used motion planning algorithms because the volume of the decision space increases exponentially with the number of dimensions. To handle complications arising from the large decision space, and to solve a corresponding motion planning problem efficiently, two key concepts were adopted. First, the Gaussian process latent variable model (GP-LVM) was utilized for low-dimensional representation of the original configuration space. Second, an approximate inference algorithm was used, exploiting through the duality between control and estimation, to explore the decision space and to compute a high-quality motion trajectory of the robot. Utilizing the GP-LVM and the duality between control and estimation, we construct a fully probabilistic generative model with which we transform a high-dimensional motion planning problem into a tractable inference problem. Finally, we compute the optimal motion trajectory via an approximate inference algorithm based on a variant of the particle filter. Numerical experiments are presented to demonstrate the applicability and validity of the proposed approach. The resulting motions can be viewed in the supplemental video. ( this https URL )

Han-Lim Choi | Jung-Su Ha | Hyeok-Joo Chae

[1] Han-Lim Choi,et al. Multiscale abstraction, planning and control using diffusion wavelets for stochastic optimal control problems , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[2] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.

[3] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.

[4] Mikhail Belkin,et al. Laplacian Eigenmaps for Dimensionality Reduction and Data Representation , 2003, Neural Computation.

[5] C. Karen Liu,et al. Online control of simulated humanoids using particle belief propagation , 2015, ACM Trans. Graph..

[6] David J. Fleet,et al. Topologically-constrained latent variable models , 2008, ICML '08.

[7] S T Roweis,et al. Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[8] Emanuel Todorov,et al. Efficient computation of optimal actions , 2009, Proceedings of the National Academy of Sciences.

[9] Zoran Popovic,et al. Discovery of complex behaviors through contact-invariant optimization , 2012, ACM Trans. Graph..

[10] Marco Pavone,et al. Fast marching tree: A fast marching sampling-based method for optimal motion planning in many dimensions , 2013, ISRR.

[11] Joaquin Quiñonero Candela,et al. Local distance preservation in the GP-LVM through back constraints , 2006, ICML.

[12] Han-Lim Choi,et al. A topology-guided path integral approach for stochastic optimal control , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[13] David J. Fleet,et al. Erratum: "Gaussian process dynamical models for human motion" (IEEE Transactions on Pattern analysis and Machine Intelligenc (292)) , 2008 .

[14] Neil D. Lawrence,et al. Probabilistic Non-linear Principal Component Analysis with Gaussian Process Latent Variable Models , 2005, J. Mach. Learn. Res..

[15] Marc Toussaint,et al. On Stochastic Optimal Control and Reinforcement Learning by Approximate Inference , 2012, Robotics: Science and Systems.

[16] Emilio Frazzoli,et al. Sampling-based algorithms for optimal motion planning , 2011, Int. J. Robotics Res..

[17] Stefan Schaal,et al. STOMP: Stochastic trajectory optimization for motion planning , 2011, 2011 IEEE International Conference on Robotics and Automation.

[18] Anil V. Rao,et al. GPOPS-II , 2014, ACM Trans. Math. Softw..

[19] Sergey Levine,et al. Continuous character control with low-dimensional embeddings , 2012, ACM Trans. Graph..

[20] David J. Fleet,et al. 3D People Tracking with Gaussian Process Dynamical Models , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[21] Stefan Schaal,et al. A Generalized Path Integral Control Approach to Reinforcement Learning , 2010, J. Mach. Learn. Res..

[22] Nir Friedman,et al. Probabilistic Graphical Models - Principles and Techniques , 2009 .

[23] Emanuel Todorov,et al. General duality between optimal control and estimation , 2008, 2008 47th IEEE Conference on Decision and Control.

[24] Marc Toussaint,et al. Robot trajectory optimization using approximate inference , 2009, ICML '09.

[25] Emanuel Todorov,et al. Finding the Most Likely Trajectories of Optimally-Controlled Stochastic Systems , 2011 .

[26] Emanuel Todorov,et al. Convex and analytically-invertible dynamics with contacts and constraints: Theory and implementation in MuJoCo , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[27] A. Doucet,et al. Maximum a Posteriori Sequence Estimation Using Monte Carlo Particle Filters , 2001, Annals of the Institute of Statistical Mathematics.

[28] E. Todorov,et al. A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems , 2005, Proceedings of the 2005, American Control Conference, 2005..

[29] H. Kappen. Path integrals and symmetry breaking for optimal control theory , 2005, physics/0505066.

[30] David J. Fleet,et al. This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE Gaussian Process Dynamical Model , 2007 .

[31] Neil D. Lawrence,et al. Deep Gaussian Processes , 2012, AISTATS.