Policy Search for Path Integral Control
暂无分享,去创建一个
Vicenç Gómez | Hilbert J. Kappen | Jan Peters | Gerhard Neumann | Jan Peters | H. Kappen | G. Neumann | V. Gómez
[1] Jun Nakanishi,et al. Learning Attractor Landscapes for Learning Motor Primitives , 2002, NIPS.
[2] H. Kappen. Linear theory for control of nonlinear stochastic systems. , 2004, Physical review letters.
[3] Emanuel Todorov,et al. Linearly-solvable Markov decision problems , 2006, NIPS.
[4] Marc Toussaint,et al. Robot trajectory optimization using approximate inference , 2009, ICML '09.
[5] Jan Peters,et al. Noname manuscript No. (will be inserted by the editor) Policy Search for Motor Primitives in Robotics , 2022 .
[6] Yasemin Altun,et al. Relative Entropy Policy Search , 2010 .
[7] Emanuel Todorov,et al. Policy gradients in linearly-solvable MDPs , 2010, NIPS.
[8] Stefan Schaal,et al. A Generalized Path Integral Control Approach to Reinforcement Learning , 2010, J. Mach. Learn. Res..
[9] Stefan Schaal,et al. Learning variable impedance control , 2011, Int. J. Robotics Res..
[10] Stefan Schaal,et al. Hierarchical reinforcement learning with movement primitives , 2011, 2011 11th IEEE-RAS International Conference on Humanoid Robots.
[11] Stefan Schaal,et al. Learning to grasp under uncertainty , 2011, 2011 IEEE International Conference on Robotics and Automation.
[12] Marc Toussaint,et al. On Stochastic Optimal Control and Reinforcement Learning by Approximate Inference , 2012, Robotics: Science and Systems.
[13] Hilbert J. Kappen,et al. Dynamic policy programming , 2010, J. Mach. Learn. Res..
[14] Evangelos Theodorou,et al. Relative entropy and free energy dualities: Connections to Path Integral and KL control , 2012, 2012 IEEE 51st IEEE Conference on Decision and Control (CDC).
[15] Jan Peters,et al. Hierarchical Relative Entropy Policy Search , 2014, AISTATS.
[16] Olivier Sigaud,et al. Path Integral Policy Improvement with Covariance Matrix Adaptation , 2012, ICML.
[17] Vicenç Gómez,et al. Optimal control as a graphical model inference problem , 2009, Machine Learning.
[18] Evangelos Theodorou,et al. Tendon-driven control of biomechanical and robotic systems: A path integral reinforcement learning approach , 2012, 2012 IEEE International Conference on Robotics and Automation.
[19] Francesco Nori,et al. Open-loop stochastic optimal control of a passive noise-rejection variable stiffness actuator: Application to unstable tasks , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[20] Marc Toussaint,et al. Path Integral Control by Reproducing Kernel Hilbert Space Embedding , 2013, IJCAI.
[21] Jan Peters,et al. Data-Efficient Generalization of Robot Skills with Contextual Policy Search , 2013, AAAI.