Update Method of Cost Function to Learn Robust Policy Parameters
Daigo Fujiwara | Fumitoshi Matsuno | Ryo Ariizumi | Kosuke Yamamoto | Tomohiro Hayakawa
[1] Fumitoshi Matsuno, et al. Learning and Chaining of Motor Primitives for Goal-Directed Locomotion of a Snake-Like Robot with Screw-Drive Units, 2015.
[2] Cristina P. Santos, et al. Adapting Biped Locomotion to Sloped Environments, 2015, J. Intell. Robotic Syst.
[3] Stéphane Doncieux, et al. The Transferability Approach: Crossing the Reality Gap in Evolutionary Robotics, 2013, IEEE Transactions on Evolutionary Computation.
[4] Jan Peters, et al. Reinforcement learning in robotics: A survey, 2013, Int. J. Robotics Res.
[5] Olivier Sigaud, et al. Path Integral Policy Improvement with Covariance Matrix Adaptation, 2012, ICML.
[6] Ales Ude, et al. Learning to pour with a robot arm combining goal and shape learning for dynamic movement primitives, 2011, Robotics Auton. Syst.
[7] Jan Peters, et al. Policy Search for Motor Primitives in Robotics, 2022.
[8] Stefan Schaal, et al. A Generalized Path Integral Control Approach to Reinforcement Learning, 2010, J. Mach. Learn. Res.
[9] Jun Nakanishi, et al. Learning Attractor Landscapes for Learning Motor Primitives, 2002, NIPS.
[10] Jun Morimoto, et al. Acquisition of stand-up behavior by a real robot using hierarchical reinforcement learning, 2000, Robotics Auton. Syst.
[11] Andrew W. Moore, et al. Reinforcement Learning: A Survey, 1996, J. Artif. Intell. Res.