Apprenticeship learning via inverse reinforcement learning
[1] A. S. Manne. Linear Programming and Sequential Decisions , 1960 .
[2] 丸山 徹. On a Few Developments in Convex Analysis , 1977 .
[3] R. Varga,et al. Proof of Theorem 2 , 1983 .
[4] N. Hogan. An organizing principle for a class of voluntary movements , 1984, The Journal of neuroscience : the official journal of the Society for Neuroscience.
[5] Dean Pomerleau,et al. ALVINN, an autonomous land vehicle in a neural network , 1988, NIPS.
[6] 宇野 洋二,et al. Formation and control of optimal trajectory in human multijoint arm movement : minimum torque-change model , 1988 .
[7] Masayuki Inaba,et al. Learning by watching: extracting reusable task knowledge from visual observation of human performance , 1994, IEEE Trans. Robotics Autom..
[8] Gillian M. Hayes,et al. A Robot Controller Using Learning by Imitation , 1994 .
[9] Stefan Schaal,et al. Robot Learning From Demonstration , 1997, ICML.
[10] Vladimir Vapnik,et al. Statistical learning theory , 1998 .
[11] S. Pattinson,et al. Learning to fly. , 1998 .
[12] Andrew Y. Ng,et al. Policy Invariance Under Reward Transformations: Theory and Application to Reward Shaping , 1999, ICML.
[13] Vladimir N. Vapnik,et al. The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.
[14] Andrew Y. Ng,et al. Algorithms for Inverse Reinforcement Learning , 2000, ICML.
[15] R. Amit,et al. Learning movement sequences from demonstration , 2002, Proceedings 2nd International Conference on Development and Learning. ICDL 2002.
[16] M. Kawato,et al. Formation and control of optimal trajectory in human multijoint arm movement , 1989, Biological Cybernetics.
[17] K. Taira. Proof of Theorem 1.3 , 2004 .