Apprenticeship learning using linear programming
[1] R. Varga, et al. Proof of Theorem 2, 1983.
[2] Charles R. Johnson, et al. Matrix analysis, 1985.
[3] Shu-Cherng Fang, et al. Linear Optimization and Extensions: Theory and Algorithms, 1993.
[4] Martin L. Puterman, et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming, 1994.
[5] A. Shwartz, et al. Handbook of Markov decision processes: methods and applications, 2002.
[6] Pieter Abbeel, et al. Apprenticeship learning via inverse reinforcement learning, 2004, ICML.
[7] J. Andrew Bagnell, et al. Maximum margin planning, 2006, ICML.
[8] Michael H. Bowling, et al. Computing Robust Counter-Strategies, 2007, NIPS.
[9] Tao Wang, et al. Stable Dual Dynamic Programming, 2007, NIPS.
[10] Robert E. Schapire, et al. A Game-Theoretic Approach to Apprenticeship Learning, 2007, NIPS.