The Influence of Reward on the Speed of Reinforcement Learning: An Analysis of Shaping
暂无分享,去创建一个
[1] M. Kendall. Probability and Statistical Inference , 1956, Nature.
[2] P. J. Green,et al. Probability and Statistical Inference , 1978 .
[3] Marco Colombetti,et al. Robot shaping: developing situated agents through learning , 1992 .
[4] Maja J. Mataric,et al. Reward Functions for Accelerated Learning , 1994, ICML.
[5] Ben J. A. Kröse,et al. Learning from delayed rewards , 1995, Robotics Auton. Syst..
[6] Preben Alstrøm,et al. Learning to Drive a Bicycle Using Reinforcement Learning and Shaping , 1998, ICML.
[7] Andrew Y. Ng,et al. Policy Invariance Under Reward Transformations: Theory and Application to Reward Shaping , 1999, ICML.
[8] Gerald DeJong,et al. Reinforcement Learning and Shaping: Encouraging Intended Behaviors , 2002, ICML.
[9] Gerald Tesauro,et al. Practical issues in temporal difference learning , 1992, Machine Learning.
[10] Michael Kearns,et al. Near-Optimal Reinforcement Learning in Polynomial Time , 2002, Machine Learning.