Building a Library of Policies through Policy Reuse
暂无分享,去创建一个
[1] Manuela Veloso,et al. Exploration and Policy Reuse , 2005 .
[2] Sebastian Thrun,et al. Finding Structure in Reinforcement Learning , 1994, NIPS.
[3] Manuela Veloso,et al. Tree based hierarchical reinforcement learning , 2002 .
[4] Doina Precup,et al. Intra-Option Learning about Temporally Abstract Actions , 1998, ICML.
[5] Manuela Veloso,et al. Probabilistic Reuse of Past Policies , 2005 .
[6] Thomas G. Dietterich. Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition , 1999, J. Artif. Intell. Res..
[7] Ben J. A. Kröse,et al. Learning from delayed rewards , 1995, Robotics Auton. Syst..
[8] Manuela M. Veloso,et al. Real-Time Randomized Path Planning for Robot Navigation , 2002, RoboCup.
[9] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .
[10] Manuela M. Veloso,et al. Planning and Learning by Analogical Reasoning , 1994, Lecture Notes in Computer Science.
[11] Sebastian Thrun,et al. Lifelong robot learning , 1993, Robotics Auton. Syst..