Probabilistic Policy Reuse for inter-task transfer learning
暂无分享,去创建一个
[1] Fernando Fernández,et al. Two steps reinforcement learning , 2008, Int. J. Intell. Syst..
[2] Peter Stone,et al. Transfer Learning via Inter-Task Mappings for Temporal Difference Learning , 2007, J. Mach. Learn. Res..
[3] Vishal Soni,et al. Using Homomorphisms to Transfer Options across Continuous Reinforcement Learning Domains , 2006, AAAI.
[4] Peter Stone,et al. Inter-Task Action Correlation for Reinforcement Learning Tasks , 2006, AAAI.
[5] Manuela M. Veloso,et al. Probabilistic policy reuse in a reinforcement learning agent , 2006, AAMAS '06.
[6] Jude W. Shavlik,et al. Using Advice to Transfer Knowledge Acquired in One Reinforcement Learning Task to Another , 2005, ECML.
[7] Peter Stone,et al. Reinforcement Learning for RoboCup Soccer Keepaway , 2005, Adapt. Behav..
[8] Peter Stone,et al. Behavior transfer for value-function-based reinforcement learning , 2005, AAMAS '05.
[9] Peter Stone,et al. Value Functions for RL-Based Behavior Transfer: A Comparative Study , 2005, AAAI.
[10] Craig Boutilier,et al. Imitation and Reinforcement Learning in Agents with Heterogeneous Actions , 2001, Canadian Conference on AI.
[11] Doina Precup,et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..
[12] Thomas G. Dietterich. Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition , 1999, J. Artif. Intell. Res..
[13] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..
[14] Ben J. A. Kröse,et al. Learning from delayed rewards , 1995, Robotics Auton. Syst..
[15] Gerald Tesauro,et al. Practical issues in temporal difference learning , 1992, Machine Learning.
[16] Manuela Veloso,et al. Reinforcement learning in the robocup-soccer keepaway , 2007 .
[17] Thomas J. Walsh. Transferring State Abstractions Between MDPs , 2006 .
[18] Fernando Fernández,et al. Policy Reuse for Transfer Learning Across Tasks with Different State and Action Spaces , 2006 .
[19] Scott J. Harmon,et al. Empirical Comparison of Incremental Reuse Strategies in Genetic Programming for Keep-Away Soccer , 2004 .