论文信息 - Probabilistic Policy Reuse for inter-task transfer learning - 字舞流文

Probabilistic Policy Reuse for inter-task transfer learning

Javier García | Manuela M. Veloso | Fernando Fernández | M. Veloso | F. Fernández | Javier García

[1] Fernando Fernández,et al. Two steps reinforcement learning , 2008, Int. J. Intell. Syst..

[2] Peter Stone,et al. Transfer Learning via Inter-Task Mappings for Temporal Difference Learning , 2007, J. Mach. Learn. Res..

[3] Vishal Soni,et al. Using Homomorphisms to Transfer Options across Continuous Reinforcement Learning Domains , 2006, AAAI.

[4] Peter Stone,et al. Inter-Task Action Correlation for Reinforcement Learning Tasks , 2006, AAAI.

[5] Manuela M. Veloso,et al. Probabilistic policy reuse in a reinforcement learning agent , 2006, AAMAS '06.

[6] Jude W. Shavlik,et al. Using Advice to Transfer Knowledge Acquired in One Reinforcement Learning Task to Another , 2005, ECML.

[7] Peter Stone,et al. Reinforcement Learning for RoboCup Soccer Keepaway , 2005, Adapt. Behav..

[8] Peter Stone,et al. Behavior transfer for value-function-based reinforcement learning , 2005, AAMAS '05.

[9] Peter Stone,et al. Value Functions for RL-Based Behavior Transfer: A Comparative Study , 2005, AAAI.

[10] Craig Boutilier,et al. Imitation and Reinforcement Learning in Agents with Heterogeneous Actions , 2001, Canadian Conference on AI.

[11] Doina Precup,et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..

[12] Thomas G. Dietterich. Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition , 1999, J. Artif. Intell. Res..

[13] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[14] Ben J. A. Kröse,et al. Learning from delayed rewards , 1995, Robotics Auton. Syst..

[15] Gerald Tesauro,et al. Practical issues in temporal difference learning , 1992, Machine Learning.

[16] Manuela Veloso,et al. Reinforcement learning in the robocup-soccer keepaway , 2007 .

[17] Thomas J. Walsh. Transferring State Abstractions Between MDPs , 2006 .

[18] Fernando Fernández,et al. Policy Reuse for Transfer Learning Across Tasks with Different State and Action Spaces , 2006 .

[19] Scott J. Harmon,et al. Empirical Comparison of Incremental Reuse Strategies in Genetic Programming for Keep-Away Soccer , 2004 .