Probabilistic Policy Reuse for inter-task transfer learning

[1]  Fernando Fernández,et al.  Two steps reinforcement learning , 2008, Int. J. Intell. Syst..

[2]  Peter Stone,et al.  Transfer Learning via Inter-Task Mappings for Temporal Difference Learning , 2007, J. Mach. Learn. Res..

[3]  Vishal Soni,et al.  Using Homomorphisms to Transfer Options across Continuous Reinforcement Learning Domains , 2006, AAAI.

[4]  Peter Stone,et al.  Inter-Task Action Correlation for Reinforcement Learning Tasks , 2006, AAAI.

[5]  Manuela M. Veloso,et al.  Probabilistic policy reuse in a reinforcement learning agent , 2006, AAMAS '06.

[6]  Jude W. Shavlik,et al.  Using Advice to Transfer Knowledge Acquired in One Reinforcement Learning Task to Another , 2005, ECML.

[7]  Peter Stone,et al.  Reinforcement Learning for RoboCup Soccer Keepaway , 2005, Adapt. Behav..

[8]  Peter Stone,et al.  Behavior transfer for value-function-based reinforcement learning , 2005, AAMAS '05.

[9]  Peter Stone,et al.  Value Functions for RL-Based Behavior Transfer: A Comparative Study , 2005, AAAI.

[10]  Craig Boutilier,et al.  Imitation and Reinforcement Learning in Agents with Heterogeneous Actions , 2001, Canadian Conference on AI.

[11]  Doina Precup,et al.  Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..

[12]  Thomas G. Dietterich Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition , 1999, J. Artif. Intell. Res..

[13]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[14]  Ben J. A. Kröse,et al.  Learning from delayed rewards , 1995, Robotics Auton. Syst..

[15]  Gerald Tesauro,et al.  Practical issues in temporal difference learning , 1992, Machine Learning.

[16]  Manuela Veloso,et al.  Reinforcement learning in the robocup-soccer keepaway , 2007 .

[17]  Thomas J. Walsh Transferring State Abstractions Between MDPs , 2006 .

[18]  Fernando Fernández,et al.  Policy Reuse for Transfer Learning Across Tasks with Different State and Action Spaces , 2006 .

[19]  Scott J. Harmon,et al.  Empirical Comparison of Incremental Reuse Strategies in Genetic Programming for Keep-Away Soccer , 2004 .