Using Homomorphisms to Transfer Options across Continuous Reinforcement Learning Domains
暂无分享,去创建一个
[1] Robert Givan,et al. Model Minimization in Markov Decision Processes , 1997, AAAI/IAAI.
[2] Doina Precup,et al. Intra-Option Learning about Temporally Abstract Actions , 1998, ICML.
[3] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[4] Doina Precup,et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..
[5] Balaraman Ravindran,et al. Model Minimization in Hierarchical Reinforcement Learning , 2002, SARA.
[6] Shobha Venkataraman,et al. Efficient Solution Algorithms for Factored MDPs , 2003, J. Artif. Intell. Res..
[7] A. Barto,et al. An algebraic approach to abstraction in reinforcement learning , 2004 .
[8] Peter Stone,et al. Behavior transfer for value-function-based reinforcement learning , 2005, AAMAS '05.
[9] Peter Stone,et al. Reinforcement Learning for RoboCup Soccer Keepaway , 2005, Adapt. Behav..
[10] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.