Defining Object Types and Options Using MDP Homomorphisms
暂无分享,去创建一个
[1] David Chapman,et al. Pengi: An Implementation of a Theory of Activity , 1987, AAAI.
[2] Doina Precup,et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..
[3] Craig Boutilier,et al. Symbolic Dynamic Programming for First-Order MDPs , 2001, IJCAI.
[4] Balaraman Ravindran,et al. SMDP Homomorphisms: An Algebraic Approach to Abstraction in Semi-Markov Decision Processes , 2003, IJCAI.
[5] Robert Givan,et al. Equivalence notions and model minimization in Markov decision processes , 2003, Artif. Intell..
[6] Luc De Raedt,et al. Logical Markov Decision Programs , 2003 .
[7] A. Barto,et al. An algebraic approach to abstraction in reinforcement learning , 2004 .
[8] Alicia P. Wolfe,et al. Decision Tree Methods for Finding Reusable MDP Homomorphisms , 2006, AAAI.