论文信息 - Defining Object Types and Options Using MDP Homomorphisms

Defining Object Types and Options Using MDP Homomorphisms

Agents in complex environments can have a wide range of tasks to perform over time. However, often there are sets of tasks that involve similar goals on similar objects, e.g., the skill of making a car move to a destination is similar for all cars. This paper lays out a framework for specifying goals that are parameterized with focus objects, as well as defining object type in such a way that objects of the same type share policies. The method is agnostic as to the underlying state representation, as long as simple functions of the state of the object can be calculated.

Alicia P. Wolfe

[1] David Chapman,et al. Pengi: An Implementation of a Theory of Activity , 1987, AAAI.

[2] Doina Precup,et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..

[3] Craig Boutilier,et al. Symbolic Dynamic Programming for First-Order MDPs , 2001, IJCAI.

[4] Balaraman Ravindran,et al. SMDP Homomorphisms: An Algebraic Approach to Abstraction in Semi-Markov Decision Processes , 2003, IJCAI.

[5] Robert Givan,et al. Equivalence notions and model minimization in Markov decision processes , 2003, Artif. Intell..

[6] Luc De Raedt,et al. Logical Markov Decision Programs , 2003 .

[7] A. Barto,et al. An algebraic approach to abstraction in reinforcement learning , 2004 .

[8] Alicia P. Wolfe,et al. Decision Tree Methods for Finding Reusable MDP Homomorphisms , 2006, AAAI.