Abstracting Complex Domains Using Modular Object-Oriented Markov Decision Processes
[1] Thomas G. Dietterich. Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition, 1999, J. Artif. Intell. Res.
[2] Doina Precup, et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning, 1999, Artif. Intell.
[3] Marie desJardins, et al. Portable Option Discovery for Automated Learning Transfer in Object-Oriented Markov Decision Processes, 2015, IJCAI.
[4] Sriraam Natarajan, et al. Dynamic Preferences in Multi-Criteria Reinforcement Learning, 2005, ICML.
[5] Craig Boutilier, et al. Exploiting Structure in Policy Construction, 1995, IJCAI.
[6] Andre Cohen, et al. An Object-Oriented Representation for Efficient Reinforcement Learning, 2008, ICML.