Switching between Representations in Reinforcement Learning
暂无分享,去创建一个
Shimon Whiteson | Harm van Seijen | Leon J. H. M. Kester | H. V. Seijen | L. Kester | Shimon Whiteson
[1] R. Bellman. A Markovian Decision Process , 1957 .
[2] Michael Kearns,et al. Efficient Reinforcement Learning in Factored MDPs , 1999, IJCAI.
[3] Jesse Hoey,et al. APRICODD: Approximate Policy Construction Using Decision Diagrams , 2000, NIPS.
[4] Michael L. Littman,et al. Efficient Structure Learning in Factored-State MDPs , 2007, AAAI.
[5] D. Siegmund. Importance Sampling in the Monte Carlo Study of Sequential Tests , 1976 .
[6] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..
[7] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[8] Pieter Abbeel,et al. Learning Factor Graphs in Polynomial Time and Sample Complexity , 2006, J. Mach. Learn. Res..
[9] Thomas J. Walsh,et al. Knows what it knows: a framework for self-aware learning , 2008, ICML.
[10] Craig Boutilier,et al. Exploiting Structure in Policy Construction , 1995, IJCAI.
[11] Jesse Hoey,et al. SPUDD: Stochastic Planning using Decision Diagrams , 1999, UAI.
[12] Lihong Li,et al. The adaptive k-meteorologists problem and its application to structure learning and feature selection in reinforcement learning , 2009, ICML '09.
[13] Shobha Venkataraman,et al. Efficient Solution Algorithms for Factored MDPs , 2003, J. Artif. Intell. Res..