PAC-MDP Reinforcement Learning with Bayesian Priors
暂无分享,去创建一个
Michael L. Littman | David Wingate | Ali Nouri | Lihong Li | M. Littman | J. Asmuth | Lihong Li | A. Nouri | D. Wingate | John Asmuth
[1] Ronen I. Brafman,et al. R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning , 2001, J. Mach. Learn. Res..
[2] Michael L. Littman,et al. Efficient Structure Learning in Factored-State MDPs , 2007, AAAI.
[3] Sham M. Kakade,et al. On the sample complexity of reinforcement learning. , 2003 .
[4] Michael Kearns,et al. Near-Optimal Reinforcement Learning in Polynomial Time , 2002, Machine Learning.
[5] Michael L. Littman,et al. Efficient Reinforcement Learning with Relocatable Action Models , 2007, AAAI.
[6] Lihong Li,et al. The adaptive k-meteorologists problem and its application to structure learning and feature selection in reinforcement learning , 2009, ICML '09.
[7] Michael Kearns,et al. Efficient Reinforcement Learning in Factored MDPs , 1999, IJCAI.
[8] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[9] Michael L. Littman,et al. A unifying framework for computational reinforcement learning theory , 2009 .
[10] Thomas J. Walsh,et al. Knows what it knows: a framework for self-aware learning , 2008, ICML.
[11] Jesse Hoey,et al. An analytic solution to discrete Bayesian reinforcement learning , 2006, ICML.
[12] David Andre,et al. Model based Bayesian Exploration , 1999, UAI.
[13] Lihong Li,et al. A Bayesian Sampling Approach to Exploration in Reinforcement Learning , 2009, UAI.
[14] Claude-Nicolas Fiechter,et al. Efficient reinforcement learning , 1994, COLT '94.