PAC Optimal Exploration in Continuous Space Markov Decision Processes
暂无分享,去创建一个
[1] Pieter Abbeel,et al. Safe Exploration in Markov Decision Processes , 2012, ICML.
[2] B. Adams,et al. Dynamic multidrug therapies for hiv: optimal and sti control approaches. , 2004, Mathematical biosciences and engineering : MBE.
[3] Louis Wehenkel,et al. Clinical data based optimal STI strategies for HIV: a reinforcement learning approach , 2006, Proceedings of the 45th IEEE Conference on Decision and Control.
[4] Ronald J. Williams,et al. Tight Performance Bounds on Greedy Policies Based on Imperfect Value Functions , 1993 .
[5] Michael L. Littman,et al. A theoretical analysis of Model-Based Interval Estimation , 2005, ICML.
[6] Michael L. Littman,et al. Multi-resolution Exploration in Continuous Spaces , 2008, NIPS.
[7] Bart De Schutter,et al. Optimistic planning for sparsely stochastic systems , 2011, 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL).
[8] Peter Stone,et al. Model-Based Exploration in Continuous State Spaces , 2007, SARA.
[9] Sham M. Kakade,et al. On the sample complexity of reinforcement learning. , 2003 .
[10] Michael L. Littman,et al. A unifying framework for computational reinforcement learning theory , 2009 .
[11] Michael Kearns,et al. Near-Optimal Reinforcement Learning in Polynomial Time , 2002, Machine Learning.
[12] Michael L. Littman,et al. Online Linear Regression and Its Application to Model-Based Reinforcement Learning , 2007, NIPS.
[13] Csaba Szepesvári,et al. Model-based reinforcement learning with nearly tight exploration complexity bounds , 2010, ICML.
[14] Andrew Y. Ng,et al. Near-Bayesian exploration in polynomial time , 2009, ICML '09.
[15] Lihong Li,et al. PAC model-free reinforcement learning , 2006, ICML.
[16] Nicholas Roy,et al. Provably Efficient Learning with Typed Parametric Models , 2009, J. Mach. Learn. Res..
[17] John Langford,et al. Exploration in Metric State Spaces , 2003, ICML.