Sample Complexity of Policy Search with Known Dynamics
暂无分享,去创建一个
[1] D. Pollard. Empirical Processes: Theory and Applications , 1990 .
[2] David Haussler,et al. Decision Theoretic Generalizations of the PAC Model for Neural Net and Other Learning Applications , 1992, Inf. Comput..
[3] Paul W. Goldberg,et al. Bounding the Vapnik-Chervonenkis Dimension of Concept Classes Parameterized by Real Numbers , 1993, COLT '93.
[4] Noga Alon,et al. Scale-sensitive dimensions, uniform convergence, and learnability , 1997, JACM.
[5] Lenore Blum,et al. Complexity and Real Computation , 1997, Springer New York.
[6] Peter L. Bartlett,et al. Learning in Neural Networks: Theoretical Foundations , 1999 .
[7] Peter L. Bartlett,et al. Neural Network Learning - Theoretical Foundations , 1999 .
[8] Michael I. Jordan,et al. PEGASUS: A policy search method for large MDPs and POMDPs , 2000, UAI.
[9] P. Varaiya,et al. Simulation-based uniform value function estimates of discounted and average-reward MDPs , 2004, 2004 43rd IEEE Conference on Decision and Control (CDC) (IEEE Cat. No.04CH37601).