A Sparse Kernel-Based Least-Squares Temporal Difference Algorithm for Reinforcement Learning
[1] Shie Mannor, et al. Bayes Meets Bellman: The Gaussian Process Approach to Temporal Difference Learning, 2003, ICML.
[2] Justin A. Boyan, et al. Least-Squares Temporal Difference Learning, 1999, ICML.
[3] Vladimir Vapnik, et al. Statistical learning theory, 1998.
[4] Richard S. Sutton, et al. Introduction to Reinforcement Learning, 1998.
[5] Alexander J. Smola, et al. Learning with kernels, 1998.
[6] Leemon C. Baird, et al. Residual Algorithms: Reinforcement Learning with Function Approximation, 1995, ICML.
[7] Michail G. Lagoudakis, et al. Least-Squares Policy Iteration, 2003, J. Mach. Learn. Res.
[8] Xin Xu, et al. Kernel Least-Squares Temporal Difference Learning, 2006.
[9] Justin A. Boyan, et al. Technical Update: Least-Squares Temporal Difference Learning, 2002, Machine Learning.
[10] Andrew W. Moore, et al. Reinforcement Learning: A Survey, 1996, J. Artif. Intell. Res.
[11] Shie Mannor, et al. The kernel recursive least-squares algorithm, 2004, IEEE Transactions on Signal Processing.
[12] Richard S. Sutton, et al. Reinforcement Learning: An Introduction, 1998, IEEE Trans. Neural Networks.
[13] Liming Xiang, et al. Kernel-Based Reinforcement Learning, 2006, ICIC.
[14] Richard S. Sutton, et al. Learning to predict by the methods of temporal differences, 1988, Machine Learning.
[15] John N. Tsitsiklis, et al. Analysis of temporal-difference learning with function approximation, 1996, NIPS 1996.
[16] H. He, et al. Efficient Reinforcement Learning Using Recursive Least-Squares Methods, 2011, J. Artif. Intell. Res.
[17] Vladislav Tadic, et al. On the Convergence of Temporal-Difference Learning with Linear Function Approximation, 2001, Machine Learning.