Online Reinforcement Learning Using a Probability Density Estimation
暂无分享,去创建一个
[1] Shigeo Abe DrEng. Pattern Classification , 2001, Springer London.
[2] Carl E. Rasmussen,et al. Gaussian process dynamic programming , 2009, Neurocomputing.
[3] Kenji Doya,et al. Reinforcement Learning in Continuous Time and Space , 2000, Neural Computation.
[4] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[5] Geoffrey J. Gordon. Stable Function Approximation in Dynamic Programming , 1995, ICML.
[6] Marc Peter Deisenroth,et al. Efficient reinforcement learning using Gaussian processes , 2010 .
[7] D. Rubin,et al. Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .
[8] Radford M. Neal. Pattern Recognition and Machine Learning , 2007, Technometrics.
[9] Shie Mannor,et al. Reinforcement learning with Gaussian processes , 2005, ICML.
[10] Carl E. Rasmussen,et al. PILCO: A Model-Based and Data-Efficient Approach to Policy Search , 2011, ICML.
[11] Shin Ishii,et al. Reinforcement Learning Based on On-Line EM Algorithm , 1998, NIPS.
[12] Shin Ishii,et al. On-line EM Algorithm for the Normalized Gaussian Network , 2000, Neural Computation.
[13] Alejandro Agostini,et al. Reinforcement Learning with a Gaussian mixture model , 2010, The 2010 International Joint Conference on Neural Networks (IJCNN).
[14] Christopher M. Bishop,et al. Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .
[15] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[16] Shie Mannor,et al. Bayes Meets Bellman: The Gaussian Process Approach to Temporal Difference Learning , 2003, ICML.
[17] Alejandro Agostini,et al. A Competitive Strategy for Function Approximation in Q-Learning , 2011, IJCAI.
[18] Mário A. T. Figueiredo. On Gaussian radial basis function approximations: interpretation, extensions, and learning strategies , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.
[19] Alejandro Agostini,et al. Online EM with Weight-Based Forgetting , 2015, Neural Computation.