Monte-Carlo Swarm Policy Search
暂无分享,去创建一个
[1] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[2] Mark W. Spong,et al. The swing up control problem for the Acrobot , 1995 .
[3] Matthieu Geist,et al. Parametric value function approximation: A unified view , 2011, 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL).
[4] Michail G. Lagoudakis,et al. Least-Squares Policy Iteration , 2003, J. Mach. Learn. Res..
[5] Maurice Clerc,et al. The particle swarm - explosion, stability, and convergence in a multidimensional complex space , 2002, IEEE Trans. Evol. Comput..
[6] Riccardo Poli,et al. Particle swarm optimization , 1995, Swarm Intelligence.
[7] Shie Mannor,et al. The Cross Entropy Method for Fast Policy Search , 2003, ICML.
[8] Andries Petrus Engelbrecht,et al. Fundamentals of Computational Swarm Intelligence , 2005 .
[9] 一将 白髪,et al. 捕食者被食者の関係を導入したHeterogeneous Particle Swarm Optimization , 2012 .
[10] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[11] Andrew W. Moore,et al. Variable Resolution Discretization for High-Accuracy Solutions of Optimal Control Problems , 1999, IJCAI.
[12] J. Baxter,et al. Direct gradient-based reinforcement learning , 2000, 2000 IEEE International Symposium on Circuits and Systems. Emerging Technologies for the 21st Century. Proceedings (IEEE Cat No.00CH36353).