A Learning Automata Approach to Multi-agent Policy Gradient Learning
暂无分享,去创建一个
Maarten Peeters | Ann Nowé | Katja Verbeeck | Ville Könönen | A. Nowé | K. Verbeeck | V. Könönen | M. Peeters
[1] Ville Könönen,et al. Gradient descent for symmetric and asymmetric multiagent reinforcement learning , 2005, Web Intell. Agent Syst..
[2] Kumpati S. Narendra,et al. Learning automata - an introduction , 1989 .
[3] M. Thathachar,et al. Networks of Learning Automata: Techniques for Online Stochastic Optimization , 2003 .
[4] Ville Könönen. Multiagent reinforcement learning in Markov games : asymmetric and symmetric approaches , 2004 .
[5] Kee-Eung Kim,et al. Learning to Cooperate via Policy Search , 2000, UAI.
[6] Ronald J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.
[7] M. A. L. Thathachar,et al. Networks of Learning Automata , 2004 .
[8] Ville Könönen,et al. Gradient Based Method for Symmetric and Asymmetric Multiagent Reinforcement Learning , 2003, IDEAL.
[9] J. Filar,et al. Competitive Markov Decision Processes , 1996 .
[10] Yishay Mansour,et al. Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.
[11] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[12] A. Bowker,et al. A test for symmetry in contingency tables. , 1948, Journal of the American Statistical Association.
[13] John N. Tsitsiklis,et al. Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.
[14] Manuela M. Veloso,et al. Multiagent learning using a variable learning rate , 2002, Artif. Intell..
[15] Craig Boutilier,et al. The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems , 1998, AAAI/IAAI.
[16] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[17] Thomas P. Hettmansperger,et al. Bowker's Test for Symmetry , 2004 .