Reinforcement Learning Approaches to Coordination in Cooperative Multi-agent Systems
暂无分享,去创建一个
Daniel Kudenko | Malcolm J. A. Strens | Spiros Kapetanakis | M. Strens | D. Kudenko | S. Kapetanakis
[1] C. Watkins. Learning from delayed rewards , 1989 .
[2] Ming Tan,et al. Multi-Agent Reinforcement Learning: Independent versus Cooperative Agents , 1997, ICML.
[3] Gerhard Weiss,et al. Learning to Coordinate Actions in Multi-Agent-Systems , 1993, IJCAI.
[4] Sandip Sen,et al. Learning to Coordinate without Sharing Information , 1994, AAAI.
[5] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..
[6] Sandip Sen,et al. Individual learning of coordination knowledge , 1998, J. Exp. Theor. Artif. Intell..
[7] Craig Boutilier,et al. The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems , 1998, AAAI/IAAI.
[8] D. Fudenberg,et al. The Theory of Learning in Games , 1998 .
[9] Craig Boutilier,et al. Sequential Optimality and Coordination in Multiagent Systems , 1999, IJCAI.
[10] Martin Lauer,et al. An Algorithm for Distributed Reinforcement Learning in Cooperative Multi-Agent Systems , 2000, ICML.
[11] Tommi S. Jaakkola,et al. Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms , 2000, Machine Learning.