The Dynamics of Multi-Agent Reinforcement Learning
暂无分享,去创建一个
[1] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[2] Michael I. Jordan,et al. Learning Without State-Estimation in Partially Observable Markovian Decision Processes , 1994, ICML.
[3] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..
[4] Luke Dickens,et al. Learning to Act Stochastically , 2009 .
[5] Peter L. Bartlett,et al. Infinite-Horizon Policy-Gradient Estimation , 2001, J. Artif. Intell. Res..
[6] Michael L. Littman,et al. Markov Games as a Framework for Multi-Agent Reinforcement Learning , 1994, ICML.
[7] Stefan Schaal,et al. Natural Actor-Critic , 2003, Neurocomputing.
[8] Manuela M. Veloso,et al. Existence of Multiagent Equilibria with Limited Agents , 2004, J. Artif. Intell. Res..
[9] Karl Tuyls,et al. An Evolutionary Dynamical Analysis of Multi-Agent Learning in Iterated Games , 2005, Autonomous Agents and Multi-Agent Systems.
[10] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[11] Shalabh Bhatnagar,et al. Incremental Natural Actor-Critic Algorithms , 2007, NIPS.
[12] Manuela M. Veloso,et al. Simultaneous Adversarial Multi-Robot Learning , 2003, IJCAI.
[13] Manuela M. Veloso,et al. Multiagent learning using a variable learning rate , 2002, Artif. Intell..
[14] D. Fudenberg,et al. The Theory of Learning in Games , 1998 .
[15] Neil Immerman,et al. The Complexity of Decentralized Control of Markov Decision Processes , 2000, UAI.
[16] John N. Tsitsiklis,et al. Actor-Critic Algorithms , 1999, NIPS.
[17] Michael L. Littman,et al. Cyclic Equilibria in Markov Games , 2005, NIPS.
[18] Eric van Damme,et al. Non-Cooperative Games , 2000 .