暂无分享,去创建一个
[1] Michael L. Littman,et al. Packet Routing in Dynamically Changing Networks: A Reinforcement Learning Approach , 1993, NIPS.
[2] Craig Boutilier,et al. The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems , 1998, AAAI/IAAI.
[3] Kee-Eung Kim,et al. Learning to Cooperate via Policy Search , 2000, UAI.
[4] Yishay Mansour,et al. Nash Convergence of Gradient Dynamics in General-Sum Games , 2000, UAI.
[5] Michael L. Littman,et al. Value-function reinforcement learning in Markov games , 2001, Cognitive Systems Research.
[6] Manuela M. Veloso,et al. Multiagent learning using a variable learning rate , 2002, Artif. Intell..
[7] Xiaofeng Wang,et al. Reinforcement Learning to Play an Optimal Nash Equilibrium in Team Markov Games , 2002, NIPS.
[8] Bikramjit Banerjee,et al. Adaptive policy gradient in multiagent learning , 2003, AAMAS '03.
[9] Michael P. Wellman,et al. Nash Q-Learning for General-Sum Stochastic Games , 2003, J. Mach. Learn. Res..
[10] Martin Zinkevich,et al. Online Convex Programming and Generalized Infinitesimal Gradient Ascent , 2003, ICML.
[11] Michael H. Bowling,et al. Convergence and No-Regret in Multiagent Learning , 2004, NIPS.
[12] Nicholas R. Jennings,et al. Cooperative Information Sharing to Improve Distributed Learning in Multi-Agent Systems , 2005, J. Artif. Intell. Res..
[13] Karl Tuyls,et al. An Evolutionary Dynamical Analysis of Multi-Agent Learning in Iterated Games , 2005, Autonomous Agents and Multi-Agent Systems.
[14] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[15] Vincent Conitzer,et al. AWESOME: A general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents , 2003, Machine Learning.
[16] Victor R. Lesser,et al. Learning the task allocation game , 2006, AAMAS '06.
[17] Victor R. Lesser,et al. Multiagent reinforcement learning and self-organization in a network of agents , 2007, AAMAS '07.
[18] Bikramjit Banerjee,et al. Generalized multiagent learning with performance bound , 2007, Autonomous Agents and Multi-Agent Systems.
[19] Victor R. Lesser,et al. Non-linear dynamics in multiagent reinforcement learning algorithms , 2008, AAMAS.