New Criteria and a New Algorithm for Learning in Multi-Agent Systems
暂无分享,去创建一个
[1] O. H. Brownlee,et al. ACTIVITY ANALYSIS OF PRODUCTION AND ALLOCATION , 1952 .
[2] W. Hoeffding. On the Distribution of the Number of Successes in Independent Trials , 1956 .
[3] E. Kalai,et al. Rational Learning Leads to Nash Equilibrium , 1993 .
[4] D. Fudenberg,et al. Consistency and Cautious Fictitious Play , 1995 .
[5] S. Hart,et al. A simple adaptive procedure leading to correlated equilibrium , 2000 .
[6] Regret in the On-line Decision , 1997 .
[7] Craig Boutilier,et al. The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems , 1998, AAAI/IAAI.
[8] D. Fudenberg,et al. The Theory of Learning in Games , 1998 .
[9] Sandip Sen,et al. Learning in multiagent systems , 1999 .
[10] D. Fudenberg,et al. Conditional Universal Consistency , 1999 .
[11] Dean P. Foster,et al. Regret in the On-Line Decision Problem , 1999 .
[12] Manuela M. Veloso,et al. Multiagent Systems: A Survey from a Machine Learning Perspective , 2000, Auton. Robots.
[13] Leonid Sheremetov,et al. Weiss, Gerhard. Multiagent Systems a Modern Approach to Distributed Artificial Intelligence , 2009 .
[14] Yishay Mansour,et al. Nash Convergence of Gradient Dynamics in General-Sum Games , 2000, UAI.
[15] Peter Stone,et al. Implicit Negotiation in Repeated Games , 2001, ATAL.
[16] Manuela M. Veloso,et al. Multiagent learning using a variable learning rate , 2002, Artif. Intell..
[17] Ronen I. Brafman,et al. Efficient learning equilibrium , 2004, Artificial Intelligence.
[18] Gerald Tesauro,et al. Extending Q-Learning to General Adaptive Multi-Agent Systems , 2003, NIPS.
[19] Yoav Shoham,et al. Multi-Agent Reinforcement Learning:a critical survey , 2003 .
[20] Peter Dayan,et al. Technical Note: Q-Learning , 2004, Machine Learning.
[21] Yoav Shoham,et al. Run the GAMUT: a comprehensive approach to evaluating game-theoretic algorithms , 2004, Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, 2004. AAMAS 2004..
[22] Vincent Conitzer,et al. AWESOME: A general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents , 2003, Machine Learning.