论文信息 - Multiagent learning using a variable learning rate - 字舞流文

Multiagent learning using a variable learning rate

Manuela M. Veloso | Michael H. Bowling | M. Veloso | Michael Bowling

[1] Tommi S. Jaakkola,et al. Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms , 2000, Machine Learning.

[2] William T. B. Uther,et al. Adversarial Reinforcement Learning , 2003 .

[3] Manuela M. Veloso,et al. Rational and Convergent Learning in Stochastic Games , 2001, IJCAI.

[4] Manuela M. Veloso,et al. Convergence of Gradient Dynamics with a Variable Learning Rate , 2001, ICML.

[5] Manuela Veloso,et al. An Analysis of Stochastic Game Theory for Multiagent Reinforcement Learning , 2000 .

[6] Yishay Mansour,et al. Nash Convergence of Gradient Dynamics in General-Sum Games , 2000, UAI.

[7] Michael H. Bowling,et al. Convergence Problems of General-Sum Multiagent Reinforcement Learning , 2000, ICML.

[8] Peter L. Bartlett,et al. Reinforcement Learning in POMDP's via Direct Gradient Ascent , 2000, ICML.

[9] Yishay Mansour,et al. Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.

[10] Michael P. Wellman,et al. Learning in dynamic noncooperative multiagent systems , 1999 .

[11] Andrew W. Moore,et al. Gradient Descent for General Reinforcement Learning , 1998, NIPS.

[12] Andrew G. Barto,et al. Reinforcement learning , 1998 .

[13] Michael P. Wellman,et al. Multiagent Reinforcement Learning: Theoretical Framework and an Algorithm , 1998, ICML.

[14] Craig Boutilier,et al. The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems , 1998, AAAI/IAAI.

[15] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .

[16] D. Fudenberg,et al. The Theory of Learning in Games , 1998 .

[17] H. Kuhn. Classics in Game Theory , 1997 .

[18] Avrim Blum,et al. On-line Learning and the Metrical Task System Problem , 1997, COLT '97.

[19] J. Filar,et al. Competitive Markov Decision Processes , 1996 .

[20] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[21] Ben J. A. Kröse,et al. Learning from delayed rewards , 1995, Robotics Auton. Syst..

[22] Jörgen W. Weibull,et al. Evolutionary Game Theory , 1996 .

[23] Ariel Rubinstein,et al. A Course in Game Theory , 1995 .

[24] Sandip Sen,et al. Learning to Coordinate without Sharing Information , 1994, AAAI.

[25] Michael L. Littman,et al. Markov Games as a Framework for Multi-Agent Reinforcement Learning , 1994, ICML.

[26] Michael I. Jordan,et al. Reinforcement Learning Algorithm for Partially Observable Markov Decision Problems , 1994, NIPS.

[27] L. C. Thomas,et al. Stochastic Games with Finite State and Action Spaces , 1988 .

[28] Hervé Reinhard,et al. Differential equations: Foundations and applications , 1986 .

[29] Nils J. Nilsson,et al. Artificial Intelligence , 1974, IFIP Congress.

[30] O. Mangasarian,et al. Two-person nonzero-sum games and quadratic programming , 1964 .

[31] A. M. Fink,et al. Equilibrium in a stochastic $n$-person game , 1964 .

[32] R. Howard. Dynamic Programming and Markov Processes , 1960 .

[33] L. Shapley,et al. Stochastic Games* , 1953, Proceedings of the National Academy of Sciences.

[34] J. Nash. Equilibrium Points in N-Person Games. , 1950, Proceedings of the National Academy of Sciences of the United States of America.