Multiagent learning using a variable learning rate

[1]  Tommi S. Jaakkola,et al.  Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms , 2000, Machine Learning.

[2]  William T. B. Uther,et al.  Adversarial Reinforcement Learning , 2003 .

[3]  Manuela M. Veloso,et al.  Rational and Convergent Learning in Stochastic Games , 2001, IJCAI.

[4]  Manuela M. Veloso,et al.  Convergence of Gradient Dynamics with a Variable Learning Rate , 2001, ICML.

[5]  Manuela Veloso,et al.  An Analysis of Stochastic Game Theory for Multiagent Reinforcement Learning , 2000 .

[6]  Yishay Mansour,et al.  Nash Convergence of Gradient Dynamics in General-Sum Games , 2000, UAI.

[7]  Michael H. Bowling,et al.  Convergence Problems of General-Sum Multiagent Reinforcement Learning , 2000, ICML.

[8]  Peter L. Bartlett,et al.  Reinforcement Learning in POMDP's via Direct Gradient Ascent , 2000, ICML.

[9]  Yishay Mansour,et al.  Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.

[10]  Michael P. Wellman,et al.  Learning in dynamic noncooperative multiagent systems , 1999 .

[11]  Andrew W. Moore,et al.  Gradient Descent for General Reinforcement Learning , 1998, NIPS.

[12]  Andrew G. Barto,et al.  Reinforcement learning , 1998 .

[13]  Michael P. Wellman,et al.  Multiagent Reinforcement Learning: Theoretical Framework and an Algorithm , 1998, ICML.

[14]  Craig Boutilier,et al.  The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems , 1998, AAAI/IAAI.

[15]  Richard S. Sutton,et al.  Introduction to Reinforcement Learning , 1998 .

[16]  D. Fudenberg,et al.  The Theory of Learning in Games , 1998 .

[17]  H. Kuhn Classics in Game Theory , 1997 .

[18]  Avrim Blum,et al.  On-line Learning and the Metrical Task System Problem , 1997, COLT '97.

[19]  J. Filar,et al.  Competitive Markov Decision Processes , 1996 .

[20]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[21]  Ben J. A. Kröse,et al.  Learning from delayed rewards , 1995, Robotics Auton. Syst..

[22]  Jörgen W. Weibull,et al.  Evolutionary Game Theory , 1996 .

[23]  Ariel Rubinstein,et al.  A Course in Game Theory , 1995 .

[24]  Sandip Sen,et al.  Learning to Coordinate without Sharing Information , 1994, AAAI.

[25]  Michael L. Littman,et al.  Markov Games as a Framework for Multi-Agent Reinforcement Learning , 1994, ICML.

[26]  Michael I. Jordan,et al.  Reinforcement Learning Algorithm for Partially Observable Markov Decision Problems , 1994, NIPS.

[27]  L. C. Thomas,et al.  Stochastic Games with Finite State and Action Spaces , 1988 .

[28]  Hervé Reinhard,et al.  Differential equations: Foundations and applications , 1986 .

[29]  Nils J. Nilsson,et al.  Artificial Intelligence , 1974, IFIP Congress.

[30]  O. Mangasarian,et al.  Two-person nonzero-sum games and quadratic programming , 1964 .

[31]  A. M. Fink,et al.  Equilibrium in a stochastic $n$-person game , 1964 .

[32]  R. Howard Dynamic Programming and Markov Processes , 1960 .

[33]  L. Shapley,et al.  Stochastic Games* , 1953, Proceedings of the National Academy of Sciences.

[34]  J. Nash Equilibrium Points in N-Person Games. , 1950, Proceedings of the National Academy of Sciences of the United States of America.