Cyclic Equilibria in Markov Games
暂无分享,去创建一个
Michael L. Littman | Martin Zinkevich | Amy Greenwald | M. Littman | Martin A. Zinkevich | A. Greenwald
[1] L. Shapley,et al. Stochastic Games* , 1953, Proceedings of the National Academy of Sciences.
[2] R. Bellman. Dynamic programming. , 1957, Science.
[3] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .
[4] Ariel Rubinstein,et al. A Course in Game Theory , 1995 .
[5] Csaba Szepesvári,et al. A Generalized Reinforcement-Learning Model: Convergence and Applications , 1996, ICML.
[6] Michael P. Wellman,et al. Multiagent Reinforcement Learning: Theoretical Framework and an Algorithm , 1998, ICML.
[7] Michael L. Littman,et al. Friend-or-Foe Q-learning in General-Sum Games , 2001, ICML.
[8] Ronen I. Brafman,et al. R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning , 2001, J. Mach. Learn. Res..
[9] Thomas Dyhre Nielsen,et al. Symbolic and Quantitative Approaches to Reasoning with Uncertainty , 2003, Lecture Notes in Computer Science.
[10] Keith B. Hall,et al. Correlated Q-Learning , 2003, ICML.
[11] Jeffrey O. Kephart,et al. Pricing in Agent Economies Using Multi-Agent Q-Learning , 2002, Autonomous Agents and Multi-Agent Systems.