Coco-Q: Learning in Stochastic Games with Side Payments
暂无分享,去创建一个
Michael L. Littman | Amy Greenwald | Eric Sodomka | Elizabeth Hilliard | M. Littman | A. Greenwald | Eric Sodomka | Elizabeth Hilliard
[1] Michael P. Wellman,et al. Nash Q-Learning for General-Sum Stochastic Games , 2003, J. Mach. Learn. Res..
[2] Xiaotie Deng,et al. Settling the Complexity of Two-Player Nash Equilibrium , 2006, 2006 47th Annual IEEE Symposium on Foundations of Computer Science (FOCS'06).
[3] Keith B. Hall,et al. Correlated Q-Learning , 2003, ICML.
[4] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[5] Michael H. Bowling,et al. Convergence Problems of General-Sum Multiagent Reinforcement Learning , 2000, ICML.
[6] Adam Tauman Kalai,et al. Cooperation in Strategic Games Revisited , 2013 .
[7] Csaba Szepesvári,et al. A Generalized Reinforcement-Learning Model: Convergence and Applications , 1996, ICML.
[8] L. Shapley,et al. Stochastic Games* , 1953, Proceedings of the National Academy of Sciences.
[9] Michael L. Littman,et al. Markov Games as a Framework for Multi-Agent Reinforcement Learning , 1994, ICML.
[10] Adam Tauman Kalai,et al. Cooperation and competition in strategic games with private information , 2010, EC '10.
[11] Michael L. Littman,et al. Cyclic Equilibria in Markov Games , 2005, NIPS.
[12] Ryszard Kowalczyk,et al. Dynamic analysis of multiagent Q-learning with ε-greedy exploration , 2009, ICML '09.
[13] Eitan Zemel,et al. Nash and correlated equilibria: Some complexity considerations , 1989 .
[14] Robert H. Crites,et al. Multiagent reinforcement learning in the Iterated Prisoner's Dilemma. , 1996, Bio Systems.
[15] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .
[16] Thomas G. Dietterich. What is machine learning? , 2020, Archives of Disease in Childhood.
[17] Michael L. Littman,et al. A Polynomial-time Nash Equilibrium Algorithm for Repeated Stochastic Games , 2008, UAI.
[18] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[19] Michael L. Littman,et al. Friend-or-Foe Q-learning in General-Sum Games , 2001, ICML.