Classes of Multiagent Q-learning Dynamics with epsilon-greedy Exploration
暂无分享,去创建一个
Michael L. Littman | Monica Babes-Vroman | Michael Wunder | M. Littman | Monica Babes-Vroman | M. Wunder
[1] Robert H. Crites,et al. Multiagent reinforcement learning in the Iterated Prisoner's Dilemma. , 1996, Bio Systems.
[2] Thomas G. Dietterich. What is machine learning? , 2020, Archives of Disease in Childhood.
[3] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[4] Yishay Mansour,et al. Nash Convergence of Gradient Dynamics in General-Sum Games , 2000, UAI.
[5] Peter Stone,et al. Implicit Negotiation in Repeated Games , 2001, ATAL.
[6] Manuela M. Veloso,et al. Rational and Convergent Learning in Stochastic Games , 2001, IJCAI.
[7] Tom Lenaerts,et al. A selection-mutation model for q-learning in multi-agent systems , 2003, AAMAS '03.
[8] John N. Tsitsiklis,et al. Asynchronous Stochastic Approximation and Q-Learning , 1994, Machine Learning.
[9] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[10] C. Budd,et al. Review of ”Piecewise-Smooth Dynamical Systems: Theory and Applications by M. di Bernardo, C. Budd, A. Champneys and P. 2008” , 2020 .
[11] P. Dayan,et al. Reinforcement learning: The Good, The Bad and The Ugly , 2008, Current Opinion in Neurobiology.
[12] Ryszard Kowalczyk,et al. Dynamic analysis of multiagent Q-learning with ε-greedy exploration , 2009, ICML '09.