Adaptive Dynamics Learning and Q-initialization in the context of multiagent learning
[1] Richard Bellman, et al. Adaptive Control Processes: A Guided Tour, 1961, The Mathematical Gazette.
[2] Yoav Shoham, et al. New Criteria and a New Algorithm for Learning in Multi-Agent Systems, 2004, NIPS.
[3] H. Young, et al. The Evolution of Conventions, 1993.
[4] Yishay Mansour, et al. Nash Convergence of Gradient Dynamics in General-Sum Games, 2000, UAI.
[5] Sandip Sen, et al. Learning to Coordinate without Sharing Information, 1994, AAAI.
[6] William T. B. Uther, et al. Adversarial Reinforcement Learning, 2003.
[7] Csaba Szepesvári, et al. A Generalized Reinforcement-Learning Model: Convergence and Applications, 1996, ICML.
[8] Michael P. Wellman, et al. Nash Q-Learning for General-Sum Stochastic Games, 2003, J. Mach. Learn. Res.
[9] Brahim Chaib-draa, et al. Multiagent learning in adaptive dynamic systems, 2007, AAMAS '07.
[10] Michael L. Littman, et al. Friend-or-Foe Q-learning in General-Sum Games, 2001, ICML.
[11] Manuela M. Veloso, et al. Multiagent learning using a variable learning rate, 2002, Artif. Intell.
[12] Gerald Tesauro, et al. Extending Q-Learning to General Adaptive Multi-Agent Systems, 2003, NIPS.
[13] Michael P. Wellman, et al. Multiagent Reinforcement Learning: Theoretical Framework and an Algorithm, 1998, ICML.
[14] L. Shapley, et al. Stochastic Games, 1953, Proceedings of the National Academy of Sciences.
[15] John E. Moody, et al. Note on Learning Rate Schedules for Stochastic Optimization, 1990, NIPS.
[16] Brahim Chaib-draa, et al. Effective Learning in Adaptive Dynamic Systems, 2007, AAAI Spring Symposium: Game Theoretic and Decision Theoretic Agents.
[17] Leslie Pack Kaelbling, et al. Playing is believing: The role of beliefs in multi-agent learning, 2001, NIPS.
[18] Craig Boutilier, et al. The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems, 1998, AAAI/IAAI.
[19] Peter Dayan, et al. Q-learning, 1992, Machine Learning.
[20] Gunes Ercal, et al. On No-Regret Learning, Fictitious Play, and Nash Equilibrium, 2001, ICML.
[21] Reid G. Simmons, et al. The Effect of Representation and Knowledge on Goal-Directed Exploration with Reinforcement-Learning Algorithms, 2005, Machine Learning.
[22] Yoav Shoham, et al. Learning against opponents with bounded memory, 2005, IJCAI.
[23] Michael L. Littman, et al. Value-function reinforcement learning in Markov games, 2001, Cognitive Systems Research.
[24] Brahim Chaib-draa, et al. Apprentissage de la coordination multiagent. Une méthode basée sur le Q-learning par jeu adaptatif, 2006, Rev. d'Intelligence Artif.
[25] Craig Boutilier, et al. Coordination in multiagent reinforcement learning: a Bayesian approach, 2003, AAMAS '03.
[26] M. F., et al. Bibliography, 1985, Experimental Gerontology.
[27] Richard S. Sutton, et al. Learning to predict by the methods of temporal differences, 1988, Machine Learning.
[28] Andrew G. Barto, et al. Learning to Act Using Real-Time Dynamic Programming, 1995, Artif. Intell.
[29] Martin Lauer, et al. An Algorithm for Distributed Reinforcement Learning in Cooperative Multi-Agent Systems, 2000, ICML.
[30] Yoav Shoham, et al. Run the GAMUT: a comprehensive approach to evaluating game-theoretic algorithms, 2004, Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, 2004. AAMAS 2004.
[31] J. Nash. Equilibrium Points in N-Person Games, 1950, Proceedings of the National Academy of Sciences of the United States of America.
[32] J. Neumann, et al. Prisoner's Dilemma, 1993.
[33] Peter Norvig, et al. Artificial Intelligence: A Modern Approach, 1995.
[34] Yoav Shoham, et al. Multi-Agent Reinforcement Learning: a critical survey, 2003.
[35] Neil Immerman, et al. The Complexity of Decentralized Control of Markov Decision Processes, 2000, UAI.
[36] Blai Bonet, et al. Planning as heuristic search, 2001, Artif. Intell.
[37] Xiaofeng Wang, et al. Reinforcement Learning to Play an Optimal Nash Equilibrium in Team Markov Games, 2002, NIPS.
[38] Daniel Kudenko, et al. Reinforcement learning of coordination in cooperative multi-agent systems, 2002, AAAI/IAAI.
[39] Michael L. Littman, et al. Markov Games as a Framework for Multi-Agent Reinforcement Learning, 1994, ICML.
[40] Richard S. Sutton, et al. Reinforcement Learning: An Introduction, 1998, IEEE Trans. Neural Networks.
[41] A. M. Fink, et al. Equilibrium in a stochastic $n$-person game, 1964.