A novel multi-agent Q-learning algorithm in cooperative multi-agent system
暂无分享,去创建一个
Xu Xiaoming | Zhang Weidong | Zhang Wenyuan | Ou Haitao | Zhang Weidong | X. Xiaoming | Zhang Wenyuan | Ou Haitao
[1] Michael L. Littman,et al. Markov Games as a Framework for Multi-Agent Reinforcement Learning , 1994, ICML.
[2] E. Kalai,et al. Rational Learning Leads to Nash Equilibrium , 1993 .
[3] Richard Wheeler,et al. Decentralized learning in finite Markov chains , 1985, 1985 24th IEEE Conference on Decision and Control.
[4] Craig Boutilier,et al. The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems , 1998, AAAI/IAAI.
[5] K. Narendra,et al. Decentralized learning in finite Markov chains , 1985, 1985 24th IEEE Conference on Decision and Control.
[6] Sandip Sen,et al. Learning to Coordinate without Sharing Information , 1994, AAAI.
[7] Junling Hu,et al. Self-fulfilling Bias in Multiagent Learning , 1996 .