Learning to Communicate with Deep Multi-Agent Reinforcement Learning
暂无分享,去创建一个
Shimon Whiteson | Nando de Freitas | Jakob N. Foerster | Jakob Foerster | Yannis M. Assael | S. Whiteson | N. D. Freitas | Yannis Assael | Shimon Whiteson
[1] Ming Tan,et al. Multi-Agent Reinforcement Learning: Independent versus Cooperative Agents , 1997, ICML.
[2] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[3] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.
[4] C. Lee Giles,et al. Learning Communication for Multi-agent Systems , 2002, WRAC.
[5] Sean Luke,et al. Cooperative Multi-Agent Learning: The State of the Art , 2005, Autonomous Agents and Multi-Agent Systems.
[6] Nikos A. Vlassis,et al. Optimal and Approximate Q-value Functions for Decentralized POMDPs , 2008, J. Artif. Intell. Res..
[7] A. Kamiya,et al. Learning of communication codes in multi-agent reinforcement learning problem , 2008, 2008 IEEE Conference on Soft Computing in Industrial Applications.
[8] Yoav Shoham,et al. Multiagent Systems - Algorithmic, Game-Theoretic, and Logical Foundations , 2009 .
[9] 100 PRISONERS AND A LIGHT BULB , 2009 .
[10] Geoffrey E. Hinton,et al. Discovering Binary Codes for Documents by Learning Deep Generative Models , 2011, Top. Cogn. Sci..
[11] Eduardo F. Morales,et al. An Introduction to Reinforcement Learning , 2011 .
[12] Francisco S. Melo,et al. QueryPOMDP: POMDP-Based Communication in Multiagent Systems , 2011, EUMAS.
[13] Victor R. Lesser,et al. Coordinating multi-agent reinforcement learning with limited communication , 2013, AAMAS.
[14] Yoshua Bengio,et al. On the Properties of Neural Machine Translation: Encoder–Decoder Approaches , 2014, SSST@EMNLP.
[15] Kevin Leyton-Brown,et al. Empirically Evaluating Multiagent Learning Algorithms , 2014, ArXiv.
[16] Yoshua Bengio,et al. Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.
[17] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.
[18] Wojciech Zaremba,et al. An Empirical Exploration of Recurrent Network Architectures , 2015, ICML.
[19] Alex Graves,et al. DRAW: A Recurrent Neural Network For Image Generation , 2015, ICML.
[20] Regina Barzilay,et al. Language Understanding for Text-based Games using Deep Reinforcement Learning , 2015, EMNLP.
[21] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[22] Peter Stone,et al. Deep Recurrent Q-Learning for Partially Observable MDPs , 2015, AAAI Fall Symposia.
[23] Yoshua Bengio,et al. BinaryNet: Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1 , 2016, ArXiv.
[24] Shimon Whiteson,et al. Learning to Communicate to Solve Riddles with Deep Distributed Recurrent Q-Networks , 2016, ArXiv.
[25] Bikramjit Banerjee,et al. Multi-agent reinforcement learning as a rehearsal for decentralized planning , 2016, Neurocomputing.
[26] Rob Fergus,et al. Learning Multiagent Communication with Backpropagation , 2016, NIPS.
[27] Ran El-Yaniv,et al. Binarized Neural Networks , 2016, NIPS.
[28] Dorian Kodelja,et al. Multiagent cooperation and competition with deep reinforcement learning , 2015, PloS one.