Extending World Models for Multi-Agent Reinforcement Learning in MALMÖ
暂无分享,去创建一个
Feryal M. P. Behbahani | Valliappa Chockalingam | Tegg Taekyong Sung | Tegg Tae Kyong Sung | Feryal Behbahani | Rishab Gargeya | Amlesh Sivanantham | Aleksandra Malysheva | Valliappa Chockalingam | Rishab Gargeya | Aleksandra Malysheva | Amlesh Sivanantham
[1] Shimon Whiteson,et al. Counterfactual Multi-Agent Policy Gradients , 2017, AAAI.
[2] Guigang Zhang,et al. Deep Learning , 2016, Int. J. Semantic Comput..
[3] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[4] Marc G. Bellemare,et al. As Expected ? An Analysis of Distributional Reinforcement Learning , 2018 .
[5] Marc G. Bellemare,et al. A Distributional Perspective on Reinforcement Learning , 2017, ICML.
[6] Nikolaus Hansen,et al. Completely Derandomized Self-Adaptation in Evolution Strategies , 2001, Evolutionary Computation.
[7] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.
[8] Jürgen Schmidhuber,et al. World Models , 2018, ArXiv.
[9] Masashi Sugiyama,et al. Nonparametric Return Distribution Approximation for Reinforcement Learning , 2010, ICML.
[10] Tom M. Mitchell,et al. The Need for Biases in Learning Generalizations , 2007 .
[11] Tom Schaul,et al. Rainbow: Combining Improvements in Deep Reinforcement Learning , 2017, AAAI.
[12] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[13] Sanja Fidler,et al. NerveNet: Learning Structured Policy with Graph Neural Networks , 2018, ICLR.
[14] Anil A. Bharath,et al. Deep Reinforcement Learning: A Brief Survey , 2017, IEEE Signal Processing Magazine.
[15] Peter Dayan,et al. Q-learning , 1992, Machine Learning.
[16] Guy Lever,et al. Value-Decomposition Networks For Cooperative Multi-Agent Learning Based On Team Reward , 2018, AAMAS.
[17] Fabio Viola,et al. Learning and Querying Fast Generative Models for Reinforcement Learning , 2018, ArXiv.
[18] Bo An,et al. HogRider: Champion Agent of Microsoft Malmo Collaborative AI Challenge , 2018, AAAI.
[19] R. J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.