暂无分享,去创建一个
Joel Z. Leibo | Alexander Sasha Vezhnevets | Yuhuai Wu | Remi Leblond | A. Vezhnevets | Rémi Leblond | Yuhuai Wu
[1] Philip Bachman,et al. Deep Reinforcement Learning that Matters , 2017, AAAI.
[2] Karl Tuyls,et al. alpha-Rank: Multi-Agent Evaluation by Evolution , 2019 .
[3] Max Jaderberg,et al. Open-ended Learning in Symmetric Zero-sum Games , 2019, ICML.
[4] Michael H. Bowling,et al. Solving Imperfect Information Games Using Decomposition , 2013, AAAI.
[5] Marlos C. Machado,et al. Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents , 2017, J. Artif. Intell. Res..
[6] Li Wang,et al. Hierarchical Deep Multiagent Reinforcement Learning , 2018, ArXiv.
[7] Geoffrey E. Hinton,et al. Feudal Reinforcement Learning , 1992, NIPS.
[8] Elman Mansimov,et al. Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation , 2017, NIPS.
[9] Jianye Hao,et al. Hierarchical Deep Multiagent Reinforcement Learning with Temporal Abstraction , 2018 .
[10] Shimon Whiteson,et al. Counterfactual Multi-Agent Policy Gradients , 2017, AAAI.
[11] Yi Wu,et al. Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments , 2017, NIPS.
[12] Shimon Whiteson,et al. QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning , 2018, ICML.
[13] J. Harsanyi. Games with Incomplete Information Played by 'Bayesian' Players, Part III. The Basic Probability Distribution of the Game , 1968 .
[14] Razvan Pascanu,et al. Imagination-Augmented Agents for Deep Reinforcement Learning , 2017, NIPS.
[15] Tom Schaul,et al. FeUdal Networks for Hierarchical Reinforcement Learning , 2017, ICML.
[16] Taehoon Kim,et al. Quantifying Generalization in Reinforcement Learning , 2018, ICML.
[17] Joel Z. Leibo,et al. Malthusian Reinforcement Learning , 2018, AAMAS.
[18] Joel Z. Leibo,et al. Multi-agent Reinforcement Learning in Sequential Social Dilemmas , 2017, AAMAS.
[19] Joel Z. Leibo,et al. Autocurricula and the Emergence of Innovation from Social Interaction: A Manifesto for Multi-Agent Intelligence Research , 2019, ArXiv.
[20] Shane Legg,et al. IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures , 2018, ICML.
[21] Alexander Peysakhovich,et al. Maintaining cooperation in complex social dilemmas using deep reinforcement learning , 2017, ArXiv.
[22] J. Hofbauer,et al. Evolutionary game dynamics , 2011 .
[23] Peter Dayan,et al. Feudal Multi-Agent Hierarchies for Cooperative Reinforcement Learning , 2019, ICLR 2019.
[24] Igor Mordatch,et al. Emergent Tool Use From Multi-Agent Autocurricula , 2019, ICLR.
[25] Doina Precup,et al. The Option-Critic Architecture , 2016, AAAI.
[26] Michael L. Littman,et al. Markov Games as a Framework for Multi-Agent Reinforcement Learning , 1994, ICML.
[27] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.
[28] John C. Harsanyi,et al. Games with Incomplete Information Played by "Bayesian" Players, I-III: Part I. The Basic Model& , 2004, Manag. Sci..
[29] Patrick M. Pilarski,et al. Horde: a scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction , 2011, AAMAS.
[30] Joel Z. Leibo,et al. Inequity aversion improves cooperation in intertemporal social dilemmas , 2018, NeurIPS.
[31] Filipe Wall Mutz,et al. Hindsight policy gradients , 2017, ICLR.
[32] Marcin Andrychowicz,et al. Hindsight Experience Replay , 2017, NIPS.
[33] Doina Precup,et al. Temporal abstraction in reinforcement learning , 2000, ICML 2000.
[34] Tom Schaul,et al. Reinforcement Learning with Unsupervised Auxiliary Tasks , 2016, ICLR.
[35] Doina Precup,et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..
[36] Rémi Munos,et al. Recurrent Experience Replay in Distributed Reinforcement Learning , 2018, ICLR.
[37] Fei Sha,et al. Actor-Attention-Critic for Multi-Agent Reinforcement Learning , 2018, ICML.
[38] Nicolas Heess,et al. Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search , 2018, ICLR.
[39] Nicolas Heess,et al. Hierarchical visuomotor control of humanoids , 2018, ICLR.
[40] David Silver,et al. A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning , 2017, NIPS.
[41] Thore Graepel,et al. Re-evaluating evaluation , 2018, NeurIPS.
[42] Karol Hausman,et al. Learning an Embedding Space for Transferable Robot Skills , 2018, ICLR.