Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery
暂无分享,去创建一个
[1] Doina Precup,et al. Temporal abstraction in reinforcement learning , 2000, ICML 2000.
[2] Wenwu Yu,et al. An Overview of Recent Progress in the Study of Distributed Multi-Agent Coordination , 2012, IEEE Transactions on Industrial Informatics.
[3] Matthew E. Taylor,et al. A survey and critique of multiagent deep reinforcement learning , 2018, Autonomous Agents and Multi-Agent Systems.
[4] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[5] Zhi Zhang,et al. Integrating Independent and Centralized Multi-agent Reinforcement Learning for Traffic Signal Network Optimization , 2019, AAMAS.
[6] Neil Immerman,et al. The Complexity of Decentralized Control of Markov Decision Processes , 2000, UAI.
[7] Sean Luke,et al. Cooperative Multi-Agent Learning: The State of the Art , 2005, Autonomous Agents and Multi-Agent Systems.
[8] Michael L. Littman,et al. Markov Games as a Framework for Multi-Agent Reinforcement Learning , 1994, ICML.
[9] Joshua B. Tenenbaum,et al. Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation , 2016, NIPS.
[10] Ahmad Beirami,et al. On Multi-Agent Learning in Team Sports Games , 2019, ArXiv.
[11] Joseph J. Lim,et al. Learning to Coordinate Manipulation Skills via Skill Behavior Diversification , 2020, ICLR.
[12] Yisong Yue,et al. Coordinated Multi-Agent Imitation Learning , 2017, ICML.
[13] Kuldip K. Paliwal,et al. Bidirectional recurrent neural networks , 1997, IEEE Trans. Signal Process..
[14] Geoffrey E. Hinton,et al. Feudal Reinforcement Learning , 1992, NIPS.
[15] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.
[16] Doina Precup,et al. The Option-Critic Architecture , 2016, AAAI.
[17] Li Wang,et al. Hierarchical Deep Multiagent Reinforcement Learning , 2018, ArXiv.
[18] Peter Dayan,et al. Feudal Multi-Agent Hierarchies for Cooperative Reinforcement Learning , 2019, ICLR 2019.
[19] Doina Precup,et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..
[20] John Schulman,et al. Concrete Problems in AI Safety , 2016, ArXiv.
[21] Sridhar Mahadevan,et al. Hierarchical multi-agent reinforcement learning , 2001, AGENTS '01.
[22] Shimon Whiteson,et al. QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning , 2018, ICML.
[23] David Isele,et al. CM3: Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning , 2018, ICLR.
[24] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.
[25] Peter Dayan,et al. Q-learning , 1992, Machine Learning.
[26] Sarit Kraus,et al. Ad Hoc Autonomous Agent Teams: Collaboration without Pre-Coordination , 2010, AAAI.
[27] Doina Precup,et al. Learning Options in Reinforcement Learning , 2002, SARA.
[28] L. Bornn,et al. Characterizing the spatial structure of defensive skill in professional basketball , 2014, 1405.0231.
[29] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[30] Patrice Marcotte,et al. An overview of bilevel optimization , 2007, Ann. Oper. Res..
[31] Guy Lever,et al. Value-Decomposition Networks For Cooperative Multi-Agent Learning Based On Team Reward , 2018, AAMAS.
[32] Andrew G. Barto,et al. Automatic Discovery of Subgoals in Reinforcement Learning using Diverse Density , 2001, ICML.
[33] Olivier Bachem,et al. Google Research Football: A Novel Reinforcement Learning Environment , 2020, AAAI.
[34] Yung Yi,et al. QTRAN: Learning to Factorize with Transformation for Cooperative Multi-Agent Reinforcement Learning , 2019, ICML.
[35] Tom Schaul,et al. FeUdal Networks for Hierarchical Reinforcement Learning , 2017, ICML.
[36] Daan Wierstra,et al. Variational Intrinsic Control , 2016, ICLR.
[37] Mohsen Sardari,et al. Winning Isn't Everything: Training Human-Like Agents for Playtesting and Game AI , 2019, ArXiv.
[38] Manuela M. Veloso,et al. Multiagent Systems: A Survey from a Machine Learning Perspective , 2000, Auton. Robots.
[39] Sergey Levine,et al. Diversity is All You Need: Learning Skills without a Reward Function , 2018, ICLR.
[40] Sridhar Mahadevan,et al. Learning to Take Concurrent Actions , 2002, NIPS.
[41] Alan Fern,et al. Bayesian Policy Search for Multi-Agent Role Discovery , 2010, AAAI.
[42] Guy Lever,et al. Emergent Coordination Through Competition , 2019, ICLR.
[43] Shimon Whiteson,et al. Counterfactual Multi-Agent Policy Gradients , 2017, AAAI.