暂无分享,去创建一个
Pierre Baldi | Roy Fox | Marc Lanctot | Stephen McAleer | John Lanier | Kevin Wang | P. Baldi | Roy Fox | Marc Lanctot | S. McAleer | John Lanier | Kevin A. Wang
[1] Avrim Blum,et al. Planning in the Presence of Cost Functions Controlled by an Adversary , 2003, ICML.
[2] Pierre Baldi,et al. XDO: A Double Oracle Algorithm for Extensive-Form Games , 2021, ArXiv.
[3] David Silver,et al. A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning , 2017, NIPS.
[4] Haitham Bou-Ammar,et al. Online Double Oracle , 2021, ArXiv.
[5] Michael P. Wellman,et al. Iterative Empirical Game Solving via Single Policy Best Response , 2021, ICLR.
[6] Wojciech M. Czarnecki,et al. Grandmaster level in StarCraft II using multi-agent reinforcement learning , 2019, Nature.
[7] Xidong Feng,et al. Discovering Multi-Agent Auto-Curricula in Two-Player Zero-Sum Games , 2021, ArXiv.
[8] Sriram Srinivasan,et al. OpenSpiel: A Framework for Reinforcement Learning in Games , 2019, ArXiv.
[9] Michael H. Bowling,et al. Finding Optimal Abstract Strategies in Extensive-Form Games , 2012, AAAI.
[10] Peter Bro Miltersen,et al. On Range of Skill , 2008, AAAI.
[11] Roy Fox,et al. Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games , 2020, NeurIPS.
[12] Michael H. Bowling,et al. A New Algorithm for Generating Equilibria in Massive Zero-Sum Games , 2007, AAAI.