暂无分享,去创建一个
Roy Fox | Pierre Baldi | Stephen McAleer | John Lanier | P. Baldi | Roy Fox | S. McAleer | John Lanier
[1] Guy Lever,et al. A Generalized Training Approach for Multiagent Learning , 2020, ICLR.
[2] Tuomas Sandholm,et al. Deep Counterfactual Regret Minimization , 2018, ICML.
[3] Manuela M. Veloso,et al. Multiagent learning using a variable learning rate , 2002, Artif. Intell..
[4] Wojciech M. Czarnecki,et al. Grandmaster level in StarCraft II using multi-agent reinforcement learning , 2019, Nature.
[5] Michael I. Jordan,et al. RLlib: Abstractions for Distributed Reinforcement Learning , 2017, ICML.
[6] Avrim Blum,et al. Planning in the Presence of Cost Functions Controlled by an Adversary , 2003, ICML.
[7] Michael H. Bowling,et al. Regret Minimization in Games with Incomplete Information , 2007, NIPS.
[8] Michael I. Jordan,et al. Ray: A Distributed Framework for Emerging AI Applications , 2017, OSDI.
[9] Sergey Levine,et al. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor , 2018, ICML.
[10] Jakub W. Pachocki,et al. Dota 2 with Large Scale Deep Reinforcement Learning , 2019, ArXiv.
[11] David Silver,et al. Deep Reinforcement Learning from Self-Play in Imperfect-Information Games , 2016, ArXiv.
[12] Michael H. Bowling,et al. Actor-Critic Policy Optimization in Partially Observable Multiagent Environments , 2018, NeurIPS.
[13] Tim Roughgarden,et al. Algorithmic Game Theory , 2007 .
[14] Demis Hassabis,et al. Mastering the game of Go without human knowledge , 2017, Nature.
[15] Guy Lever,et al. Human-level performance in 3D multiplayer games with population-based reinforcement learning , 2018, Science.
[16] D. Fudenberg,et al. The Theory of Learning in Games , 1998 .
[17] Yoav Shoham,et al. Multiagent Systems - Algorithmic, Game-Theoretic, and Logical Foundations , 2009 .
[18] Noam Brown,et al. Superhuman AI for heads-up no-limit poker: Libratus beats top professionals , 2018, Science.
[19] Mark H. M. Winands,et al. Quiescence Search for Stratego , 2009 .
[20] Max Jaderberg,et al. Open-ended Learning in Symmetric Zero-sum Games , 2019, ICML.
[21] Petros Christodoulou,et al. Soft Actor-Critic for Discrete Action Settings , 2019, ArXiv.
[22] David Silver,et al. A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning , 2017, NIPS.
[23] Yuxi Li,et al. Deep Reinforcement Learning: An Overview , 2017, ArXiv.
[24] David Silver,et al. Fictitious Self-Play in Extensive-Form Games , 2015, ICML.
[25] P. Taylor,et al. Evolutionarily Stable Strategies and Game Dynamics , 1978 .
[26] Michael H. Bowling,et al. The Advantage Regret-Matching Actor-Critic , 2020, ArXiv.
[27] Adam Lerer,et al. DREAM: Deep Regret minimization with Advantage baselines and Model-free learning , 2020, ArXiv.
[28] Maarten P. D. Schadd,et al. The 3rd Stratego Computer World Championship , 2009 .