[1] Demis Hassabis, et al. A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play, 2018, Science.
[2] Xinrun Wang, et al. Solving Large-Scale Extensive-Form Network Security Games via Neural Fictitious Self-Play, 2021, IJCAI.
[3] Yan Liu, et al. Policy Learning for Continuous Space Security Games Using Neural Networks, 2018, AAAI.
[4] Kenneth O. Stanley, et al. First return then explore, 2021, Nature.
[5] Vincent Conitzer, et al. A double oracle algorithm for zero-sum security games on graphs, 2011, AAMAS.
[6] Demis Hassabis, et al. Mastering the game of Go without human knowledge, 2017, Nature.
[7] Demis Hassabis, et al. Mastering the game of Go with deep neural networks and tree search, 2016, Nature.
[8] Lantao Yu, et al. Deep Reinforcement Learning for Green Security Games with Real-Time Information, 2018, AAAI.
[9] Milind Tambe, et al. When Security Games Go Green: Designing Defender Strategies to Prevent Poaching and Illegal Fishing, 2015, IJCAI.
[10] Geoff Boeing, et al. OSMnx: New Methods for Acquiring, Constructing, Analyzing, and Visualizing Complex Street Networks, 2016, Comput. Environ. Urban Syst.
[11] Steven Okamoto, et al. Solving non-zero sum multiagent network flow security games with attack costs, 2012, AAMAS.
[12] David Silver, et al. Learning and Planning in Complex Action Spaces, 2021, ICML.
[13] Wojciech M. Czarnecki, et al. Grandmaster level in StarCraft II using multi-agent reinforcement learning, 2019, Nature.
[14] Rémi Coulom, et al. Efficient Selectivity and Backup Operators in Monte-Carlo Tree Search, 2006, Computers and Games.
[15] Xinrun Wang, et al. CFR-MIX: Solving Imperfect Information Extensive-Form Games with Combinatorial Action Space, 2021, IJCAI.
[16] Kevin Waugh, et al. DeepStack: Expert-level artificial intelligence in heads-up no-limit poker, 2017, Science.
[17] Jianfeng Gao, et al. Deep Reinforcement Learning with a Natural Language Action Space, 2015, ACL.
[18] Nicholas R. Jennings, et al. Optimal Interdiction of Urban Criminals with the Aid of Real-Time Information, 2019, AAAI.
[19] Shane Legg, et al. Human-level control through deep reinforcement learning, 2015, Nature.
[20] Nikos A. Vlassis, et al. Optimal and Approximate Q-value Functions for Decentralized POMDPs, 2008, J. Artif. Intell. Res.
[21] Milind Tambe, et al. Scalable Game-Focused Learning of Adversary Models: Data-to-Decisions in Network Security Games, 2020, AAMAS.
[22] Demis Hassabis, et al. Mastering Atari, Go, chess and shogi by planning with a learned model, 2019, Nature.
[23] Nicholas R. Jennings, et al. Optimal Escape Interdiction on Transportation Networks, 2017, IJCAI.
[24] Zhen Wang, et al. Computing Optimal Monitoring Strategy for Detecting Terrorist Plots, 2016, AAAI.
[25] Mykel J. Kochenderfer, et al. Cooperative Multi-agent Control Using Deep Reinforcement Learning, 2017, AAMAS Workshops.
[26] David Silver, et al. Deep Reinforcement Learning from Self-Play in Imperfect-Information Games, 2016, ArXiv.
[27] Branislav Bosanský, et al. An Exact Double-Oracle Algorithm for Zero-Sum Extensive-Form Games with Imperfect Information, 2014, J. Artif. Intell. Res.
[28] Christopher D. Rosin, et al. Multi-armed bandits with episode context, 2011, Annals of Mathematics and Artificial Intelligence.