Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
暂无分享,去创建一个
Sarah H. Cen | Eugene V. Tarassov | Jerome T. Connor | L. Sifre | Tobias Pohlen | D. Hassabis | R. Munos | Mark Rowland | Satinder Singh | Bilal Piot | Marc Lanctot | A. Gruslys | K. Tuyls | J. Pérolat | Thomas W. Anthony | Sherjil Ozair | Edward Lockhart | J. Lespiau | Neil Burch | Shayegan Omidshafiei | Florian Strub | Tom Eccles | Daniel Hennes | Zhe Wang | R. Elie | Paul Muller | Aleksandra Malysheva | B. D. Vylder | Finbarr Timbers | S. McAleer | David Silver | V. D. Boer | Mina Khan | Nathalie Beauguerlange | Eugene Tarassov
[1] Matteo Hessel,et al. Podracer architectures for scalable Reinforcement Learning , 2021, ArXiv.
[2] Richard G. Gibson. Regret Minimization in Games and the Development of Champion Multiplayer Computer Poker-Playing Agents , 2014 .
[3] R. Howe,et al. 17th International Conference on Medical Image Computing and Computer-Assisted Intervention. , 2014, Medical image computing and computer-assisted intervention : MICCAI ... International Conference on Medical Image Computing and Computer-Assisted Intervention.
[4] Michael Johanson,et al. Measuring the Size of Large No-Limit Poker Games , 2013, ArXiv.
[5] Geoffrey E. Hinton,et al. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition-' Washington , D . C . , June , 1983 OPTIMAL PERCEPTUAL INFERENCE , 2011 .
[6] Léon J. M. Rothkrantz,et al. Invincible - A Stratego Bot , 2008, Int. J. Intell. Games Simul..
[7] Michael I. Jordan,et al. Advances in Neural Information Processing Systems 30 , 1995 .
[8] J M Smith,et al. Evolution and the theory of games , 1976 .
[9] E. Rowland. Theory of Games and Economic Behavior , 1946, Nature.