Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning

Julien Perolat*,1,‡, Bart de Vylder∗,1,‡, Daniel Hennes1, Eugene Tarassov1, Florian Strub1, Vincent de Boer†1, Paul Muller1, Jerome T. Connor1, Neil Burch1, Thomas Anthony1, Stephen McAleer1, Romuald Elie1, Sarah H. Cen1, Zhe Wang1, Audrunas Gruslys1, Aleksandra Malysheva1, Mina Khan1, Sherjil Ozair1, Finbarr Timbers1, Toby Pohlen1, Tom Eccles1, Mark Rowland1, Marc Lanctot1, Jean-Baptiste Lespiau1, Bilal Piot1, Shayegan Omidshafiei1, Edward Lockhart1, Laurent Sifre1, Nathalie Beauguerlange1, Remi Munos1, David Silver1, Satinder Singh1, Demis Hassabis1, and Karl Tuyls∗,1,‡