Paper information - Evolving Agents for the Hanabi 2018 CIG Competition

Evolving Agents for the Hanabi 2018 CIG Competition

Hanabi is a cooperative card game with hidden information that has won important awards in the industry and received some recent academic attention. A two-track competition of agents for the game will take place at the 2018 CIG conference. In this paper, we develop a genetic algorithm that builds rule-based agents by determining the best sequence of rules from a fixed rule set to use as a strategy. In three separate experiments, we remove human assumptions regarding the ordering of rules, add new, more expressive rules to the rule set, and independently evolve agents specialized for specific game sizes. As a result, we achieve scores superior to previously published research for the mirror and mixed evaluation of agents.
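
The idea of evolving a rule ordering can be illustrated with a minimal sketch. It is not the paper's implementation. It assumes a chromosome is a permutation of the rule set, and that an agent plays the first applicable rule in that order. The rule names, the operators (tournament selection, order-preserving crossover, swap mutation, elitism) and all parameter values are hypothetical choices for illustration, and `toy_fitness` is a stand-in for what would in practice be an average score over many simulated Hanabi games.

```python
import random

# Hypothetical rule names, for illustration only; not the paper's rule set.
RULES = [
    "play_safe_card",
    "tell_playable_card",
    "discard_useless_card",
    "play_probably_safe_card",
    "tell_random",
    "discard_oldest",
]


def toy_fitness(chromosome):
    """Stand-in fitness: counts rules sitting at the same index as in RULES.

    A real fitness function would play many Hanabi games with the agent
    defined by this ordering and return the mean score.
    """
    return sum(1 for a, b in zip(chromosome, RULES) if a == b)


def tournament(population, fitness, k, rng):
    """Return the fittest of k individuals drawn at random."""
    return max(rng.sample(population, k), key=fitness)


def order_crossover(p1, p2, rng):
    """Keep a slice of p1 in place and fill the remaining slots in p2's order."""
    i, j = sorted(rng.sample(range(len(p1) + 1), 2))
    middle = p1[i:j]
    rest = [rule for rule in p2 if rule not in middle]
    return rest[:i] + middle + rest[i:]


def swap_mutation(chromosome, rate, rng):
    """With probability `rate`, swap two randomly chosen positions."""
    child = list(chromosome)
    if rng.random() < rate:
        a, b = rng.sample(range(len(child)), 2)
        child[a], child[b] = child[b], child[a]
    return child


def evolve(fitness, pop_size=30, generations=40, seed=0):
    """Evolve a rule ordering; the best individual always survives (elitism)."""
    rng = random.Random(seed)
    population = [rng.sample(RULES, len(RULES)) for _ in range(pop_size)]
    for _ in range(generations):
        children = [max(population, key=fitness)]  # carry the elite over
        while len(children) < pop_size:
            p1 = tournament(population, fitness, 3, rng)
            p2 = tournament(population, fitness, 3, rng)
            child = order_crossover(p1, p2, rng)
            children.append(swap_mutation(child, 0.2, rng))
        population = children
    return max(population, key=fitness)


if __name__ == "__main__":
    best = evolve(toy_fitness)
    print(best, toy_fitness(best))
```

Because the crossover and mutation operators only reorder rules, every individual remains a valid permutation of the rule set. With a noisy fitness such as simulated game scores, each evaluation would likely need to average over many games before selection.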
