论文信息 - Playing Multiaction Adversarial Games: Online Evolutionary Planning Versus Tree Search

Playing Multiaction Adversarial Games: Online Evolutionary Planning Versus Tree Search

We address the problem of playing turn-based multiaction adversarial games, which include many strategy games with extremely high branching factors as players take multiple actions each turn. This leads to the breakdown of standard tree search methods, including Monte Carlo tree search (MCTS), as they become unable to reach a sufficient depth in the game tree. In this paper, we introduce online evolutionary planning (OEP) to address this challenge, which searches for combinations of actions to perform during a single turn guided by a fitness function that evaluates the quality of a particular state. We compare OEP to different MCTS variations that constrain the exploration to deal with the high branching factor in the turn-based multiaction game Hero Academy. While the constrained MCTS variations outperform the vanilla MCTS implementation by a large margin, OEP is able to search the space of plans more efficiently than any of the tested tree search methods as it has a relative advantage when the number of actions per turn increases.

[1] Peter I. Cowling,et al. Monte Carlo Tree Search with macro-actions and heuristic route planning for the Physical Travelling Salesman Problem , 2012, 2012 IEEE Conference on Computational Intelligence and Games (CIG).

[2] Bruno Bouzy,et al. Computer Go: An AI oriented survey , 2001, Artif. Intell..

[3] Abdallah Saffidine,et al. UCD: Upper Confidence Bound for Rooted Directed Acyclic Graphs , 2010 .

[4] Tristan Cazenave,et al. A Phantom-Go Program , 2006, ACG.

[5] Peter I. Cowling,et al. Ensemble Determinization in Monte Carlo Tree Search for the Imperfect Information Card Game Magic: The Gathering , 2012, IEEE Transactions on Computational Intelligence and AI in Games.

[6] Tomáš Kozelek,et al. Methods of MCTS and the game Arimaa , 2009 .

[7] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.

[8] Richard K. Belew,et al. New Methods for Competitive Coevolution , 1997, Evolutionary Computation.

[9] David P. Helmbold,et al. All-Moves-As-First Heuristics in Monte-Carlo Go , 2009, IC-AI.

[10] Simon M. Lucas,et al. Rolling horizon evolution versus tree search for navigation in single-player real-time games , 2013, GECCO '13.

[11] S.M. Lucas,et al. Evolutionary computation and games , 2006, IEEE Computational Intelligence Magazine.

[12] Michael Buro,et al. Hierarchical Portfolio Search: Prismata's Robust AI Architecture for Games with Large Search Spaces , 2015, AIIDE.

[13] Risto Miikkulainen,et al. Real-time neuroevolution in the NERO video game , 2005, IEEE Transactions on Evolutionary Computation.

[14] Guillaume Maurice Jean-Bernard Chaslot Chaslot,et al. Monte-Carlo Tree Search , 2010 .

[15] Julian Togelius,et al. Portfolio Online Evolution in StarCraft , 2016, AIIDE.

[16] Risto Miikkulainen,et al. General Video Game Playing , 2013, Artificial and Computational Intelligence in Games.

[17] Sebastian Risi,et al. Continual online evolutionary planning for in-game build order adaptation in StarCraft , 2017, GECCO.

[18] Santiago Ontañón,et al. The Combinatorial Multi-Armed Bandit Problem and Its Application to Real-Time Strategy Games , 2013, AIIDE.

[19] Pieter Spronck,et al. Monte-Carlo Tree Search: A New Framework for Game AI , 2008, AIIDE.

[20] Claude E. Shannon,et al. XXII. Programming a Computer for Playing Chess 1 , 1950 .

[21] Sylvain Gelly,et al. Exploration exploitation in Go: UCT for Monte-Carlo Go , 2006, NIPS 2006.

[22] Simon M. Lucas,et al. Rolling Horizon Coevolutionary planning for two-player video games , 2016, 2016 8th Computer Science and Electronic Engineering (CEEC).

[23] Simon M. Lucas,et al. A Survey of Monte Carlo Tree Search Methods , 2012, IEEE Transactions on Computational Intelligence and AI in Games.

[24] Samir W. Mahfoud. Niching methods for genetic algorithms , 1996 .

[25] Daniele Loiacono,et al. Evolving competitive car controllers for racing games with neuroevolution , 2009, GECCO '09.

[26] Regina Barzilay,et al. Non-Linear Monte-Carlo Search in Civilization II , 2011, IJCAI.

[27] John R. Koza,et al. Genetically breeding populations of computer programs to solve problems in artificial intelligence , 1990, [1990] Proceedings of the 2nd International IEEE Conference on Tools for Artificial Intelligence.

[28] J. Neumann. Zur Theorie der Gesellschaftsspiele , 1928 .

[29] Olivier Teytaud,et al. Grid Coevolution for Adaptive Simulations: Application to the Building of Opening Books in the Game of Go , 2009, EvoWorkshops.

[30] Michael Buro,et al. Portfolio greedy search and simulation for large-scale combat in starcraft , 2013, 2013 IEEE Conference on Computational Inteligence in Games (CIG).

[31] Rémi Coulom,et al. Efficient Selectivity and Backup Operators in Monte-Carlo Tree Search , 2006, Computers and Games.

[32] Bruno Bouzy,et al. Move-Pruning Techniques for Monte-Carlo Go , 2006, ACG.

[33] Julian Togelius,et al. Online Evolution for Multi-action Adversarial Games , 2016, EvoApplications.

[34] Julian Togelius,et al. Script- and cluster-based UCT for StarCraft , 2014, 2014 IEEE Conference on Computational Intelligence and Games.

[35] Pieter Spronck,et al. Monte-Carlo Tree Search in Settlers of Catan , 2009, ACG.

[36] H. Jaap van den Herik,et al. Progressive Strategies for Monte-Carlo Tree Search , 2008 .

[37] Simon M. Lucas,et al. Monte-Carlo Tree Search for the Physical Travelling Salesman Problem , 2012, EvoApplications.

[38] Simon M. Lucas,et al. Analysis of Vanilla Rolling Horizon Evolution Parameters in General Video Game Playing , 2017, EvoApplications.

[39] Mark H. M. Winands,et al. Monte Carlo Tree Search for the Hide-and-Seek Game Scotland Yard , 2012, IEEE Transactions on Computational Intelligence and AI in Games.

[40] Julian Togelius,et al. Neuroevolution in Games: State of the Art and Open Challenges , 2014, IEEE Transactions on Computational Intelligence and AI in Games.

[41] Oren Somekh,et al. Almost Optimal Exploration in Multi-Armed Bandits , 2013, ICML.

[42] Julian Togelius,et al. Super mario evolution , 2009, 2009 IEEE Symposium on Computational Intelligence and Games.