Self-Adaptive Rolling Horizon Evolutionary Algorithms for General Video Game Playing

For general video game playing agents, the biggest challenge is adapting to the wide variety of situations they encounter and responding appropriately. Recent work has had some success by modifying an agent’s search-control parameters on-line, during a single play-through of a game. We propose adapting such methods to Rolling Horizon Evolutionary Algorithms, which have shown high performance in many different environments, and test the effect of on-line adaptation on the agent’s win rate. On-line tuned agents achieve results comparable to the state of the art, including first win rates in hard problems, while employing a more general and highly adaptive approach. We also provide further insight into the algorithm itself, through statistics gathered during the tuning process, and highlight key parameter choices.
