Performance Evaluation Gaps in a Real-Time Strategy Game Between Human and Artificial Intelligence Players

Since 2010, annual StarCraft artificial intelligence (AI) competitions have promoted the development of successful AI players for complex real-time strategy games. In these competitions, AI players are ranked based on their win ratio over thousands of head-to-head matches. Although simple and easily implemented, this evaluation scheme may less adequately help develop more human-competitive AI players. In this paper, we recruited 45 human StarCraft players at different expertise levels (expert/medium/novice) and asked them to play against the 18 top AI players selected from the five years of competitions (2011–2015). The results show that the human evaluations of AI players differ substantially from the current standard evaluation and ranking method. In fact, from a human standpoint, there has been little progress in the quality of StarCraft AI players over the years. It is even possible that AI-only tournaments can lead to AIs being created that are unacceptable competitors for humans. This paper is the first to systematically explore the human evaluation of AI players, the evolution of AI players, and the differences between human perception and tournament-based evaluations. The discoveries from this paper can support AI developers in game companies and AI tournament organizers to better incorporate the perspective of human users into their AI systems.

[1]  Julian Togelius,et al.  General Video Game AI: Competition, Challenges and Opportunities , 2016, AAAI.

[2]  Julian Togelius,et al.  How to Run a Successful Game-Based AI Competition , 2016, IEEE Transactions on Computational Intelligence and AI in Games.

[3]  Gifford Cheung,et al.  Starcraft from the stands: understanding the game spectator , 2011, CHI.

[4]  Julian Togelius,et al.  AI-based Game Design Patterns , 2015, FDG.

[5]  Julian Togelius,et al.  The 2009 Simulated Car Racing Championship , 2010, IEEE Transactions on Computational Intelligence and AI in Games.

[6]  Philip Hingston,et al.  A Turing Test for Computer Game Bots , 2009, IEEE Transactions on Computational Intelligence and AI in Games.

[7]  Glen Robertson,et al.  A Review of Real-Time Strategy Game AI , 2014, AI Mag..

[8]  Ian Millington,et al.  Artificial Intelligence for Games , 2006, The Morgan Kaufmann series in interactive 3D technology.

[9]  C. Martin 2015 , 2015, Les 25 ans de l’OMC: Une rétrospective en photos.

[10]  Santiago Ontañón,et al.  A Survey of Real-Time Strategy Game AI Research and Competition in StarCraft , 2013, IEEE Transactions on Computational Intelligence and AI in Games.

[11]  Julian Togelius,et al.  The 2010 Mario AI Championship: Level Generation Track , 2011, IEEE Transactions on Computational Intelligence and AI in Games.

[12]  Mark O. Riedl,et al.  AI for game production , 2013, 2013 IEEE Conference on Computational Inteligence in Games (CIG).