论文信息 - High-level reinforcement learning in strategy games

High-level reinforcement learning in strategy games

Video games provide a rich testbed for artificial intelligence methods. In particular, creating automated opponents that perform well in strategy games is a difficult task. For instance, human players rapidly discover and exploit the weaknesses of hard coded strategies. To build better strategies, we suggest a reinforcement learning approach for learning a policy that switches between high-level strategies. These strategies are chosen based on different game situations and a fixed opponent strategy. Our learning agents are able to rapidly adapt to fixed opponents and improve deficiencies in the hard coded strategies, as the results demonstrate.

Guy Shani | Christopher Amato | Guy Shani | Chris Amato

[1] D. Stahl,et al. On Players' Models of Other Players: Theory and Experimental Evidence , 1995 .

[2] Richard S. Sutton,et al. Dyna, an integrated architecture for learning, planning, and reacting , 1990, SGAR.

[3] John E. Laird,et al. Human-Level AI's Killer Application: Interactive Computer Games , 2000, AI Mag..

[4] Ben J. A. Kröse,et al. Learning from delayed rewards , 1995, Robotics Auton. Syst..

[5] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[6] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .

[7] Peter Dayan,et al. Q-learning , 1992, Machine Learning.

[8] Michael Buro,et al. Adversarial Planning Through Strategy Simulation , 2007, 2007 IEEE Symposium on Computational Intelligence and Games.

[9] Peter Stone,et al. Generalized model learning for reinforcement learning in factored domains , 2009, AAMAS.

[10] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[11] Robert H. Crites,et al. Multiagent reinforcement learning in the Iterated Prisoner's Dilemma. , 1996, Bio Systems.

[12] Alan Fern,et al. Online Planning for Resource Production in Real-Time Strategy Games , 2007, ICAPS.

[13] Ian D. Watson,et al. Using reinforcement learning for city site selection in the turn-based strategy game Civilization IV , 2008, 2008 IEEE Symposium On Computational Intelligence and Games.

[14] Thore Graepel,et al. LEARNING TO FIGHT , 2004 .

[15] Peter Dayan,et al. Technical Note: Q-Learning , 2004, Machine Learning.

[16] Jeffrey O. Kephart,et al. Pricing in Agent Economies Using Multi-Agent Q-Learning , 2002, Autonomous Agents and Multi-Agent Systems.

[17] Alan Fern,et al. UCT for Tactical Assault Planning in Real-Time Strategy Games , 2009, IJCAI.