论文信息 - Using Double-Oracle Method and Serialized Alpha-Beta Search for Pruning in Simultaneous Move Games

Using Double-Oracle Method and Serialized Alpha-Beta Search for Pruning in Simultaneous Move Games

We focus on solving two-player zero-sum extensive-form games with perfect information and simultaneous moves. In these games, both players fully observe the current state of the game where they simultaneously make a move determining the next state of the game. We solve these games by a novel algorithm that relies on two components: (1) it iteratively solves the games that correspond to a single simultaneous move using a double-oracle method, and (2) it prunes the states of the game using bounds on the sub-game values obtained by the classical Alpha-Beta search on a serialized variant of the game. We experimentally evaluate our algorithm on the Goofspiel card game, a pursuit-evasion game, and randomly generated games. The results show that our novel algorithm typically provides significant running-time improvements and reduction in the number of evaluated nodes compared to the full search algorithm.

[1] Avrim Blum,et al. Planning in the Presence of Cost Functions Controlled by an Adversary , 2003, ICML.

[2] U. Dulleck,et al. μ-σ Games , 2012, Games.

[3] Dana S. Nau,et al. An Analysis of Forward Pruning , 1994, AAAI.

[4] Oriol Carbonell-Nicolau. Games and Economic Behavior , 2011 .

[5] Michael Buro,et al. Solving the Oshi-Zumo Game , 2003, ACG.

[6] D. Koller,et al. Efficient Computation of Equilibria for Extensive Two-Person Games , 1996 .

[7] Michael L. Littman,et al. Markov Games as a Framework for Multi-Agent Reinforcement Learning , 1994, ICML.

[8] Ariel Rubinstein,et al. A Course in Game Theory , 1995 .

[9] Peter Norvig,et al. Artificial Intelligence: A Modern Approach , 1995 .

[10] Olivier Teytaud,et al. Upper Confidence Trees with Short Term Partial Information , 2011, EvoApplications.

[11] B. Stengel,et al. Efficient Computation of Behavior Strategies , 1996 .

[12] Michael Buro,et al. Heuristic Search Applied to Abstract Combat Games , 2005, Canadian Conference on AI.

[13] Laurent Bartholdi,et al. Computer Solution to the Game of Pure Strategy , 2012, Games.

[14] Branislav Bosanský,et al. Extending Security Games to Defenders with Constrained Mobility , 2012, AAAI Spring Symposium: Game Theory for Security, Sustainability, and Health.

[15] Yoav Shoham,et al. Multiagent Systems - Algorithmic, Game-Theoretic, and Logical Foundations , 2009 .

[16] Vincent Conitzer,et al. A double oracle algorithm for zero-sum security games on graphs , 2011, AAMAS.

[17] Milind Tambe,et al. Security and Game Theory - Algorithms, Deployed Systems, Lessons Learned , 2011 .

[18] Aaas News,et al. Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[19] Duane Szafron,et al. Efficient Monte Carlo Counterfactual Regret Minimization in Games with Many Player Actions , 2012, NIPS.

[20] Martin Müller,et al. Fuego—An Open-Source Framework for Board Games and Go Engine Based on Monte Carlo Tree Search , 2010, IEEE Transactions on Computational Intelligence and AI in Games.

[21] Michael Buro,et al. Alpha-Beta Pruning for Games with Simultaneous Moves , 2012, AAAI.

[22] Michael R. Genesereth,et al. General Game Playing: Overview of the AAAI Competition , 2005, AI Mag..

[23] Milind Tambe. Security and Game Theory: EFFICIENT ALGORITHMS FOR MASSIVE SECURITY GAMES , 2011 .

[24] Ruck Thawonmas,et al. Monte Carlo Tree Search for Collaboration Control of Ghosts in Ms. Pac-Man , 2013, IEEE Transactions on Computational Intelligence and AI in Games.