论文信息 - A Unification of Extensive-Form Games and Markov Decision Processes

A Unification of Extensive-Form Games and Markov Decision Processes

We describe a generalization of extensive-form games that greatly increases representational power while still allowing efficient computation in the zero-sum setting. A principal feature of our generalization is that it places arbitrary convex optimization problems at decision nodes, in place of the finite action sets typically considered. The possibly-infinite action sets mean we must "forget" the exact action taken (feasible solution to the optimization problem), remembering instead only some statistic sufficient for playing the rest of the game optimally. Our new model provides an exponentially smaller representation for some games; in particular, we show how to compactly represent (and solve) extensive-form games with outcome uncertainty and a generalization of Markov decision processes to multi-stage adversarial planning games.

Geoffrey J. Gordon | H. Brendan McMahan | H. B. McMahan

[1] D. Koller,et al. The complexity of two-person zero-sum games in extensive form , 1992 .

[2] Geoffrey J. Gordon,et al. Robust planning in domains with stochastic outcomes, adversaries, and partial observability , 2006 .

[3] S. Karlin,et al. SOLUTIONS OF CONVEX GAMES AS FIXED-POINTS, , 1951 .

[4] Marek Petrik,et al. Average-Reward Decentralized Markov Decision Processes , 2007, IJCAI.

[5] Avrim Blum,et al. Planning in the Presence of Cost Functions Controlled by an Adversary , 2003, ICML.

[6] Bernhard von Stengel,et al. Fast algorithms for finding randomized strategies in game trees , 1994, STOC '94.

[7] Peter Bro Miltersen,et al. Computing sequential equilibria for two-player games , 2006, SODA '06.