论文信息 - Potential-Aware Imperfect-Recall Abstraction with Earth Mover's Distance in Imperfect-Information Games - 字舞流文

Potential-Aware Imperfect-Recall Abstraction with Earth Mover's Distance in Imperfect-Information Games

There is often a large disparity between the size of a game we wish to solve and the size of the largest instances solvable by the best algorithms; for example, a popular variant of poker has about 10165 nodes in its game tree, while the currently best approximate equilibrium-finding algorithms scale to games with around 1012 nodes. In order to approximate equilibrium strategies in these games, the leading approach is to create a sufficiently small strategic approximation of the full game, called an abstraction, and to solve that smaller game instead. The leading abstraction algorithm for imperfect-information games generates abstractions that have imperfect recall and are distribution aware, using k-means with the earth mover's distance metric to cluster similar states together. A distribution-aware abstraction groups states together at a given round if their full distributions over future strength are similar (as opposed to, for example, just the expectation of their strength). The leading algorithm considers distributions over future strength at the final round of the game. However, one might benefit by considering the trajectory of distributions over strength in all future rounds, not just the final round. An abstraction algorithm that takes all future rounds into account is called potential aware. We present the first algorithm for computing potential-aware imperfect-recall abstractions using earth mover's distance. Experiments on no-limit Texas Hold'em show that our algorithm improves performance over the previously best approach.

Tuomas Sandholm | Sam Ganzfried | T. Sandholm | Sam Ganzfried

[1] Tuomas Sandholm,et al. Regret Transfer and Parameter Optimization , 2014, AAAI.

[2] Tuomas Sandholm,et al. A Competitive Texas Hold'em Poker Player via Automated Abstraction and Real-Time Equilibrium Computation , 2006, AAAI.

[3] Tuomas Sandholm,et al. Extensive-form game abstraction with bounds , 2014, EC.

[4] Kevin Waugh,et al. A Practical Use of Imperfect Recall , 2009, SARA.

[5] Michael H. Bowling,et al. Evaluating state-space abstractions in extensive-form games , 2013, AAMAS.

[6] Michael Werman,et al. A Linear Time Histogram Metric for Improved SIFT Matching , 2008, ECCV.

[7] Michael H. Bowling,et al. No-Regret Learning in Extensive-Form Games with Imperfect Recall , 2012, ICML.

[8] Michael H. Bowling,et al. Regret Minimization in Games with Incomplete Information , 2007, NIPS.

[9] Troels Bjerre Lund,et al. Potential-Aware Automated Abstraction of Sequential Games, and Holistic Equilibrium Analysis of Texas Hold'em Poker , 2007, AAAI.

[10] Charles Elkan,et al. Using the Triangle Inequality to Accelerate k-Means , 2003, ICML.

[11] Javier Peña,et al. Smoothing Techniques for Computing Nash Equilibria of Sequential Games , 2010, Math. Oper. Res..

[12] Duane Szafron,et al. Using Sliding Windows to Generate Action Abstractions in Extensive-Form Games , 2012, AAAI.

[13] Michael L. Littman,et al. Abstraction Methods for Game Theoretic Poker , 2000, Computers and Games.

[14] Sergei Vassilvitskii,et al. k-means++: the advantages of careful seeding , 2007, SODA '07.

[15] Jonathan Schaeffer,et al. Approximating Game-Theoretic Optimal Strategies for Full-scale Poker , 2003, IJCAI.

[16] Tuomas Sandholm,et al. Expectation-Based Versus Potential-Aware Automated Abstraction in Imperfect Information Games: An Experimental Comparison Using Poker , 2008, AAAI.

[17] Troels Bjerre Lund,et al. A heads-up no-limit Texas Hold'em poker player: discretized betting models and automatically generated equilibrium-finding programs , 2008, AAMAS.

[18] Michael Werman,et al. Fast and robust Earth Mover's Distances , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[19] Ian Frank,et al. Revised Papers from the Second International Conference on Computers and Games , 2000 .

[20] Kevin Waugh,et al. Abstraction pathologies in extensive games , 2009, AAMAS.

[21] Kevin Waugh,et al. Monte Carlo Sampling for Regret Minimization in Extensive Games , 2009, NIPS.

[22] Tuomas Sandholm,et al. Lossless abstraction of imperfect information games , 2007, JACM.

[23] Michael Johanson,et al. Measuring the Size of Large No-Limit Poker Games , 2013, ArXiv.

[24] Tuomas Sandholm,et al. Better automated abstraction techniques for imperfect information games, with application to Texas Hold'em poker , 2007, AAMAS '07.

[25] Duane Szafron,et al. Automated Action Abstraction of Imperfect Information Extensive-Form Games , 2011, AAAI.

[26] Tuomas Sandholm,et al. Lossy stochastic game abstraction with bounds , 2012, EC '12.