论文信息 - Regret Minimization in Games and the Development of Champion Multiplayer Computer Poker-Playing Agents - 字舞流文

Regret Minimization in Games and the Development of Champion Multiplayer Computer Poker-Playing Agents

Richard G. Gibson

[1] Martin Zinkevich,et al. The Annual Computer Poker Competition , 2013, AI Mag..

[2] Nathan R. Sturtevant,et al. A parameterized family of equilibrium profiles for three-player kuhn poker , 2013, AAMAS.

[3] Michael H. Bowling,et al. Evaluating state-space abstractions in extensive-form games , 2013, AAMAS.

[4] Michael Johanson,et al. Measuring the Size of Large No-Limit Poker Games , 2013, ArXiv.

[5] Michael H. Bowling,et al. Monte carlo sampling and regret minimization for equilibrium computation and decision-making in large extensive form games , 2013 .

[6] Duane Szafron,et al. Efficient Monte Carlo Counterfactual Regret Minimization in Games with Many Player Actions , 2012, NIPS.

[7] Duane Szafron,et al. Generalized Sampling and Variance in Counterfactual Regret Minimization , 2012, AAAI.

[8] Michael H. Bowling,et al. Finding Optimal Abstract Strategies in Extensive-Form Games , 2012, AAAI.

[9] Michael H. Bowling,et al. Efficient Nash equilibrium approximation through Monte Carlo counterfactual regret minimization , 2012, AAMAS.

[10] Michael H. Bowling,et al. No-Regret Learning in Extensive-Form Games with Imperfect Recall , 2012, ICML.

[11] Duane Szafron,et al. On Strategy Stitching in Large Extensive Form Multiplayer Games , 2011, NIPS.

[12] Kevin Waugh,et al. Accelerating Best Response Calculation in Large Extensive Games , 2011, IJCAI.

[13] Duane Szafron,et al. Regret Minimization in Multiplayer Extensive Games , 2011, IJCAI.

[14] Tuomas Sandholm,et al. Computing equilibria by incorporating qualitative models? , 2010, AAMAS.

[15] Duane Szafron,et al. Using counterfactual regret minimization to create competitive multiplayer poker agents , 2010, AAMAS.

[16] Javier Peña,et al. Smoothing Techniques for Computing Nash Equilibria of Sequential Games , 2010, Math. Oper. Res..

[17] Ludovic Renou,et al. Minimax Regret and Strategic Uncertainty , 2008, J. Econ. Theory.

[18] Kevin Waugh,et al. Strategy Grafting in Extensive Games , 2009, NIPS.

[19] Kevin Waugh,et al. Monte Carlo Sampling for Regret Minimization in Extensive Games , 2009, NIPS.

[20] Kevin Waugh,et al. Abstraction in Large Extensive Games , 2009 .

[21] Kevin Waugh,et al. A Practical Use of Imperfect Recall , 2009, SARA.

[22] Michael H. Bowling,et al. Probabilistic State Translation in Extensive Games with Large Action Sets , 2009, IJCAI.

[23] Joseph Y. Halpern,et al. Iterated Regret Minimization: A New Solution Concept , 2009, IJCAI.

[24] Tuomas Sandholm,et al. Computing Equilibria in Multiplayer Stochastic Games of Imperfect Information , 2009, IJCAI.

[25] Miroslav Dudík,et al. A Sampling-Based Approach to Computing Equilibria in Succinct Extensive-Form Games , 2009, UAI.

[26] Kevin Waugh,et al. Abstraction pathologies in extensive games , 2009, AAMAS.

[27] Bernhard von Stengel,et al. Extensive-Form Correlated Equilibrium: Definition and Computational Complexity , 2008, Math. Oper. Res..

[28] David Silver,et al. Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence (2008) Achieving Master Level Play in 9 × 9 Computer Go , 2022 .

[29] Duane Szafron,et al. Strategy evaluation in extensive games with importance sampling , 2008, ICML '08.

[30] Tuomas Sandholm,et al. Computing an approximate jam/fold equilibrium for 3-player no-limit Texas Hold'em tournaments , 2008, AAMAS.

[31] Troels Bjerre Lund,et al. A heads-up no-limit Texas Hold'em poker player: discretized betting models and automatically generated equilibrium-finding programs , 2008, AAMAS.

[32] Bret Hoehn,et al. Effective short-term opponent exploitation in simplified poker , 2005, Machine Learning.

[33] Michael H. Bowling,et al. Regret Minimization in Games with Incomplete Information , 2007, NIPS.

[34] Troels Bjerre Lund,et al. Potential-Aware Automated Abstraction of Sequential Games, and Holistic Equilibrium Analysis of Texas Hold'em Poker , 2007, AAAI.

[35] Tuomas Sandholm,et al. Better automated abstraction techniques for imperfect information games, with application to Texas Hold'em poker , 2007, AAMAS '07.

[36] Y. Mansour,et al. Algorithmic Game Theory: Learning, Regret Minimization, and Equilibria , 2007 .

[37] Geoffrey J. Gordon. No-regret Algorithms for Online Convex Programs , 2006, NIPS.

[38] Xiaotie Deng,et al. Settling the Complexity of Two-Player Nash Equilibrium , 2006, 2006 47th Annual IEEE Symposium on Foundations of Computer Science (FOCS'06).

[39] Paul W. Goldberg,et al. The complexity of computing a Nash equilibrium , 2006, STOC '06.

[40] Zheng Li,et al. Bounds for Regret-Matching Algorithms , 2006, AI&M.

[41] Michael H. Bowling,et al. Bayes' Bluff: Opponent Modelling in Poker , 2005, UAI 2005.

[42] Tuomas Sandholm,et al. Optimal Rhode Island Hold'em Poker , 2005, AAAI.

[43] Vincent Conitzer,et al. Complexity of (iterated) dominance , 2005, EC '05.

[44] Xi Chen,et al. 3-NASH is PPAD-Complete , 2005, Electron. Colloquium Comput. Complex..

[45] Christos H. Papadimitriou,et al. Three-Player Games Are Hard , 2005, Electron. Colloquium Comput. Complex..

[46] Shlomo Zilberstein,et al. Dynamic Programming for Partially Observable Stochastic Games , 2004, AAAI.

[47] Jonathan Schaeffer,et al. The challenge of poker , 2002, Artif. Intell..

[48] Jonathan Schaeffer,et al. Opponent Modeling in Poker , 1998, AAAI/IAAI.

[49] Jonathan Schaeffer,et al. Poker as Testbed for AI Research , 1998, Canadian Conference on AI.

[50] S. Hart,et al. A simple adaptive procedure leading to correlated equilibrium , 2000 .

[51] Michael Buro,et al. The Othello Match of the Year: Takeshi Murakami vs. Logistello , 1997, J. Int. Comput. Games Assoc..

[52] Ariel Rubinstein,et al. On the Interpretation of Decision Problems with Imperfect Recall , 1996, TARK.

[53] M. Kaneko,et al. Behavior strategies, mixed strategies and perfect recall , 1995 .

[54] Ariel Rubinstein,et al. A Course in Game Theory , 1995 .

[55] Bernhard von Stengel,et al. Fast algorithms for finding randomized strategies in game trees , 1994, STOC '94.

[56] Eitan Zemel,et al. The Complexity of Eliminating Dominated Strategies , 1993, Math. Oper. Res..

[57] D. Koller,et al. The complexity of two-person zero-sum games in extensive form , 1992 .

[58] D. Knuth,et al. A note on strategy elimination in bimatrix games , 1988 .

[59] Ken Thompson,et al. Retrograde Analysis of Certain Endgames , 1986, J. Int. Comput. Games Assoc..