DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker

Artificial intelligence has seen a number of breakthroughs in recent years, with games often serving as significant milestones. A common feature of games with these successes is that they involve information symmetry among the players, where all players have identical information. This property of perfect information, though, is far more common in games than in real-world problems. Poker is the quintessential game of imperfect information, and it has been a longstanding challenge problem in artificial intelligence. In this paper we introduce DeepStack, a new algorithm for imperfect information settings such as poker. It combines recursive reasoning to handle information asymmetry, decomposition to focus computation on the relevant decision, and a form of intuition about arbitrary poker situations that is automatically learned from selfplay games using deep learning. In a study involving dozens of participants and 44,000 hands of poker, DeepStack becomes the first computer program to beat professional poker players in heads-up no-limit Texas hold’em. Furthermore, we show this approach dramatically reduces worst-case exploitability compared to the abstraction paradigm that has been favored for over a decade.

[1]  O. Bagasra,et al.  Proceedings of the National Academy of Sciences , 1914, Science.

[2]  E. Parzen Annals of Mathematical Statistics , 1962 .

[3]  Nils J. Nilsson,et al.  Artificial Intelligence , 1974, IFIP Congress.

[4]  R. J. Joenk,et al.  IBM journal of research and development: information for authors , 1978 .

[5]  Michael I. Jordan,et al.  Advances in Neural Information Processing Systems 30 , 1995 .

[6]  L. V. Allis,et al.  Searching for solutions in games and artificial intelligence , 1994 .

[7]  J. V. Rijswijck,et al.  THE SECOND INTERNATIONAL CONFERENCE ON COMPUTERS AND GAMES , 2001 .

[8]  News Item , 2004, Acta Neuropathologica.

[9]  PROCEssIng magazInE IEEE Signal Processing Magazine , 2004 .

[10]  Christos H. Papadimitriou,et al.  Proceedings of the 4th International Workshop on Internet and Network Economics , 2008 .

[11]  R. Rosenfeld Nature , 2009, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[12]  Daniel Gooch,et al.  Communications of the ACM , 2011, XRDS.

[13]  Michael Johanson,et al.  Measuring the Size of Large No-Limit Poker Games , 2013, ArXiv.

[14]  Joseph Y. Halpern,et al.  Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence , 2014, AAAI 2014.

[15]  Qiang Yang,et al.  Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, IJCAI 2015, Buenos Aires, Argentina, 25-31 July 2015 , 2015, IJCAI 2015.

[16]  Charles Sutton,et al.  Proceedings for the 5th International Conference on Learning Representations , 2017 .

[17]  L. Christophorou Science , 2018, Emerging Dynamics: Science, Energy, Society and Values.