论文信息 - An Investigation of an Adaptive Poker Player

An Investigation of an Adaptive Poker Player

Other work has shown that adaptive learning can be highly successful in developing programs which are able to play games at a level similar to human players and, in some cases, exceed the ability of a vast majority of human players. This study uses poker to investigate how adaptation can be used in games of imperfect information. An internal learning value is manipulated which allows a poker playing agent to develop its playing strategy over time. The results suggest that the agent is able to learn how to play poker, initially losing, before winning as the players strategy becomes more developed. The evolved player performs well against opponents with different playing styles. Some limitations of previous work are overcome, such as deal rotation to remove the bias introduced by one player always being the last to act. This work provides encouragement that this is an area worth exploring more fully in our future work.

Graham Kendall | Mark Willdig | G. Kendall | Mark Willdig

[1] Arthur L. Samuel,et al. Some Studies in Machine Learning Using the Game of Checkers , 1967, IBM J. Res. Dev..

[2] Nicholas V. Findler,et al. Studies in machine cognition using the game of poker , 1977, CACM.

[3] Luigi Barone,et al. Evolving Computer Opponents to Play a Game of Simplified Poker , 1997 .

[4] Jonathan Schaeffer,et al. One jump ahead - challenging human supremacy in checkers , 1997, J. Int. Comput. Games Assoc..

[5] Avi Pfeffer,et al. Representations and Solutions for Game-Theoretic Problems , 1997, Artif. Intell..

[6] Jonathan Schaeffer. One Jump Ahead , 1997 .

[7] Scott Hamilton,et al. Deep Blue's Hardware-Software Synergy , 1997, Computer.

[8] Jonathan Schaeffer,et al. Poker as Testbed for AI Research , 1998, Canadian Conference on AI.

[9] Jonathan Schaeffer,et al. Opponent Modeling in Poker , 1998, AAAI/IAAI.

[10] Luigi Barone,et al. An adaptive learning model for simplified poker using evolutionary algorithms , 1999, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406).

[11] Jonathan Schaeffer,et al. Learning to Play Strong Poker , 1999, ICML 1999.

[12] Jonathan Schaeffer,et al. Using Probabilistic Knowledge and Simulation to Play Poker , 1999, AAAI/IAAI.

[13] R. Lyndon While,et al. Adaptive Learning for Poker , 2000, GECCO.

[14] David B. Fogel,et al. Anaconda defeats Hoyle 6-0: a case study competing an evolved checkers program against commercially available software , 2000, Proceedings of the 2000 Congress on Evolutionary Computation. CEC00 (Cat. No.00TH8512).

[15] Arthur L. Samuel,et al. Some studies in machine learning using the game of checkers , 2000, IBM J. Res. Dev..

[16] Graham Kendall,et al. An evolutionary approach for the tuning of a chess evaluation function using population dynamics , 2001, Proceedings of the 2001 Congress on Evolutionary Computation (IEEE Cat. No.01TH8546).