论文信息 - Solving the Dice Game Pig : an introduction to dynamic programming and value iteration

Solving the Dice Game Pig : an introduction to dynamic programming and value iteration

For such a simple dice game, one might expect a simple optimal strategy, such as in Blackjack (e.g., “stand on 17” under certain circumstances, etc.). As we shall see, this simple dice game yields a much more complex and intriguing optimal policy. In our exploration of Pig we will learn about dynamic programming and value iteration, covering fundamental concepts of reinforcement learning techniques. For the interested reader, there is a companion Game of Pig website that features an optimal Pig computer player, VRML visualizations of the optimal policy, and information about Pig and its variants.

[1] Sean R Eddy,et al. What is dynamic programming? , 2004, Nature Biotechnology.

[2] Clifton G. M. Presser,et al. Optimal Play of the Dice Game Pig , 2004 .

[3] Michael L. Littman,et al. Markov Games as a Framework for Multi-Agent Reinforcement Learning , 1994, ICML.

[4] Peter Norvig,et al. Artificial Intelligence: A Modern Approach , 1995 .

[5] Reiner Knizia. Dice Games Properly Explained , 2010 .

[6] E. Rowland. Theory of Games and Economic Behavior , 1946, Nature.

[7] Ruma Falk,et al. THINK: A Game of Choice and Chance. , 1999 .

[8] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[9] Jason A. Osborne. Markov Chains for the RISK Board Game Revisited , 2003 .

[10] Dan Brutlag. Choice and Chance in Life: The Game of "Skunk.". , 1994 .

[11] Dimitri P. Bertsekas,et al. Dynamic Programming: Deterministic and Stochastic Models , 1987 .