论文信息 - On the Design and Training of Bots to Play Backgammon Variants

On the Design and Training of Bots to Play Backgammon Variants

Recently, a backgammon bot named Palamedes won the first prize in backgammon at the 16th Computer Olympiad. Palamedes is an ongoing work aimed at developing intelligent bots to play a variety of popular backgammon variants. Currently, the Greek variants Portes, Plakoto and Fevga are supported. A different neural network has been designed, trained and evaluated for each one of these variants. This paper presents the details of the architecture and the training procedure for each case. New expert features as inputs to the networks are also introduced, whereas experimental results demonstrate improvement over previous versions of Palamedes.

Ioannis Refanidis | Nikolaos Papahristou | I. Refanidis | Nikolaos Papahristou

[1] Joachim Diederich,et al. Survey and critique of techniques for extracting rules from trained artificial neural networks , 1995, Knowl. Based Syst..

[2] G. Tesauro. Practical Issues in Temporal Difference Learning , 1992 .

[3] Jonathan Schaeffer,et al. *-Minimax Performance in Backgammon , 2004, Computers and Games.

[4] Zongmin Ma,et al. Computers and Games , 2008, Lecture Notes in Computer Science.

[5] Ioannis Refanidis,et al. Improving Temporal Difference Learning Performance in Backgammon Variants , 2011, ACG.

[6] Marco Wiering. Self-Play and Using an Expert to Learn to Play Backgammon with Temporal Difference Learning , 2010, J. Intell. Learn. Syst. Appl..

[7] Richard S. Sutton,et al. Learning to predict by the methods of temporal differences , 1988, Machine Learning.

[8] D. Michie. GAME-PLAYING AND GAME-LEARNING AUTOMATA , 1966 .

[9] Gerald Tesauro,et al. Practical Issues in Temporal Difference Learning , 1992, Mach. Learn..

[10] Gerald Tesauro,et al. Temporal difference learning and TD-Gammon , 1995, CACM.

[11] Tony R. Martinez,et al. The general inefficiency of batch training for gradient descent learning , 2003, Neural Networks.

[12] Ioannis Refanidis,et al. Training Neural Networks to Play Backgammon Variants Using Reinforcement Learning , 2011, EvoApplications.

[13] Gerald Tesauro,et al. Programming backgammon using self-teaching neural nets , 2002, Artif. Intell..