On the Design and Training of Bots to Play Backgammon Variants
暂无分享,去创建一个
[1] Joachim Diederich,et al. Survey and critique of techniques for extracting rules from trained artificial neural networks , 1995, Knowl. Based Syst..
[2] G. Tesauro. Practical Issues in Temporal Difference Learning , 1992 .
[3] Jonathan Schaeffer,et al. *-Minimax Performance in Backgammon , 2004, Computers and Games.
[4] Zongmin Ma,et al. Computers and Games , 2008, Lecture Notes in Computer Science.
[5] Ioannis Refanidis,et al. Improving Temporal Difference Learning Performance in Backgammon Variants , 2011, ACG.
[6] Marco Wiering. Self-Play and Using an Expert to Learn to Play Backgammon with Temporal Difference Learning , 2010, J. Intell. Learn. Syst. Appl..
[7] Richard S. Sutton,et al. Learning to predict by the methods of temporal differences , 1988, Machine Learning.
[8] D. Michie. GAME-PLAYING AND GAME-LEARNING AUTOMATA , 1966 .
[9] Gerald Tesauro,et al. Practical Issues in Temporal Difference Learning , 1992, Mach. Learn..
[10] Gerald Tesauro,et al. Temporal difference learning and TD-Gammon , 1995, CACM.
[11] Tony R. Martinez,et al. The general inefficiency of batch training for gradient descent learning , 2003, Neural Networks.
[12] Ioannis Refanidis,et al. Training Neural Networks to Play Backgammon Variants Using Reinforcement Learning , 2011, EvoApplications.
[13] Gerald Tesauro,et al. Programming backgammon using self-teaching neural nets , 2002, Artif. Intell..