Reinforcement Learning of Local Shape in the Game of Go
Richard S. Sutton | David Silver | Martin Müller
[1] Albert L. Zobrist, et al. A New Hashing Method with Application for Game Playing, 1990.
[2] Jonathan Schaeffer, et al. A World Championship Caliber Checkers Program, 1992, Artif. Intell.
[3] Terrence J. Sejnowski, et al. Temporal Difference Learning of Position Evaluation in the Game of Go, 1993, NIPS.
[4] Gerald Tesauro, et al. TD-Gammon, a Self-Teaching Backgammon Program, Achieves Master-Level Play, 1994, Neural Computation.
[5] Thomas G. Dietterich. What is machine learning?, 2020, Archives of Disease in Childhood.
[6] Ken Chen, et al. Machine Learning, Game Play, and Go, 1998.
[7] Michael Buro, et al. From Simple Features to Sophisticated Evaluation Functions, 1998, Computers and Games.
[8] Andrew Tridgell, et al. Experiments in Parameter Learning Using Temporal Differences, 1998, J. Int. Comput. Games Assoc.
[9] Murray Campbell, et al. Deep Blue, 2002, Artif. Intell.
[10] Martin Müller, et al. Computer Go, 2002, Artif. Intell.
[11] Brian Sheppard, et al. World-championship-caliber Scrabble, 2002, Artif. Intell.
[12] Eric O. Postma, et al. Local Move Prediction in Go, 2002, Computers and Games.
[13] Markus Enzenberger, et al. Evaluation in Go by a Neural Network using Soft Segmentation, 2003, ACG.
[14] Richard S. Sutton, et al. Learning to predict by the methods of temporal differences, 1988, Machine Learning.
[15] Nathan R. Sturtevant, et al. Feature Construction for Reinforcement Learning in Hearts, 2006, Computers and Games.