Temporal difference learning applied to game playing and the results of application to Shogi
暂无分享,去创建一个
[1] Andrew Tridgell,et al. KnightCap: A Chess Programm That Learns by Combining TD(lambda) with Game-Tree Search , 1998, ICML.
[2] Donald F. Beal,et al. Learning Piece Values Using Temporal Differences , 1997, J. Int. Comput. Games Assoc..
[3] Hiroyuki Iida,et al. Natural Developments in Game Research , 1996, J. Int. Comput. Games Assoc..
[4] Gerald Tesauro,et al. TD-Gammon, a Self-Teaching Backgammon Program, Achieves Master-Level Play , 1994, Neural Computation.
[5] Christian Donninger,et al. Null Move and Deep Search , 1993, J. Int. Comput. Games Assoc..
[6] Tony Marsland,et al. COMPUTER CHESS AND SEARCH , 1992 .
[7] Robert Levinson,et al. Adaptive Pattern-Oriented Chess , 1991, AAAI Conference on Artificial Intelligence.
[8] Richard E. Korf,et al. A Unified Theory of Heuristic Evaluation Functions and its Application to Learning , 1986, AAAI.
[9] J. Fairbairn. Shogi for beginners , 1984 .
[10] Arthur L. Samuel,et al. Some Studies in Machine Learning Using the Game of Checkers , 1967, IBM J. Res. Dev..