Using Reinforcement Learning in Chess Engines
Raúl Rojas | Erik Cuevas | Daniel Zaldívar | Ketill Gunnarsson | Ernesto Tapia | Marco Block | Maro Bader | Marte Ramírez
[1] Gerald Tesauro, et al. Comparison training of chess evaluation functions, 2001.
[2] Andrew Tridgell, et al. KnightCap: A chess program that learns by combining TD(λ) with minimax search, 1997, ICML 1997.
[3] Richard S. Sutton, et al. Learning to predict by the methods of temporal differences, 1988, Machine Learning.
[4] Arthur L. Samuel, et al. Some Studies in Machine Learning Using the Game of Checkers, 1967, IBM J. Res. Dev.
[5] Andrew Tridgell, et al. Experiments in Parameter Learning Using Temporal Differences, 1998, J. Int. Comput. Games Assoc.
[6] Michael Gherrity, et al. A game-learning machine, 1993.
[7] Byoung-Tak Zhang. Lernen durch Genetisch-Neuronale Evolution: Aktive Anpassung an unbekannte Umgebungen mit selbstentwickelten parallelen Netzwerken, 1992, DISKI.
[8] Sebastian Thrun, et al. Learning to Play the Game of Chess, 1994, NIPS.
[9] Aske Plaat, et al. RESEARCH RE: SEARCH & RE-SEARCH, 1996.
[10] Nikhil Deshpande, et al. Temporal Difference Learning in Chinese Chess, 1998, IEA/AIE.
[11] Gerald Tesauro, et al. Temporal difference learning and TD-Gammon, 1995, CACM.
[12] George F. Luger, et al. Künstliche Intelligenz - Strategien zur Lösung komplexer Probleme (4. Aufl.), 2001.
[13] Raúl Rojas, et al. Theorie der neuronalen Netze, 1993.