Temporal difference learning and TD-Gammon
[1] Claude E. Shannon, et al. Programming a computer for playing chess, 1950.
[2] Norman Zadeh, et al. On Optimal Doubling in Backgammon, 1977.
[3] Geoffrey E. Hinton, et al. Learning internal representations by error propagation, 1986.
[4] James L. McClelland, et al. Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations, 1986.
[5] Gerald Tesauro, et al. Neurogammon Wins Computer Olympiad, 1989, Neural Computation.
[6] Christian Lebiere, et al. The Cascade-Correlation Learning Architecture, 1989, NIPS.
[7] Kurt Hornik, et al. Multilayer feedforward networks are universal approximators, 1989, Neural Networks.
[8] Paul E. Utgoff, et al. Automatic Feature Generation for Problem Solving Systems, 1992, ML.
[9] Gerald Tesauro, et al. Practical Issues in Temporal Difference Learning, 1992, Mach. Learn.
[10] Terrence J. Sejnowski, et al. Temporal Difference Learning of Position Evaluation in the Game of Go, 1993, NIPS.