Programming backgammon using self-teaching neural nets

[1]  Jordan B. Pollack,et al.  Co-Evolution in the Successful Learning of Backgammon Strategy , 1998, Machine Learning.

[2]  Gerald Tesauro,et al.  Practical issues in temporal difference learning , 1992, Machine Learning.

[3]  Jonathan Schaeffer,et al.  The games computers (and people) play , 2000, Adv. Comput..

[4]  John N. Tsitsiklis,et al.  Call admission control and routing in integrated services networks using neuro-dynamic programming , 2000, IEEE Journal on Selected Areas in Communications.

[5]  Michael Buro Efficient Approximation of Backgammon Race Equities , 1999, J. Int. Comput. Games Assoc..

[6]  Matthew Saffell,et al.  Reinforcement Learning for Trading Systems and Portfolios , 1998, KDD.

[7]  Andrew W. Moore,et al.  Value Function Based Production Scheduling , 1998, ICML.

[8]  Andrew Tridgell,et al.  KnightCap: A Chess Programm That Learns by Combining TD(lambda) with Game-Tree Search , 1998, ICML.

[9]  Sridhar Mahadevan,et al.  Optimizing Production Manufacturing Using Reinforcement Learning , 1998, FLAIRS.

[10]  Richard S. Sutton,et al.  Introduction to Reinforcement Learning , 1998 .

[11]  John Moody,et al.  Reinforcement Learning for Trading Systems and Portfolios: Immediate vs Future Rewards , 1998 .

[12]  Andrew Tridgell,et al.  KnightCap: A chess program that learns by combining TD( ) with minimax search , 1997, ICML 1997.

[13]  Gerald Tesauro,et al.  On-line Policy Improvement using Monte-Carlo Search , 1996, NIPS.

[14]  Dimitri P. Bertsekas,et al.  Reinforcement Learning for Dynamic Channel Allocation in Cellular Telephone Systems , 1996, NIPS.

[15]  Thomas G. Dietterich,et al.  High-Performance Job-Shop Scheduling With A Time-Delay TD(λ) Network , 1995, NIPS 1995.

[16]  Andrew G. Barto,et al.  Improving Elevator Performance Using Reinforcement Learning , 1995, NIPS.

[17]  Gerald Tesauro,et al.  Temporal difference learning and TD-Gammon , 1995, CACM.

[18]  Gerald Tesauro,et al.  Neurogammon Wins Computer Olympiad , 1989, Neural Computation.

[19]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[20]  L. Victor Allis,et al.  A Knowledge-Based Approach of Connect-Four , 1988, J. Int. Comput. Games Assoc..

[21]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[22]  Norman Zadeh,et al.  On Optimal Doubling in Backgammon , 1977 .

[23]  Nils J. Nilsson,et al.  Artificial Intelligence , 1974, IFIP Congress.

[24]  John R. Crawford,et al.  The Backgammon Book , 1970 .

[25]  Arthur L. Samuel,et al.  Some Studies in Machine Learning Using the Game of Checkers , 1967, IBM J. Res. Dev..

[26]  Jörg Bewersdorff,et al.  Backgammon , 2022, Luck, Logic, and White Lies.