Programming backgammon using self-teaching neural nets
暂无分享,去创建一个
[1] Jordan B. Pollack,et al. Co-Evolution in the Successful Learning of Backgammon Strategy , 1998, Machine Learning.
[2] Gerald Tesauro,et al. Practical issues in temporal difference learning , 1992, Machine Learning.
[3] Jonathan Schaeffer,et al. The games computers (and people) play , 2000, Adv. Comput..
[4] John N. Tsitsiklis,et al. Call admission control and routing in integrated services networks using neuro-dynamic programming , 2000, IEEE Journal on Selected Areas in Communications.
[5] Michael Buro. Efficient Approximation of Backgammon Race Equities , 1999, J. Int. Comput. Games Assoc..
[6] Matthew Saffell,et al. Reinforcement Learning for Trading Systems and Portfolios , 1998, KDD.
[7] Andrew W. Moore,et al. Value Function Based Production Scheduling , 1998, ICML.
[8] Andrew Tridgell,et al. KnightCap: A Chess Programm That Learns by Combining TD(lambda) with Game-Tree Search , 1998, ICML.
[9] Sridhar Mahadevan,et al. Optimizing Production Manufacturing Using Reinforcement Learning , 1998, FLAIRS.
[10] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[11] John Moody,et al. Reinforcement Learning for Trading Systems and Portfolios: Immediate vs Future Rewards , 1998 .
[12] Andrew Tridgell,et al. KnightCap: A chess program that learns by combining TD( ) with minimax search , 1997, ICML 1997.
[13] Gerald Tesauro,et al. On-line Policy Improvement using Monte-Carlo Search , 1996, NIPS.
[14] Dimitri P. Bertsekas,et al. Reinforcement Learning for Dynamic Channel Allocation in Cellular Telephone Systems , 1996, NIPS.
[15] Thomas G. Dietterich,et al. High-Performance Job-Shop Scheduling With A Time-Delay TD(λ) Network , 1995, NIPS 1995.
[16] Andrew G. Barto,et al. Improving Elevator Performance Using Reinforcement Learning , 1995, NIPS.
[17] Gerald Tesauro,et al. Temporal difference learning and TD-Gammon , 1995, CACM.
[18] Gerald Tesauro,et al. Neurogammon Wins Computer Olympiad , 1989, Neural Computation.
[19] Kurt Hornik,et al. Multilayer feedforward networks are universal approximators , 1989, Neural Networks.
[20] L. Victor Allis,et al. A Knowledge-Based Approach of Connect-Four , 1988, J. Int. Comput. Games Assoc..
[21] Geoffrey E. Hinton,et al. Learning internal representations by error propagation , 1986 .
[22] Norman Zadeh,et al. On Optimal Doubling in Backgammon , 1977 .
[23] Nils J. Nilsson,et al. Artificial Intelligence , 1974, IFIP Congress.
[24] John R. Crawford,et al. The Backgammon Book , 1970 .
[25] Arthur L. Samuel,et al. Some Studies in Machine Learning Using the Game of Checkers , 1967, IBM J. Res. Dev..
[26] Jörg Bewersdorff,et al. Backgammon , 2022, Luck, Logic, and White Lies.