暂无分享,去创建一个
Gerald Tesauro | James Fan | David Gondek | John M. Prager | Jonathan Lenchner | G. Tesauro | David Gondek | James Fan | J. Prager | J. Lenchner
[1] W. Hamilton,et al. The Evolution of Cooperation , 1984 .
[2] Geoffrey E. Hinton,et al. Learning internal representations by error propagation , 1986 .
[3] Gerald Tesauro,et al. Temporal Difference Learning and TD-Gammon , 1995, J. Int. Comput. Games Assoc..
[4] Dimitri P. Bertsekas,et al. Dynamic Programming and Optimal Control, Two Volume Set , 1995 .
[5] Gerald Tesauro,et al. On-line Policy Improvement using Monte-Carlo Search , 1996, NIPS.
[6] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[7] Kurt Hornik,et al. On the generation of correlated artificial binary data , 1998 .
[8] Dimitri P. Bertsekas,et al. Rollout Algorithms for Stochastic Scheduling Problems , 1999, J. Heuristics.
[9] Matthew L. Ginsberg,et al. GIB: Steps Toward an Expert-Level Bridge-Playing Program , 1999, IJCAI.
[10] Darse Billings,et al. The First International RoShamBo Programming Competition , 2000, J. Int. Comput. Games Assoc..
[11] Bill Ravens,et al. An Introduction to Copulas , 2000, Technometrics.
[12] Jonathan Schaeffer,et al. The challenge of poker , 2002, Artif. Intell..
[13] Brian Sheppard,et al. World-championship-caliber Scrabble , 2002, Artif. Intell..
[14] M. El-Sabaawi. Breakdown of Will , 2002 .
[15] Fu‐Chun Wu,et al. Second-order Monte Carlo uncertainty/variability analysis using correlated model parameters: application to salmonid embryo survival risk assessment , 2004 .
[16] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[17] R. Nelsen. An Introduction to Copulas (Springer Series in Statistics) , 2006 .
[18] Jennifer Chu-Carroll,et al. Building Watson: An Overview of the DeepQA Project , 2010, AI Mag..
[19] M. Dufwenberg. Game theory. , 2011, Wiley interdisciplinary reviews. Cognitive science.
[20] Gerald Tesauro,et al. Simulation, learning, and optimization techniques in Watson's game strategies , 2012, IBM J. Res. Dev..
[21] David A. Ferrucci,et al. Introduction to "This is Watson" , 2012, IBM J. Res. Dev..
[22] Jennifer Chu-Carroll,et al. Special Questions and techniques , 2012, IBM J. Res. Dev..
[23] Richard S. Sutton,et al. Reinforcement Learning , 1992, Handbook of Machine Learning.