Backpropagation Modification in Monte-Carlo Game Tree Search
暂无分享,去创建一个
Zhiqing Liu | Fan Xie | Zhiqing Liu | Fan Xie
[1] Brian Sheppard,et al. World-championship-caliber Scrabble , 2002, Artif. Intell..
[2] David Silver,et al. Combining online and offline knowledge in UCT , 2007, ICML '07.
[3] Martin Müller,et al. Computer Go , 2002, Artif. Intell..
[4] Yishay Mansour,et al. A Sparse Sampling Algorithm for Near-Optimal Planning in Large Markov Decision Processes , 1999, Machine Learning.
[5] Jonathan Schaeffer,et al. The challenge of poker , 2002, Artif. Intell..
[6] Gerald Tesauro,et al. On-line Policy Improvement using Monte-Carlo Search , 1996, NIPS.
[7] Sylvain Gelly,et al. Exploration exploitation in Go: UCT for Monte-Carlo Go , 2006, NIPS 2006.
[8] SheppardBrian. World-championship-caliber Scrabble , 2002 .
[9] Peter Auer,et al. Finite-time Analysis of the Multiarmed Bandit Problem , 2002, Machine Learning.
[10] H. Jaap van den Herik,et al. Progressive Strategies for Monte-Carlo Tree Search , 2008 .
[11] Csaba Szepesvári,et al. Bandit Based Monte-Carlo Planning , 2006, ECML.