Learning with Monte-Carlo methods
[1] Yngvi Björnsson, et al. CadiaPlayer: A Simulation-Based General Game Player, 2009, IEEE Transactions on Computational Intelligence and AI in Games.
[2] Csaba Szepesvári, et al. Bandit Based Monte-Carlo Planning, 2006, ECML.
[3] Peter Auer, et al. Finite-time Analysis of the Multiarmed Bandit Problem, 2002, Machine Learning.
[4] David Silver, et al. Combining online and offline knowledge in UCT, 2007, ICML '07.
[5] Rémi Coulom, et al. Efficient Selectivity and Backup Operators in Monte-Carlo Tree Search, 2006, Computers and Games.