Improving the Exploration in Upper Confidence Trees
暂无分享,去创建一个
[1] H. Jaap van den Herik,et al. Progressive Strategies for Monte-Carlo Tree Search , 2008 .
[2] Csaba Szepesvári,et al. Online Optimization in X-Armed Bandits , 2008, NIPS.
[3] Sean R Eddy,et al. What is dynamic programming? , 2004, Nature Biotechnology.
[4] Rémi Coulom,et al. Efficient Selectivity and Backup Operators in Monte-Carlo Tree Search , 2006, Computers and Games.
[5] John N. Tsitsiklis,et al. Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.
[6] Zongmin Ma,et al. Computers and Games , 2008, Lecture Notes in Computer Science.
[7] Rémi Coulom,et al. Computing "Elo Ratings" of Move Patterns in the Game of Go , 2007, J. Int. Comput. Games Assoc..
[8] David Silver,et al. Combining online and offline knowledge in UCT , 2007, ICML '07.
[9] Nataliya Sokolovska,et al. Continuous Upper Confidence Trees , 2011, LION.