Rollout-based Game-tree Search Outprunes Traditional Alpha-beta
暂无分享,去创建一个
[1] Judea Pearl,et al. The solution for the branching factor of the alpha-beta pruning algorithm and its optimality , 1982, CACM.
[2] Rémi Munos,et al. Bandit Algorithms for Tree Search , 2007, UAI.
[3] Thomas J. Walsh,et al. Integrating Sample-Based Planning and Model-Based Reinforcement Learning , 2010, AAAI.
[4] George C. Stockman,et al. A Minimax Algorithm Better than Alpha-Beta? , 1979, Artif. Intell..
[5] Donald E. Knuth,et al. The Solution for the Branching Factor of the Alpha-Beta Pruning Algorithm , 1981, ICALP.
[6] Daniel Edwards,et al. The Alpha-Beta Heuristic , 1963 .
[7] R. Ramanujan,et al. On the Behavior of UCT in Synthetic Search Spaces , 2011 .
[8] Jonathan Schaeffer,et al. Best-First Fixed-Depth Minimax Algorithms , 1996, J. Int. Comput. Games Assoc..
[9] David Silver,et al. Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence (2008) Achieving Master Level Play in 9 × 9 Computer Go , 2022 .
[10] Tristan Cazenave,et al. Score Bounded Monte-Carlo Tree Search , 2010, Computers and Games.
[11] Csaba Szepesvári,et al. Bandit Based Monte-Carlo Planning , 2006, ECML.