论文信息 - A Principled Method for Exploiting Opening Books

A Principled Method for Exploiting Opening Books

In the past we used a great deal of computational power and human expertise for storing a rather big dataset of good 9×9 Go games, in order to build an opening book. We improved the algorithm used for generating and storing these games considerably. However, the results were not very robust, as (i) opening books are definitely not transitive, making the non-regression testing extremely difficult, (ii) different time settings lead to opposite conclusions, because a good opening for a game with 10s per move on a single core is quite different from a good opening for a game with 30s per move on a 32-cores machine, and (iii) some very bad moves sometimes still occur. In this paper, we formalize the optimization of an opening book as a matrix game, compute the Nash equilibrium, and conclude that a naturally randomized opening book provides optimal performance (in the sense of Nash equilibria). Moreover, our research showed that from a finite set of opening books, we can choose a distribution on these opening books so that the resultant randomly constructed opening book has a significantly better performance than each of the deterministic opening books.

[1] J. Robinson. AN ITERATIVE METHOD OF SOLVING A GAME , 1951, Classics in Game Theory.

[2] Richard E. Korf,et al. Depth-First Iterative-Deepening: An Optimal Admissible Tree Search , 1985, Artif. Intell..

[3] Steven Walczak. Improving opening book performance through modeling of chess opponents , 1996, CSC '96.

[4] Michael Buro. Toward Opening Book Learning , 1999, J. Int. Comput. Games Assoc..

[5] Hiroyuki Iida,et al. Self-playing-based Opening Book Tuning , 2006 .

[6] Ulf Lorenz,et al. Innovative Opening-Book Handling , 2006, ACG.

[7] David Silver,et al. Combining online and offline knowledge in UCT , 2007, ICML '07.

[8] Rolf Drechsler,et al. Applications of Evolutionary Computing, EvoWorkshops 2008: EvoCOMNET, EvoFIN, EvoHOT, EvoIASP, EvoMUSART, EvoNUM, EvoSTOC, and EvoTransLog, Naples, Italy, March 26-28, 2008. Proceedings , 2008, EvoWorkshops.

[9] Olivier Teytaud,et al. Grid Coevolution for Adaptive Simulations: Application to the Building of Opening Books in the Game of Go , 2009, EvoWorkshops.

[10] Tzung-Pei Hong,et al. The Computational Intelligence of MoGo Revealed in Taiwan's Computer Go Tournaments , 2009, IEEE Transactions on Computational Intelligence and AI in Games.

[11] Shlomo Zilberstein,et al. Optimizing fixed-size stochastic controllers for POMDPs and decentralized POMDPs , 2010, Autonomous Agents and Multi-Agent Systems.

[12] Olivier Teytaud,et al. Consistency Modifications for Automatically Tuned Monte-Carlo Tree Search , 2010, LION.