Focused Depth-first Proof Number Search using Convolutional Neural Networks for the Game of Hex

Proof number search (PNS) is an effective algorithm for determining the game-theoretic value of positions in games with non-uniform branching factors. Focused depth-first proof number search (FDFPN) with dynamic widening was proposed for Hex, where the branching factor is nearly uniform. However, FDFPN is sensitive to the quality of its heuristic move-ordering function. Recent advances in convolutional neural networks (CNNs) have led to considerable progress in game playing. We investigate how to bring the strength of CNNs to solving, with application to the game of Hex. We describe FDFPN-CNN, a new focused DFPN search that uses two CNNs trained on games played by expert players. The value-approximation CNN estimates the value of the node to be expanded, which provides reliable information for setting the widening size, while the policy CNN selects promising child nodes for the search. On 8x8 Hex, experimental results show that FDFPN-CNN performs notably better than FDFPN, suggesting a promising direction for solving Hex positions where learning from strong players is possible.
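
To make the division of labour between the two networks concrete, the following is a minimal sketch of how a value estimate and a policy distribution could jointly restrict the children a focused search considers. It is an illustration only, not the rule used by FDFPN-CNN: the function names `widening_size` and `focus_children`, the parameter `min_frac`, and the linear mapping from value to width are all assumptions. The sketch assumes the value output is the estimated probability that the player to move wins, and that the policy output is a probability per legal move.

```python
import math
from typing import Dict, List


def widening_size(value: float, num_children: int, min_frac: float = 0.2) -> int:
    """Hypothetical widening rule (an assumption, not the paper's formula).

    `value` is taken to be the estimated probability, in [0, 1], that the
    player to move wins. A confident winning estimate suggests that one good
    move may suffice, so the window is narrow; a losing estimate suggests
    that many moves must be refuted, so the window is wide.
    """
    if not 0.0 <= value <= 1.0:
        raise ValueError("value must lie in [0, 1]")
    if num_children == 0:
        return 0
    # Fraction of children to keep: min_frac at value = 1, all at value = 0.
    frac = min_frac + (1.0 - min_frac) * (1.0 - value)
    return max(1, min(num_children, math.ceil(frac * num_children)))


def focus_children(policy: Dict[str, float], value: float) -> List[str]:
    """Return the moves the search would focus on, best policy score first."""
    ranked = sorted(policy, key=policy.get, reverse=True)
    return ranked[: widening_size(value, len(ranked))]


# Illustrative policy output over four legal moves (made-up numbers).
policy = {"a1": 0.50, "b2": 0.30, "c3": 0.15, "d4": 0.05}
focused_if_winning = focus_children(policy, value=1.0)
focused_if_unclear = focus_children(policy, value=0.5)
focused_if_losing = focus_children(policy, value=0.0)
```

With these made-up numbers, a position judged clearly winning keeps only the top policy move in focus, while a position judged clearly losing keeps every move. A real solver would widen the window again as focused children are disproved, which this sketch does not model.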
