Solving QSAT problems with neural MCTS

AlphaZero's recent achievements with self-play have shown remarkable performance on several board games. It is plausible that self-play, starting from zero knowledge, can gradually approximate a winning strategy for certain two-player games after a sufficient amount of training. In this paper, we leverage the computational power of neural Monte Carlo Tree Search (neural MCTS), the core algorithm behind AlphaZero, to solve Quantified Boolean Formula satisfiability (QSAT) problems, which are PSPACE-complete. Since every QSAT problem is equivalent to a two-player QSAT game, the game's outcome can be used to derive the solution of the original QSAT problem. We propose a way to encode Quantified Boolean Formulas (QBFs) as graphs and apply a graph neural network (GNN) to embed the QBFs into the neural MCTS. After training, an off-the-shelf QSAT solver is used to evaluate the performance of the algorithm. Our results show that, for problems of limited size, the algorithm learns to solve them correctly purely through self-play.
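The reduction from QSAT to a two-player game is standard: the existential player assigns existentially quantified variables and the universal player assigns universally quantified ones, in prefix order, with the existential player winning iff the fully assigned matrix evaluates to true. The following minimal Python sketch (an illustration of this reduction, not the paper's implementation; the names `qsat_game_value`, `Prefix`, and `Matrix` are ours) shows how the game's value coincides with the truth of the QBF:

```python
# Minimal sketch of the QSAT game, assuming a prenex-form QBF given as a
# quantifier prefix plus a propositional matrix over named variables.
from typing import Callable, Dict, List, Tuple

Prefix = List[Tuple[str, str]]           # e.g. [("exists", "x"), ("forall", "y")]
Matrix = Callable[[Dict[str, bool]], bool]

def qsat_game_value(prefix: Prefix, matrix: Matrix,
                    assignment: Dict[str, bool] = None) -> bool:
    """Return True iff the existential player has a winning strategy."""
    assignment = assignment or {}
    if not prefix:                       # all variables assigned: evaluate matrix
        return matrix(assignment)
    quantifier, var = prefix[0]
    branches = (qsat_game_value(prefix[1:], matrix, {**assignment, var: val})
                for val in (False, True))
    # The existential player maximizes (one winning move suffices);
    # the universal player minimizes (it must win on every move).
    return any(branches) if quantifier == "exists" else all(branches)

# Example: the QBF  ∃x ∀y. (x ∨ y) ∧ (x ∨ ¬y)  is true (take x = True).
phi = lambda a: (a["x"] or a["y"]) and (a["x"] or not a["y"])
print(qsat_game_value([("exists", "x"), ("forall", "y")], phi))  # True
```

This exhaustive game-tree evaluation takes exponential time, which is exactly the search cost that neural MCTS, guided by a learned GNN policy and value, is meant to reduce.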
