Proving Theorems using Incremental Learning and Hindsight Experience Replay

Traditional automated theorem provers for first-order logic depend on speed-optimized search and many handcrafted heuristics that are designed to work best over a wide range of domains. Machine learning approaches in literature either depend on these traditional provers to bootstrap themselves or fall short on reaching comparable performance. In this paper, we propose a general incremental learning algorithm for training domainspecific provers for first-order logic without equality, based only on a basic given-clause algorithm, but using a learned clause-scoring function. Clauses are represented as graphs and presented to transformer networks with spectral features. To address the sparsity and the initial lack of training data as well as the lack of a natural curriculum, we adapt hindsight experience replay to theorem proving, so as to be able to learn even when no proof can be found. We show that provers trained this way can match and sometimes surpass state-of-the-art traditional provers on the TPTP dataset in terms of both quantity and quality of the proofs.

[1]  Simon Cruanes,et al.  Faster, Higher, Stronger: E 2.3 , 2019, CADE.

[2]  Zarathustra Amadeus Goertzel Make E Smart Again (Short Paper) , 2020, IJCAR.

[3]  Jörg Denzinger,et al.  DISCOUNT - A Distributed and Learning Equational Prover , 2004, Journal of Automated Reasoning.

[4]  Xavier Bresson,et al.  A Generalization of Transformer Networks to Graphs , 2020, ArXiv.

[5]  Xavier Leroy,et al.  Formal verification of a realistic compiler , 2009, CACM.

[6]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[7]  Larry Wos,et al.  Otter - The CADE-13 Competition Incarnations , 1997, Journal of Automated Reasoning.

[8]  Cezary Kaliszyk,et al.  Deep Network Guided Proof Search , 2017, LPAR.

[9]  J. A. Robinson,et al.  A Machine-Oriented Logic Based on the Resolution Principle , 1965, JACM.

[10]  Laurent Orseau,et al.  Learning to Prove from Synthetic Theorems , 2020, ArXiv.

[11]  Josef Urban,et al.  Hammering Mizar by Learning Clause Guidance (Short Paper) , 2019, ITP.

[12]  Wolfgang Bibel,et al.  leanCoP: lean connection-based theorem proving , 2003, J. Symb. Comput..

[13]  Nathan R. Sturtevant,et al.  Iterative Budgeted Exponential Search , 2019, IJCAI.

[14]  Melvin Fitting,et al.  First-Order Logic and Automated Theorem Proving , 1990, Graduate Texts in Computer Science.

[15]  Stephan Schulz We know (nearly) nothing!l But can we learn? , 2017, ARCADE@CADE.

[16]  Josef Urban,et al.  Prolog Technology Reinforcement Learning Prover , 2020, IJCAR.

[17]  Peter Elias,et al.  Universal codeword sets and representations of the integers , 1975, IEEE Trans. Inf. Theory.

[18]  Gerwin Klein,et al.  Operating system verification—An overview , 2009 .

[19]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[20]  Josef Urban,et al.  ENIGMA Anonymous: Symbol-Independent Inference Guiding Machine (System Description) , 2020, IJCAR.

[21]  Laurent Orseau,et al.  Policy-Guided Heuristic Search with Guarantees , 2021, AAAI.

[22]  D. Knuth,et al.  Simple Word Problems in Universal Algebras , 1983 .

[23]  Ilya Sutskever,et al.  Generative Language Modeling for Automated Theorem Proving , 2020, ArXiv.

[24]  Marcin Andrychowicz,et al.  Hindsight Experience Replay , 2017, NIPS.

[25]  Josef Urban,et al.  ENIGMA-NG: Efficient Neural and Gradient-Boosted Inference Guidance for E , 2019, CADE.

[26]  Achille Fokoue,et al.  A Deep Reinforcement Learning Approach to First-Order Logic Theorem Proving , 2021, AAAI.

[27]  David Zuckerman,et al.  Optimal Speedup of Las Vegas Algorithms , 1993, Inf. Process. Lett..

[28]  Cezary Kaliszyk,et al.  Reinforcement Learning of Theorem Proving , 2018, NeurIPS.

[29]  Geoff Sutcliffe The TPTP Problem Library and Associated Infrastructure , 2017, Journal of Automated Reasoning.

[30]  Sandra Zilles,et al.  Learning heuristic functions for large state spaces , 2011, Artif. Intell..

[31]  Andrei Voronkov,et al.  First-Order Theorem Proving and Vampire , 2013, CAV.

[32]  Stephan Schulz,et al.  E - a brainiac theorem prover , 2002, AI Commun..