Proof searching and prediction in HOL4 with evolutionary/heuristic and deep learning techniques

Interactive theorem provers (ITPs), also known as proof assistants, allow human users to write and verify formal proofs. The proof development process in ITPs can be a cumbersome and time-consuming activity due to manual user interactions. This makes proof guidance and proof automation the two most desired features for ITPs. In this paper, we first provide two evolutionary and heuristic-based proof searching approaches for the HOL4 proof assistant, where a genetic algorithm (GA) and simulated annealing (SA) is used to search and optimize the proofs in different HOL theories. In both approaches, random proof sequences are first generated from a population of frequently occurring HOL4 proof steps that are discovered with sequential pattern mining. Generated proof sequences are then evolved using GA operators (crossover and mutation) and by applying the annealing process of SA till their fitness match the fitness of the target proof sequences. Experiments were done to compare the performance of SA with that of GA. Results have shown that the two proof searching approaches can be used to efficiently evolve the random sequences to obtain the target sequences. However, these approaches lack the ability to learn the proof process, that is important for the prediction of new proof sequences. For this purpose, we propose to use a deep learning technique known as long short-term memory (LSTM). LSTM is trained on various HOL4 theories for proof learning and prediction. Experimental results suggest that combining evolutionary/heuristic and deep learning techniques with proof assistants can greatly facilitate proof finding/optimization and proof prediction.

[1]  Alex Graves,et al.  Supervised Sequence Labelling with Recurrent Neural Networks , 2012, Studies in Computational Intelligence.

[2]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[3]  Quoc V. Le,et al.  A Neural Conversational Model , 2015, ArXiv.

[4]  Meng Sun,et al.  Using Recurrent Neural Network to Predict Tactics for Proving Component Connector Properties in Coq , 2019, 2019 International Symposium on Theoretical Aspects of Software Engineering (TASE).

[5]  Osman Hasan,et al.  Proof searching in HOL4 with genetic algorithm , 2020, SAC.

[6]  Thibault Gauthier,et al.  TacticToe: Learning to Reason with HOL4 Tactics , 2017, LPAR.

[7]  Cezary Kaliszyk,et al.  Machine Learning of Coq Proof Guidance: First Experiments , 2014, SCSS.

[8]  Tzung-Pei Hong,et al.  Evolution of Appropriate Crossover and Mutation Operators in a Genetic Process , 2001, Applied Intelligence.

[9]  Daniel Whalen,et al.  Holophrasm: a neural Automated Theorem Prover for higher-order logic , 2016, ArXiv.

[10]  Danqi Chen,et al.  Reasoning With Neural Tensor Networks for Knowledge Base Completion , 2013, NIPS.

[11]  Demis Hassabis,et al.  Mastering the game of Go with deep neural networks and tree search , 2016, Nature.

[12]  Thibault Gauthier,et al.  Premise Selection and External Provers for HOL4 , 2015, CPP.

[13]  Xin Yao,et al.  Simulated Evolution and Learning: An Introduction , 2004, Applied Intelligence.

[14]  James Kennedy,et al.  Particle swarm optimization , 2002, Proceedings of ICNN'95 - International Conference on Neural Networks.

[15]  José Espírito Santo,et al.  Curry-Howard for Sequent Calculus at Last! , 2015, TLCA.

[16]  Michael Norrish,et al.  A Brief Overview of HOL4 , 2008, TPHOLs.

[17]  Philippe Fournier-Viger,et al.  Proof Guidance in PVS with Sequential Pattern Mining , 2019, FSEN.

[18]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[19]  Yoshua Bengio,et al.  On the Properties of Neural Machine Translation: Encoder–Decoder Approaches , 2014, SSST@EMNLP.

[20]  Simon M. Lucas,et al.  A Survey of Monte Carlo Tree Search Methods , 2012, IEEE Transactions on Computational Intelligence and AI in Games.

[21]  Chao-Hong Chen,et al.  Automatically proving mathematical theorems with evolutionary algorithms and proof assistants , 2016, 2016 IEEE Congress on Evolutionary Computation (CEC).

[22]  Pierre Castéran,et al.  Interactive Theorem Proving and Program Development , 2004, Texts in Theoretical Computer Science An EATCS Series.

[23]  Lukasz Kaiser,et al.  Neural GPUs Learn Algorithms , 2015, ICLR.

[24]  Cezary Kaliszyk,et al.  Deep Network Guided Proof Search , 2017, LPAR.

[25]  Lakhmi C. Jain,et al.  Recurrent Neural Networks: Design and Applications , 1999 .

[26]  Josef Urban,et al.  DeepMath - Deep Sequence Models for Premise Selection , 2016, NIPS.

[27]  Josef Urban,et al.  History of Interactive Theorem Proving , 2014, Computational Logic.

[28]  Jian Wang,et al.  Premise Selection for Theorem Proving by Deep Graph Embedding , 2017, NIPS.

[29]  Jesse Alama,et al.  Premise Selection for Mathematics by Corpus Analysis and Kernel Methods , 2011, Journal of Automated Reasoning.

[30]  Cezary Kaliszyk,et al.  Learning-assisted theorem proving with millions of lemmas☆ , 2015, J. Symb. Comput..

[31]  Cezary Kaliszyk,et al.  HOL(y)Hammer: Online ATP Service for HOL Light , 2013, Math. Comput. Sci..

[32]  Marc Gravel,et al.  Ensuring population diversity in genetic algorithms: A technical note with application to the cell formation problem , 2007, Eur. J. Oper. Res..

[33]  Sofiène Tahar,et al.  Formal Verification Methods , 2015 .

[34]  Xavier Leroy,et al.  Formal verification of a realistic compiler , 2009, CACM.

[35]  Cezary Kaliszyk,et al.  HolStep: A Machine Learning Dataset for Higher-order Logic Theorem Proving , 2017, ICLR.

[36]  Jason Weston,et al.  End-To-End Memory Networks , 2015, NIPS.

[37]  Jeremy Avigad,et al.  A Machine-Checked Proof of the Odd Order Theorem , 2013, ITP.

[38]  Hazel Duncan,et al.  The use of data-mining for the automatic formation of tactics , 2004 .

[39]  Marcel Mongeau,et al.  Simulated Annealing: From Basics to Applications , 2018, Handbook of Metaheuristics.

[40]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[41]  Yi Li,et al.  A Survey on Theorem Provers in Formal Methods , 2019, ArXiv.

[42]  Zhenbo Li,et al.  Study on hybrid PS-ACO algorithm , 2011, Applied Intelligence.

[43]  Terry Jones,et al.  Crossover, Macromutationand, and Population-Based Search , 1995, ICGA.

[44]  GERARD P. HUET,et al.  The Undecidability of Unification in Third Order Logic , 1973, Inf. Control..

[45]  Chad E. Brown,et al.  Internal Guidance for Satallax , 2016, IJCAR.

[46]  Sepp Hochreiter,et al.  The Vanishing Gradient Problem During Learning Recurrent Neural Nets and Problem Solutions , 1998, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[47]  Melanie Mitchell,et al.  An introduction to genetic algorithms , 1996 .

[48]  Kathryn A. Dowsland,et al.  Simulated Annealing , 1989, Encyclopedia of GIS.

[49]  Ying-ping Chen,et al.  Proving theorems by using evolutionary search with human involvement , 2017, 2017 IEEE Congress on Evolutionary Computation (CEC).

[50]  Tobias Nipkow,et al.  A FORMAL PROOF OF THE KEPLER CONJECTURE , 2015, Forum of Mathematics, Pi.

[51]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[52]  Tobias Nipkow,et al.  Mining the Archive of Formal Proofs , 2015, CICM.