Efficient Semantic Features for Automated Reasoning over Large Theories

Large formal mathematical knowledge bases encode considerable parts of advanced mathematics and exact science, allowing deep semantic computer assistance and verification of complicated theories down to the atomic logical rules. An essential part of automated reasoning over such large theories are methods learning selection of relevant knowledge from the thousands of proofs in the corpora. Such methods in turn rely on efficiently computable features characterizing the highly structured and inter-related mathematical statements. In this work we (i) propose novel semantic features characterizing the statements in such large semantic knowledge bases, (ii) propose and carry out their efficient implementation using deductive-AI data-structures such as substitution trees and discrimination nets, and (iii) show that they significantly improve the strength of existing knowledge selection methods and automated reasoning methods over the large formal knowledge bases. In particular, on a standard large-theory benchmark we improve the average predicted rank of a mathematical statement needed for a proof by 22% in comparison with state of the art. This allows us to prove 8% more theorems in comparison with state of the art.

[1]  Cezary Kaliszyk,et al.  Lemma Mining over HOL Light , 2013, LPAR.

[2]  Adam Naumowicz,et al.  Mizar in a Nutshell , 2010, J. Formaliz. Reason..

[3]  Robert Veroff,et al.  Using hints to increase the effectiveness of an automated reasoning program: Case studies , 1996, Journal of Automated Reasoning.

[4]  Peter Graf,et al.  Term Indexing , 1996, Lecture Notes in Computer Science.

[5]  Geoff Sutcliffe The 6th IJCAR automated theorem proving system competition - CASC-J6 , 2013, AI Commun..

[6]  Tobias Nipkow,et al.  Concrete Semantics: With Isabelle/HOL , 2014 .

[7]  J. A. Robinson,et al.  Handbook of Automated Reasoning (in 2 volumes) , 2001 .

[8]  Andrei Voronkov,et al.  Comparing Unification Algorithms in First-Order Theorem Proving , 2009, KI.

[9]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .

[10]  Josef Urban,et al.  MaLARea SG1- Machine Learner for Automated Reasoning with Semantic Guidance , 2008, IJCAR.

[11]  Josef Urban,et al.  Learning from Multiple Proofs: First Experiments , 2012, PAAR@IJCAR.

[12]  Steven Greenbaum Input transformations and resolution implementation techniques for theorem proving in first-order logic (clause form, discrimination networks, heuristic search, locking resolution) , 1986 .

[13]  T. Hales Dense Sphere Packings: A Blueprint for Formal Proofs , 2012 .

[14]  Jeremy Avigad,et al.  A Machine-Checked Proof of the Odd Order Theorem , 2013, ITP.

[15]  Jesse Alama,et al.  Premise Selection for Mathematics by Corpus Analysis and Kernel Methods , 2011, Journal of Automated Reasoning.

[16]  Lawrence C. Paulson,et al.  Lightweight relevance filtering for machine-generated resolution problems , 2009, J. Appl. Log..

[17]  A. M. Turing,et al.  Computing Machinery and Intelligence , 1950, The Philosophy of Artificial Intelligence.

[18]  Andrei Voronkov,et al.  Sine Qua Non for Large Theory Reasoning , 2011, CADE.

[19]  Cezary Kaliszyk,et al.  Learning-Assisted Automated Reasoning with Flyspeck , 2012, Journal of Automated Reasoning.

[20]  Josef Urban,et al.  MaLeCoP Machine Learning Connection Prover , 2011, TABLEAUX.

[21]  Michael Norrish,et al.  seL4: formal verification of an operating-system kernel , 2010, Commun. ACM.

[22]  Josef Urban,et al.  Overview and Evaluation of Premise Selection Techniques for Large Theory Mathematics , 2012, IJCAR.

[23]  Stephan Schulz,et al.  Learning search control knowledge for equational deduction , 2000, DISKI.

[24]  R. Lathe Phd by thesis , 1988, Nature.

[25]  Cezary Kaliszyk,et al.  MaSh: Machine Learning for Sledgehammer , 2013, ITP.

[26]  Nikolaj Bjørner,et al.  Z3: An Efficient SMT Solver , 2008, TACAS.

[27]  John Harrison,et al.  HOL Light: A Tutorial Introduction , 1996, FMCAD.

[28]  Journal of automated reasoning , 1986 .

[29]  Peter Graf Substitution Tree Indexing , 1995, RTA.

[30]  Cezary Kaliszyk,et al.  Stronger Automation for Flyspeck by Feature Weighting and Strategy Evolution , 2013, PxTP@CADE.

[31]  Stephan Schulz,et al.  E - a brainiac theorem prover , 2002, AI Commun..

[32]  Andrei Voronkov,et al.  First-Order Theorem Proving and Vampire , 2013, CAV.

[33]  Josef Urban,et al.  Evaluation of Automated Theorem Proving on the Mizar Mathematical Library , 2010, ICMS.