Automated generation of machine verifiable and readable proofs: A case study of Tarski’s geometry

State-of-the-art automated and interactive theorem provers have become powerful enough that a significant portion of non-trivial mathematical content can be formalized almost fully automatically. In this paper, we present a framework for formalizing mathematical knowledge that produces both machine-verifiable proofs (for different proof assistants) and human-readable, nearly textbook-like proofs. As a case study, we focus on a twentieth-century classic: a book on Tarski’s geometry. We attempted to generate such proofs automatically for the theorems in this book, using resolution theorem provers and a coherent logic theorem prover. In the first experiment, we used only theorems from the book; in the second, we added lemmas from the existing Coq formalization of the book; and in the third, we used the theorem-specific dependency lists from the Coq formalization. The results show that 37% of the theorems from the book can be proven automatically, with readable and machine-verifiable proofs generated, without any guidance, and that with additional lemmas this figure rises to 42%. These results give hope that the described framework and other forms of automation can significantly aid mathematicians in developing formal and informal mathematical knowledge.
