Logical Induction

We present a computable algorithm that assigns probabilities to every logical statement in a given formal language, and refines those probabilities over time. For instance, if the language is Peano arithmetic, it assigns probabilities to all arithmetical statements, including claims about the twin prime conjecture, the outputs of long-running computations, and its own probabilities. We show that our algorithm, an instance of what we call a logical inductor, satisfies a number of intuitive desiderata, including: (1) it learns to predict patterns of truth and falsehood in logical statements, often long before having the resources to evaluate the statements, so long as the patterns can be written down in polynomial time; (2) it learns to use appropriate statistical summaries to predict sequences of statements whose truth values appear pseudorandom; and (3) it learns to have accurate beliefs about its own current beliefs, in a manner that avoids the standard paradoxes of self-reference. For example, if a given computer program only ever produces outputs in a certain range, a logical inductor learns this fact in a timely manner; and if late digits in the decimal expansion of π are difficult to predict, then a logical inductor learns to assign ≈ 10% probability to “the nth digit of π is a 7” for large n. Logical inductors also learn to trust their future beliefs more than their current beliefs, and their beliefs are coherent in the limit (whenever φ → ψ, P∞(φ) ≤ P∞(ψ), and so on); and logical inductors strictly dominate the universal semimeasure in the limit. These properties and many others all follow from a single logical induction criterion, which is motivated by a series of stock trading analogies. Roughly speaking, each logical sentence φ is associated with a stock that is worth $1 per share if φ is true and nothing otherwise, and we interpret the belief-state of a logically uncertain reasoner as a set of market prices, where Pn(φ) = 50% means that on day n, shares of φ may be bought or sold from the reasoner for 50¢. The logical induction criterion says (very roughly) that there should not be any polynomial-time computable trading strategy with finite risk tolerance that earns unbounded profits in that market over time. This criterion bears strong resemblance to the “no Dutch book” criteria that support both expected utility theory (von Neumann and Morgenstern 1944) and Bayesian probability theory (Ramsey 1931; de Finetti 1937).

[1]  Jaakko Hintikka,et al.  Knowledge and Belief: An Introduction to the Logic of the Two Notions. , 1965 .

[2]  Pedro M. Domingos,et al.  Efficient Weight Learning for Markov Logic Networks , 2007, PKDD.

[3]  Oren Etzioni,et al.  Markov Logic Networks for Natural Language Question Answering , 2015, ArXiv.

[4]  Matthew Richardson,et al.  Markov logic networks , 2006, Machine Learning.

[5]  Scott Aaronson,et al.  Why Philosophers Should Care About Computational Complexity , 2011, Electron. Colloquium Comput. Complex..

[6]  Lyle Zynda,et al.  Old evidence and new theories , 1995 .

[7]  Ray J. Solomonoff,et al.  A Formal Theory of Inductive Inference. Part II , 1964, Inf. Control..

[8]  Pedro M. Domingos,et al.  Learning the structure of Markov logic networks , 2005, ICML.

[9]  Shlomo Zilberstein Metareasoning and Bounded Rationality , 2011, Metareasoning.

[10]  Veikko Rantala,et al.  Urn models: A new kind of non-standard model for first-order logic , 1975, J. Philos. Log..

[11]  Pedro M. Domingos,et al.  Discriminative Training of Markov Logic Networks , 2005, AAAI.

[12]  D. Hilbert Mathematical Problems , 2019, Mathematics: People · Problems · Results.

[13]  Nate Soares,et al.  Inductive Coherence , 2016, 1604.05288.

[14]  Eric Horvitz,et al.  Metareasoning for Planning Under Uncertainty , 2015, IJCAI.

[15]  Vann McGee,et al.  Truth, Vagueness, and Paradox: An Essay on the Logic of Truth , 1991 .

[16]  Ben Taskar,et al.  Bayesian Logic Programming: Theory and Tool , 2007 .

[17]  W. van der Hoek,et al.  Epistemic logic for AI and computer science , 1995, Cambridge tracts in theoretical computer science.

[18]  N. Vorobʹev Foundations of game theory , 1994 .

[19]  A. Rubinstein Modeling Bounded Rationality , 1998 .

[20]  J. Łoś On the axiomatic treatment of probability , 1955 .

[21]  H. Simon Models of Bounded Rationality: Empirically Grounded Economic Reason , 1997 .

[22]  Cristian S. Calude,et al.  Most Programs Stop Quickly or Never Halt , 2006, Adv. Appl. Math..

[23]  Dimitar P. Filev,et al.  Fuzzy SETS AND FUZZY LOGIC , 1996 .

[24]  Leonard J. Savage,et al.  Difficulties in the Theory of Personal Probability , 1967, Philosophy of Science.

[25]  Pedro M. Domingos,et al.  Hybrid Markov Logic Networks , 2008, AAAI.

[26]  Ronald Fagin,et al.  Belief, Awareness, and Limited Reasoning. , 1987, Artif. Intell..

[27]  Giangiacomo Gerla,et al.  Fuzzy Logic: Mathematical Tools for Approximate Reasoning , 2001 .

[28]  H. Gaifman Concerning measures in first order calculi , 1964 .

[29]  Scott Garrabrant,et al.  Asymptotic Logical Uncertainty and the Benford Test , 2015, AGI.

[30]  James M. Joyce The Foundations of Causal Decision Theory , 1999 .

[31]  N. Belnap,et al.  The Revision Theory of Truth , 1993 .

[32]  Catrin Campbell-Moore,et al.  How to Express Self-Referential Probability∗ , 2014 .

[33]  M. Bacharach The epistemic structure of a theory of a game , 1994 .

[34]  Ian Hacking,et al.  Slightly More Realistic Personal Probability , 1967, Philosophy of Science.

[35]  Luc De Raedt,et al.  Logical and relational learning , 2008, Cognitive Technologies.

[36]  G. Boole An Investigation of the Laws of Thought: On which are founded the mathematical theories of logic and probabilities , 2007 .

[37]  E. Rowland Theory of Games and Economic Behavior , 1946, Nature.

[38]  Frank D. Wood,et al.  A New Approach to Probabilistic Programming Inference , 2014, AISTATS.

[39]  D. Opitz,et al.  Popular Ensemble Methods: An Empirical Study , 1999, J. Artif. Intell. Res..

[40]  Marcus Hutter,et al.  Probabilities on Sentences in an Expressive Logic , 2012, J. Appl. Log..

[41]  S. Yau Mathematics and its applications , 2002 .

[42]  Stuart J. Russell,et al.  Principles of Metareasoning , 1989, Artif. Intell..

[43]  A. Turing On Computable Numbers, with an Application to the Entscheidungsproblem. , 1937 .

[44]  Yitang Zhang Bounded gaps between primes , 2014 .

[45]  David Lewis,et al.  Papers in metaphysics and epistemology , 1999 .

[46]  Ernest W. Adams,et al.  A primer of probability logic , 1996 .

[47]  M. Subrahmanyam Theory and Evidence , 2013 .

[48]  John Langford,et al.  Learning performance of prediction markets with Kelly bettors , 2012, AAMAS.

[49]  Abram Demski Logical Prior Probability , 2012, AGI.

[50]  Kurt Gödel,et al.  On undecidable propositions of formal mathematical systems , 1934 .

[51]  Thomas G. Dietterich Multiple Classifier Systems , 2000, Lecture Notes in Computer Science.

[52]  Thomas A. Henzinger,et al.  Probabilistic programming , 2014, FOSE.

[53]  Stuart J. Russell,et al.  Rationality and Intelligence: A Brief Update , 2013, PT-AI.

[54]  George N. Schlesinger The range of epistemic logic , 1985 .

[55]  Joseph Y. Halpern Reasoning about uncertainty , 2003 .

[56]  Andrew McCallum,et al.  FACTORIE: Probabilistic Programming via Imperatively Defined Factor Graphs , 2009, NIPS.

[57]  Luc De Raedt,et al.  Probabilistic Inductive Logic Programming , 2004, Probabilistic Inductive Logic Programming.

[58]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[59]  H. Jeffreys Logical Foundations of Probability , 1952, Nature.

[60]  Luc De Raedt,et al.  Bayesian Logic Programming: Theory and Tool , 2007 .

[61]  Pierpaolo Battigalli,et al.  Recent results on belief, knowledge and the epistemic foundations of game theory , 1999 .

[62]  David Ripley,et al.  Paraconsistent Logic , 2015, J. Philos. Log..

[63]  Ronald Fagin,et al.  Reasoning about knowledge , 1995 .

[64]  Nico Potyka,et al.  Solving Reasoning Problems for Probabilistic Conditional Logics with Consistent and Inconsistent Information , 2016 .

[65]  B. D. Finetti,et al.  Foresight: Its Logical Laws, Its Subjective Sources , 1992 .

[66]  John F. Sowa,et al.  Knowledge representation: logical, philosophical, and computational foundations , 2000 .

[67]  Daniel Garber,et al.  Old Evidence and Logical Omniscience in Bayesian Confirmation Theory , 1983 .

[68]  Nicola Guarino,et al.  Formal Ontology and Information Systems , 1998 .

[69]  Raymond J. Mooney,et al.  Mapping and Revising Markov Logic Networks for Transfer Learning , 2007, AAAI.

[70]  Michael Glanzberg,et al.  The Liar in Context , 2001 .

[71]  L. Levin,et al.  THE COMPLEXITY OF FINITE OBJECTS AND THE DEVELOPMENT OF THE CONCEPTS OF INFORMATION AND RANDOMNESS BY MEANS OF THE THEORY OF ALGORITHMS , 1970 .

[72]  Jens Christian Bjerring On counterpossibles , 2014 .

[73]  André Nies,et al.  Demuth randomness and computational complexity , 2011, Ann. Pure Appl. Log..

[74]  J. Yen,et al.  Fuzzy Logic: Intelligence, Control, and Information , 1998 .

[75]  Michael A. Osborne,et al.  Probabilistic Integration , 2015, ArXiv.

[76]  Barton L. Lipman How to Decide How to Decide How to. . . : Modeling Limited Rationality , 1991 .

[77]  A. N. Kolmogorov,et al.  Foundations of the theory of probability , 1960 .

[78]  M. Tribus,et al.  Probability theory: the logic of science , 2003 .

[79]  Newton C. A. da Costa,et al.  Why Paraconsistent Logics? , 2016, Towards Paraconsistent Engineering.

[80]  Luc De Raedt,et al.  Probabilistic (logic) programming concepts , 2015, Machine Learning.

[81]  Paul Christiano,et al.  Definability of Truth in Probabilistic Logic ( Early draft ) , 2013 .

[82]  Herbert B. Enderton,et al.  A mathematical introduction to logic , 1972 .

[83]  Catrin Campbell-Moore,et al.  HOW TO EXPRESS SELF-REFERENTIAL PROBABILITY. A KRIPKEAN PROPOSAL , 2015, The Review of Symbolic Logic.

[84]  Ray J. Solomonoff,et al.  A Formal Theory of Inductive Inference. Part I , 1964, Inf. Control..

[85]  Matthias Thimm,et al.  Inconsistency measures for probabilistic logics , 2013, Artif. Intell..

[86]  André Fuhrmann,et al.  Relevant logics, modal logics and theory change , 1988 .

[87]  Marc Snir,et al.  Probabilities over rich languages, testing and randomness , 1982, Journal of Symbolic Logic.

[88]  David Picado-Muiño,et al.  Measuring and repairing inconsistency in probabilistic knowledge bases , 2011, Int. J. Approx. Reason..

[89]  Michael A. Osborne,et al.  Frank-Wolfe Bayesian Quadrature: Probabilistic Integration with Theoretical Guarantees , 2015, NIPS.

[90]  Michael A. Osborne,et al.  Probabilistic numerics and uncertainty in computations , 2015, Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[91]  Daniel Cohen,et al.  On What Cannot Be , 1990 .

[92]  Larry S. Davis,et al.  Event Modeling and Recognition Using Markov Logic Networks , 2008, ECCV.

[93]  Stephen Muggleton,et al.  Latest Advances in Inductive Logic Programming, ILP 2011, Late Breaking Papers, Windsor Great Park, UK, July 31 - August 3, 2011 , 2014, ILP.

[94]  J. Sprenger A Novel Solution to the Problem of Old Evidence , 2015, Philosophy of Science.

[95]  Max Welling,et al.  Auto-Encoding Variational Bayes , 2013, ICLR.

[96]  J. Schreiber Foundations Of Statistics , 2016 .

[97]  P. Jones Making Decisions , 1971, Nature.

[98]  P. Grim The Incomplete Universe: Totality, Knowledge, and Truth , 1994 .

[99]  Ming Li,et al.  An Introduction to Kolmogorov Complexity and Its Applications , 2019, Texts in Computer Science.

[100]  Nico Potyka,et al.  Probabilistic Reasoning with Inconsistent Beliefs Using Inconsistency Measures , 2015, IJCAI.

[101]  Paul Christiano Non-Omniscience, Probabilistic Inference, and Metamathematics , 2014 .

[102]  David Tolpin,et al.  Selecting Computations: Theory and Applications , 2012, UAI.

[103]  Benja Fallenstein,et al.  Toward Idealized Decision Theory , 2015, ArXiv.

[104]  P G rdenfors,et al.  Knowledge in flux: modeling the dynamics of epistemic states , 1988 .

[105]  L. Stein,et al.  Probability and the Weighing of Evidence , 1950 .

[106]  V. S. Subrahmanian,et al.  Paraconsistent Logic Programming , 1987, Theor. Comput. Sci..

[107]  Ellery Eells,et al.  Bayesian problems of old evidence , 1990 .

[108]  Stuart J. Russell,et al.  Do the right thing - studies in limited rationality , 1991 .

[109]  Paul Teller,et al.  Conditionalization and observation , 1973, Synthese.

[110]  Nate Soares,et al.  Asymptotic Convergence in Online Learning with Unbounded Delays , 2016, ArXiv.

[111]  Stephen Muggleton,et al.  Latest Advances in Inductive Logic Programming , 2014 .

[112]  F. Ramsey Truth and Probability , 2016 .