A Strategy for Constructing New Predicates in First-Order Logic

There is increasing interest within the Machine Learning community in systems which automatically reformulate their problem representation by deening and constructing new predicates. A previous paper discussed such a system, called CIGOL, and gave a derivation for the mechanism of inverting individual steps in rst order resolution proofs. In this paper we describe an enhancement to CIGOL's learning strategy which strongly constrains the formation of new concepts and hypotheses. The new strategy is based on results from algorithmic information theory. Using these results it is possible to compute the probability that the simpliications produced by adopting new concepts or hypotheses are not based on chance regularities within the examples. This can be derived from the amount of information compression produced by replacing the examples with the hypothesised concepts. CIGOL's improved performance, based on an approximation of this strategy, is demonstrated by way of the automatic \discovery" of the concept of radiation. This example also demonstrates CIGOL's ability to ignore irrelevant background knowledge and deal with multiple interacting concepts.

[1]  Larry A. Rendell,et al.  Substantial Constructive Induction Using Layered Information Compression: Tractable Feature Formation in Search , 1985, IJCAI.

[2]  Ranan B. Banerji,et al.  Learning in the Limit in a Growing Language , 1987, IJCAI.

[3]  J. A. Robinson,et al.  A Machine-Oriented Logic Based on the Resolution Principle , 1965, JACM.

[4]  Stefan Wrobel,et al.  Automatic Representation Adjustment in an Observational Discovery System , 1988, EWSL.

[5]  Wray L. Buntine Generalized Subsumption and Its Applications to Induction and Redundancy , 1986, Artif. Intell..

[6]  Ray J. Solomonoff,et al.  A Formal Theory of Inductive Inference. Part I , 1964, Inf. Control..

[7]  Tom Michael Mitchell,et al.  Explanation-based generalization: A unifying view , 1986 .

[8]  Kurt Gödel,et al.  On Formally Undecidable Propositions of Principia Mathematica and Related Systems , 1966 .

[9]  H. Simon,et al.  Rediscovering Chemistry with the Bacon System , 1983 .

[10]  G. Plotkin Automatic Methods of Inductive Inference , 1972 .

[11]  Rüdiger Wirth,et al.  Learning by Failure to Prove , 1988, EWSL.

[12]  Gregory J. Chaitin,et al.  Information, Randomness and Incompleteness - Papers on Algorithmic Information Theory; 2nd Edition , 1987, World Scientific Series in Computer Science.

[13]  R. Michalski,et al.  Learning from Observation: Conceptual Clustering , 1983 .

[14]  Tom M. Mitchell,et al.  Generalization as Search , 2002 .

[15]  Susan L. Epstein On the Discovery of Mathematical Theorems , 1987, IJCAI.

[16]  Stephen Muggleton,et al.  Duce, An Oracle-based Approach to Constructive Induction , 1987, IJCAI.

[17]  Gerald DeJong,et al.  Generalizations Based on Explanations , 1981, IJCAI.

[18]  Tim Niblett,et al.  A Study of Generalisation in Logic Programs , 1988, EWSL.

[19]  Ryszard S. Michalski,et al.  A Theory and Methodology of Inductive Learning , 1983, Artificial Intelligence.

[20]  Ryszard S. Michalski,et al.  Selection of Most Representative Training Examples and Incremental Generation of VL1 Hypotheses: The Underlying Methodology and the Description of Programs ESEL and AQ11 , 1978 .

[21]  K. Popper,et al.  Conjectures and refutations;: The growth of scientific knowledge , 1972 .

[22]  A. Kolmogorov Three approaches to the quantitative definition of information , 1968 .

[23]  Stephen Muggleton,et al.  Machine Invention of First Order Predicates by Inverting Resolution , 1988, ML.