INDUCTIVE INFERENCE OF RECURSIVE CONCEPTS

Inductive Inference of Recursive Concepts Yasuhito Mukouchi Kyushu University December 1993 Inductive inference is a process of hypothesizing a general rule from examples. As a successful inference criterion for inductive inference of formal languages and models of logic programming, we have mainly used Gold's identification in the limit. An inference machine M is said to infer a concept L in the limit, if the sequence of guesses from M which is successively fed a sequence of examples of L converges to a correct expression of L, that is, all guesses from M become a unique expression in a finite time and that the expression is a correct one. A class, or a hypothesis space, is said to be inferable in the limit, if there is an inference machine M which infers every concept in the class. In the present thesis, we mainly investigate three criteria related to the identification in the limit. As we will see, they are necessary for practical applications of machine learning or machine discovery. The first criterion requires an inference machine to produce a unique guess. That is, we apply so-called finite identification to concept learning. As stated above, ordinary inductive inference is an infinite process. Thus we can not decide in general whether a sequence of guesses from an inference machine has converged or not at a certain time. To the contrary, in the criterion of finite identification, if an inference machine produces a guess, then it is a conclusive answer. The second criterion requires an inference machine to refute a hypothesis space in question, if a target concept is not in the hypothesis space. In the ordinary inductive inference, the behavior of an inference machine is not specified, when we feed examples of a target concept not belonging to the hypothesis space. That is, we implicitly assume that every target concept belongs to the hypothesis space. As far as data or facts are presented according to a concept that is unknown but guaranteed to be in the hypothesis space, the machine will eventually identify the hypothesis. However this assumption is not appropriate, if we want an inference machine to infer or to discover an unknown rule which explains examples or data obtained from scientific experiments. Thus we propose a successful inference criterion where, if there is no concept in the hypothesis space which coincides with a target concept, then an inference machine explicitly tells us this and stops in a finite time. The third criterion requires an inference machine to infer a minimal concept within the hypothesis space concerned. In actual applications of inductive inference, there are many cases where we want an inference machine to infer an approximate concept within the hypothesis space, even when there is no concept which exactly coincides with the target concept. Here we take a minimal concept as an approximate concept within the hypothesis space, and discuss inferability of a minimal concept of the target concept which may not belong to the hypothesis space. That is, we force an inference machine to converge to an expression of a minimal concept of the target concept, if there is a minimal concept of the target concept within the hypothesis space. In the present thesis, we discuss inferability of recursive concepts under the above three criteria, and show some necessary and sufficient conditions for inferability and some comparisons between inferable classes. Furthermore as practical and concrete hypothesis spaces, we take the classes definable by so-called lengt h-bounded elementary formal systems (EFS's, for short) and discuss their inferability in the above three criteria. In 1990, Shinohara showed that the classes definable by length-bounded EFS's with at most n axioms are inferable in the limit from positive data for any n 2: 1. In the present thesis, we show that the above classes are also refutably inferable from complete data, i.e. positive and negative data, as well as minimally inferable from positive data. This means that there are rich hypothesis spaces that are refutably inferable from complete data or minimally inferable from positive data.

[1]  Yasuhito Mukouchi,et al.  Inductive Inference with Bounded Mind Changes , 1992, ALT.

[2]  E. Mark Gold,et al.  Language Identification in the Limit , 1967, Inf. Control..

[3]  John Case,et al.  Machine Inductive Inference and Language Identification , 1982, ICALP.

[4]  Yasuhito Mukouchi,et al.  Inductive Inference of an Approximate Concept from Positive Data , 1994, AII/ALT.

[5]  Takeshi Shinohara,et al.  Polynomial Time Inference of Extended Regular Pattern Languages , 1983, RIMS Symposium on Software Science and Engineering.

[6]  J. Lloyd Foundations of Logic Programming , 1984, Symbolic Computation.

[7]  Rolf Wiehagen,et al.  Research in the theory of inductive inference by GDR mathematicians - A survey , 1980, Inf. Sci..

[8]  Takeshi Shinohara Inductive inference from positive data is powerful , 1990, COLT '90.

[9]  Thomas Zeugmann,et al.  Learning Recursive Languages with Bounded Mind Changes , 1993, Int. J. Found. Comput. Sci..

[10]  Akihiro Yamamoto,et al.  Elementary formal system as a unifying framework for language learning , 1989, COLT '89.

[11]  Hiroki Arimura Completeness of Depth-bounded Resolution for Weakly Reducing Programs , 1991, Software Science and Engineering.

[12]  Setsuo Arikawa,et al.  Inductive Inference Machines That Can Refute Hypothesis Spaces , 1993, ALT.

[13]  R. Smullyan Theory of formal systems , 1962 .

[14]  Keith Wright Identification of unions of languages drawn from an identifiable class , 1989, COLT '89.

[15]  Leslie G. Valiant,et al.  A theory of the learnable , 1984, STOC '84.

[16]  P. Langley,et al.  Computational Models of Scientific Discovery and Theory Formation , 1990 .

[17]  J. Farris CONJECTURES AND REFUTATIONS , 1995, Cladistics : the international journal of the Willi Hennig Society.

[18]  Dana Angluin,et al.  Finding Patterns Common to a Set of Strings , 1980, J. Comput. Syst. Sci..

[19]  Takeshi Shinohara,et al.  The correct definition of finite elasticity: corrigendum to identification of unions , 1991, COLT '91.

[20]  Shyam Kapur,et al.  Monotonic Language Learning , 1992, ALT.

[21]  Gary James Jason,et al.  The Logic of Scientific Discovery , 1988 .

[22]  Thomas Zeugmann,et al.  Types of monotonic language learning and their characterization , 1992, COLT '92.