Elementary formal systems, intrinsic complexity, and procrastination

Recently, rich subclasses of elementary formal systems (EFS) have been shown to be identifiable in the limit from only positive data. Examples of these classes are Angluin’s pattern languages, unions of pattern languages by Wright and Shinohara, and classes of languages definable by length-bounded elementary formal systems studied by Shinohara. The present paper employs two distinct bodies of abstract studies in the inductive inference literature to analyze the learnability of the these concrete classes. The first approach, introduced by Freivalds and Smith, uses constructive ordinals to bound the number of mind changes. w denotes the first limit ordinal. An ordinal mind change bound of u means that identification can be carried out by a learner that after examining some element(s) of the language announces an upper bound on the number of mind changes it will make before converging; a bound of 2W means that the learner reserves the right to revise this upper bound once; a bound of 3W means the learner reserves the right to revise this upper bound twice, and so on. A bound of W2 means that identification can be carried out by a learner that announces an upper bound on the number of times it may revise its conjectured upper bound on the number of mind changes. It is shown in the present paper that the ordinal mind change complexity for identification of languages formed by unions of up to n pattern languages is Wn. It is also shown that this bound is essential. Similar results are also Permission to make digital/hard copies of all or part of this material for personal or classroom use is granted without fee provided that rhe copies are not made or distributed for profit or commercial advantage, the copyright ootice, the title of the publication and its date appear, and notice ia given that copyright is by permission of the ACM, Inc. To copy otherwise, to republish, to post on servers or to redistribute to lists, requirea apecitic permiaaion and/or fee. COLT’96, Desenzano del Garda, Italy @ 1996 ACM 0-89791-81 1-8/96/06 ..$3.50 shown to hold for classes definable by lengthbounded elementary formal systems with up to n clauses. The second approach, studied by Freivalds, Kinber and Wiehagen and by Jain and Sharma, employs reductions to study the intrinsic complexity of learnable classes. It is shown that the class of languages formed by taking unions of up to n + 1 pattern languages is a strictly more difficult learning problem than the class of languages formed by the union of up to n pattern languages. It is also shown that a similar hierarchy holds for the bound on the number of clauses in the case of languages definable by length-bounded EFS. In addition to building bridges between three distinct areas of inductive inference, viz., learnability of EFS subclasses, ordinal mind change complexity, and intrinsic complexity, this paper also presents results that relate topological properties of learnable classes to that of intrinsic complexity and ordinal mind change complexity. For example, it is shown that a clam that is complete according to the reductions for intrinsic complexity has infinite elasticity, Since EFS languages and their learnability results have counterparts in traditional logic programming, the present paper demonstrates the possibility of using abstract results of inductive inference to gain insights into inductive logic programming.

[1]  Paul Young,et al.  An introduction to the general theory of algorithms , 1978 .

[2]  E. Mark Gold,et al.  Language Identification in the Limit , 1967, Inf. Control..

[3]  Takeshi Shinohara,et al.  Rich Classes Inferable from Positive Data: Length-Bounded Elementary Formal Systems , 1994, Inf. Comput..

[4]  Satoru Miyano,et al.  More About Learning Elementary Formal Systems , 1991, Nonmonotonic and Inductive Logic.

[5]  Akihiro Yamamoto,et al.  Learning Elementary Formal Systems , 1992, Theor. Comput. Sci..

[6]  Akihiro Yamamoto,et al.  Algorithmic Learning Theory with Elementary Formal Systems , 1992 .

[7]  Arun Sharma,et al.  The Structure of Intrinsic Complexity of Learning , 1997, J. Symb. Log..

[8]  篠原 武 Studies on inductive inference from positive data , 1986 .

[9]  Jeffrey D. Ullman,et al.  Introduction to Automata Theory, Languages and Computation , 1979 .

[10]  G. Sacks Higher recursion theory , 1990 .

[11]  Keith Wright Identification of unions of languages drawn from an identifiable class , 1989, COLT '89.

[12]  Manuel Blum,et al.  A Machine-Independent Theory of the Complexity of Recursive Functions , 1967, JACM.

[13]  John Case,et al.  Not-So-Nearly-Minimal-Size Program Inference , 1995, GOSLER Final Report.

[14]  Dana Angluin,et al.  Finding Patterns Common to a Set of Strings , 1980, J. Comput. Syst. Sci..

[15]  Takeshi Shinohara,et al.  The correct definition of finite elasticity: corrigendum to identification of unions , 1991, COLT '91.

[16]  Hiroki Arimura,et al.  Inductive Inference of Prolog Programs with Linear Data Dependency from Positive Data , 1993 .

[17]  Carl H. Smith,et al.  On the Role of Procrastination in Machine Learning , 1993, Inf. Comput..

[18]  Jr. Hartley Rogers Theory of Recursive Functions and Effective Computability , 1969 .

[19]  Setsuo Arikawa,et al.  Elementary formal systems and formal languages-simple formal systems , 1970 .

[20]  Carl H. Smith,et al.  On the Intrinsic Complexity of Learning , 1995, Inf. Comput..

[21]  Carl H. Smith,et al.  On the role of procrastination for machine learning , 1992, COLT '92.

[22]  R. Smullyan Theory of formal systems , 1962 .

[23]  Arun Sharma,et al.  On the intrinsic complexity of language identification , 1994, COLT '94.

[24]  Stephen Cole Kleene,et al.  On notation for ordinal numbers , 1938, Journal of Symbolic Logic.

[25]  M. R. K. Krishna Rao A Class of Prolog Programs Inferable from Positive Data , 1996, ALT.