Language Learning with a Neighbor System

We consider inductive language learning from positive examples, some of which may be incorrect. In the present paper, the error or incorrectness we consider is the one described uniformly in terms of a distance over strings. Firstly, we introduce a notion of a recursively generable distance over strings, and define a k-neighbor closure of a language L as the collection of strings each of which is at most k distant from some string in L. Then we define a k-neighbor system as the collection of original languages and their j-neighbor closures with j ≤ k, and adopt it as a hypothesis space. In ordinary learning paradigm, a target language, whose examples are fed to an inference machine, is assumed to belong to a hypothesis space without any guarantee. In this paper, we allow an inference machine to infer a neighbor closure instead of the original language as an admissible approximation. We formalize such kind of inference, and give some sufficient conditions for a hypothesis space.

[1]  Sanjay Jain,et al.  Program Synthesis in the Presence of Infinite Number of Inaccuracies , 1994, J. Comput. Syst. Sci..

[2]  Setsuo Arikawa,et al.  Towards a Mathematical Theory of Machine Discovery from Facts , 1995, Theor. Comput. Sci..

[3]  Dana Angluin,et al.  Inductive Inference of Formal Languages from Positive Data , 1980, Inf. Control..

[4]  Dana Angluin,et al.  Finding Patterns Common to a Set of Strings , 1980, J. Comput. Syst. Sci..

[5]  John Case,et al.  Synthesizing Learners Tolerating Computable Noisy Data , 1998, ALT.

[6]  Takeshi Shinohara,et al.  The correct definition of finite elasticity: corrigendum to identification of unions , 1991, COLT '91.

[7]  Dana Angluin,et al.  Finding patterns common to a set of strings (Extended Abstract) , 1979, STOC.

[8]  Steffen Lange,et al.  Machine Discovery in the Presence of Incomplete or Ambiguous Data , 1994, AII/ALT.

[9]  Frank Stephan Noisy Inference and Oracles , 1997, Theor. Comput. Sci..

[10]  Yasuhito Mukouchi,et al.  REFUTABLE INFERENCE WITH A RESTRICTED TARGET CLASS , 1999 .

[11]  Yasuhito Mukouchi,et al.  Inductive Inference of an Approximate Concept from Positive Data , 1994, AII/ALT.

[12]  Satoshi Kobayashi,et al.  On Approximately Identifying Concept Classes in the Limit , 1995, ALT.

[13]  Masako Sato,et al.  INDUCTIVE INFERENCE OF FORMAL LANGUAGES , 1995 .

[14]  Keith Wright Identification of unions of languages drawn from an identifiable class , 1989, COLT '89.

[15]  E. Mark Gold,et al.  Language Identification in the Limit , 1967, Inf. Control..

[16]  Sanjay Jain Learning with Refutation , 1998, J. Comput. Syst. Sci..

[17]  Thomas Zeugmann,et al.  Types of monotonic language learning and their characterization , 1992, COLT '92.