Learning Shallow Context-free Languages under Simple Distributions

In this paper I present the EMILE 3.0 algorithm 1. It can learn shallow context-free grammars eeciently. It does so under circumstances that, from a perspective of complexity, come resonably close to the conditions under which human beings learn a language. A language is shallow in its descriptive length if all the relevant constructions we need to know to learn it have a complexity that is logarithmic in the descriptive length of the grammar as a whole. I claim that natural languages are shallow and that shallowness in itself is an interesting general constraint in the context of formal learning theory. I also introduce the concept of sepa-rability. A language is separable if the validity of rules can be tested by means of memberbership queries using simple substitutions. Although I do not believe that human languages are strictly separable I do think it is an important aspect of eecient human communication. I illustrate the structure of the algorithm by means of an extensive example, and I indicate how a careful analysis of the bias involved in natural language learning may lead to versions of EMILE 3.0 that can be implemented in real-life dialogue systems. I claim that the EMILE approach could serve as a valuable metamodel for evaluating clustering approaches to language learning. The results of EMILE are mainly theoretical. They 1 The original acronym stands for Entity Modeling Intelligent Learning Engine. It refers to earlier versions of EMILE that also had semantic capacities. The name EMILE is also motivated by the book on education by J.-J. Rousseau. In order to distinguishseveral versions of EMILE I started recently to give them version numbers. Versions 1 and 2 are discussed in Adriaans, 1992a. 1 2 / Pieter W. Adriaans could potentially also be used to develop more eecient variants of other approaches to language learning.

[1]  E. Mark Gold,et al.  Language Identification in the Limit , 1967, Inf. Control..

[2]  Jeffrey D. Ullman,et al.  Introduction to Automata Theory, Languages and Computation , 1979 .

[3]  Leslie G. Valiant,et al.  A theory of the learnable , 1984, CACM.

[4]  J. Lambek,et al.  Categorial and Categorical Grammars , 1988 .

[5]  W. Buszkowski Generative Power of Categorial Grammars , 1988 .

[6]  Naoki Abe Polynominal Learnability and Locality of Formal Grammars , 1988, ACL.

[7]  Emmon W. Bach,et al.  Categorial Grammars and Natural Language Structures , 1988 .

[8]  Yasubumi Sakakibara,et al.  Learning context-free grammars from structural data in polynomial time , 1988, COLT '88.

[9]  Takashi Yokomori Learning Context-Free Languages Efficiently - A Report on Recent Results in Japan , 1989, AII.

[10]  Leonard Pitt,et al.  Inductive Inference, DFAs, and Computational Complexity , 1989, AII.

[11]  Gerald Penn,et al.  Categorial grammars determined from linguistic data by unification , 1990, Stud Logica.

[12]  Andrew Radford,et al.  Syntactic Theory and the Acquisition of English Syntax: The Nature of Early Child Grammars of English , 1990 .

[13]  Ming Li,et al.  Learning Simple Concept Under Simple Distributions , 1991, SIAM J. Comput..

[14]  Balas K. Natarajan,et al.  Machine Learning: A Theoretical Approach , 1992 .

[15]  Robert L. Mercer,et al.  Class-Based n-gram Models of Natural Language , 1992, CL.

[16]  Mati Pentus,et al.  Lambek grammars are context free , 1993, [1993] Proceedings Eighth Annual IEEE Symposium on Logic in Computer Science.

[17]  G. Marcus Negative evidence in language acquisition , 1993, Cognition.

[18]  Ming Li,et al.  An Introduction to Kolmogorov Complexity and Its Applications , 2019, Texts in Computer Science.

[19]  Eugene Charniak,et al.  Statistical language learning , 1997 .

[20]  Marc M. Lankhorst Grammatical Inference with a Genetic Algorithm , 1994, EUROSIM.

[21]  Timo Honkela,et al.  Contextual Relations of Words in Grimm Tales, Analyzed by Self-Organizing Map , 1995 .

[22]  Hiroshi Sakamoto,et al.  Language Learning from Membership Queries and Characteristic Examples , 1995, ALT.

[23]  Makoto Kanazawa Learnable Classes of Categorial Grammars , 1998 .

[24]  Yasubumi Sakakibara,et al.  Learning Context-Free Grammars from Partially Structured Examples , 2000, ICGI.