Learning Auxiliary Fronting with Grammatical Inference

We present a simple context-free grammatical inference algorithm, and prove that it is capable of learning an interesting subclass of context-free languages. We also demonstrate that an implementation of this algorithm is capable of learning auxiliary fronting in polar interrogatives (AFIPI) in English. This has been one of the most important test cases in language acquisition over the last few decades. We demonstrate that learning can proceed even in the complete absence of examples of particular constructions, and thus that debates about the frequency of occurrence of such constructions are irrelevant. We discuss the implications of this on the type of innate learning biases that must be hypothesized to explain first language acquisition.

[1]  Robert C. Berwick,et al.  Reversible Automata and Induction of the English Auxiliary System , 1985, ACL.

[2]  Colin de la Higuera,et al.  Characteristic Sets for Polynomial Grammatical Inference , 1997, Machine Learning.

[3]  Noam Chomsky,et al.  The Logical Structure of Linguistic Theory , 1975 .

[4]  Menno van Zaanen,et al.  ABL: Alignment-Based Learning , 2000, COLING.

[5]  Dan Klein,et al.  Corpus-Based Induction of Syntactic Structure: Models of Dependency and Constituency , 2004, ACL.

[6]  Takashi Yokomori,et al.  Polynomial-time identification of very simple grammars from positive data , 2003, Theor. Comput. Sci..

[7]  Zellig S. Harris,et al.  Distributional Structure , 1954 .

[8]  Noah A. Smith,et al.  Contrastive Estimation: Training Log-Linear Models on Unlabeled Data , 2005, ACL.

[9]  S. Pinker,et al.  The nature of the language faculty and its implications for evolution of language (Reply to Fitch, Hauser, and Chomsky) , 2005, Cognition.

[10]  Barbara C. Scholz,et al.  Empirical assessment of stimulus poverty arguments , 2002 .

[11]  S. Crain,et al.  Structure dependence in grammar formation , 1987 .

[12]  Alexander Clark,et al.  Identification in the Limit of Substitutable Context-Free Languages , 2005, ALT.

[13]  Morten H. Christiansen,et al.  Structure Dependence in Language Acquisition: Uncovering the Statistical Richness of the Stimulus , 2004 .

[14]  Dan Klein,et al.  A Generative Constituent-Context Model for Improved Grammar Induction , 2002, ACL.

[15]  Dana Angluin,et al.  Inference of Reversible Languages , 1982, JACM.

[16]  Eytan Ruppin,et al.  Unsupervised learning of natural languages , 2006 .