Bootstrapping syntax from morpho-phonology

It has been a puzzle how the syntax of natural language could be learned from positive evidence alone. Here we present a hybrid neural-network model in which artificial syntactic categories are acquired through unsupervised competitive learning due to grouping together lexical words with consistent phonological endings. These relatively large syntactic categories then become target signals for a feed-forward error-reducing network that learns to pair these lexical items with smaller numbers of function words to form phrases. This hybrid model learns phrasal syntax from positive evidence alone, while covering the essential findings in recent experiments on adult humans learning an artificial language. The model further predicts generalization to novel lexical words (exceptions) from knowledge of function words.

[1]  Thomas R. Shultz,et al.  A Model of Infant Learning of Word Stress , 2005 .

[2]  Steven Pinker,et al.  Language learnability and language development , 1985 .

[3]  T. Shultz Computational Developmental Psychology , 2003 .

[4]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[5]  R. Gómez,et al.  Infant artificial language learning and language acquisition , 2000, Trends in Cognitive Sciences.

[6]  Janet L. McDonald,et al.  Properties of Phonological Markers That Affect the Acquisition of Gender-Like Subclasses☆☆☆★ , 1998 .

[7]  R. Gómez,et al.  A first step in form-based category abstraction by 12-month-old infants. , 2004, Developmental science.

[8]  Robert C. Berwick,et al.  The acquisition of syntactic knowledge , 1985 .

[9]  Shumeet Baluja,et al.  Reducing Network Depth in the Cascade-Correlation Learning Architecture, , 1994 .

[10]  Thomas R. Shultz,et al.  Neural Network Simulation of Infant Familiarization to Artificial Sentences: Rule-Like Behavior Without Explicit Rules and Variables. , 2001, Infancy : the official journal of the International Society on Infant Studies.

[11]  Thomas R. Shultz,et al.  Neural networks discover a near-identity relation to distinguish simple syntactic forms , 2006, Minds and Machines.

[12]  Jill Lany,et al.  The Role of Prior Experience in Language Acquisition , 2007, Cogn. Sci..

[13]  Nick Chater,et al.  What can be learned from positive data? Insights from an ‘ideal learner’. Commentary on ‘A Multiple process solution to the logical problem of language acquisition’ by Brian MacWhinney , 2004, Journal of Child Language.

[14]  Christian Lebiere,et al.  The Cascade-Correlation Learning Architecture , 1989, NIPS.