When Unbiased Probabilistic Learning Is Not Enough: Acquiring a Parametric System of Metrical Phonology

Parametric systems have been proposed as models of how humans represent knowledge about language, motivated in part as a way to explain children's rapid acquisition of linguistic knowledge. Given this, it seems reasonable to examine if children with knowledge of parameters could in fact acquire the adult system from the data available to them. That is, we explore an argument from acquisition for this knowledge representation. We use the English metrical phonology system as a nontrivial case study and test several computational models of unbiased probabilistic learners. Special attention is given to the modeled learners' input and the psychological plausibility of the model components in order to consider the learning problem from the perspective of children acquiring their native language. We find that such cognitively inspired unbiased probabilistic learners uniformly fail to acquire the English grammar proposed in recent metrical studies from English child-directed speech, suggesting that probabilistic learning alone is insufficient to acquire the correct grammar when using this parametric knowledge representation. Several potential sources of this failure are discussed, along with their implications for the parametric knowledge representation and the trajectory of acquisition for English metrical phonology.

[1]  Noam Chomsky,et al.  Lectures on Government and Binding , 1981 .

[2]  H. B. Barlow,et al.  Unsupervised Learning , 1989, Neural Computation.

[3]  R. Kingdon,et al.  The groundwork of English stress , 1958 .

[4]  B. Hayes Metrical Stress Theory: Principles and Case Studies , 1995 .

[5]  S. Suter Meaningful differences in the everyday experience of young American children , 2005, European Journal of Pediatrics.

[6]  Amy Weinberg,et al.  Necessary bias in natural language learning , 2007 .

[7]  B. MacWhinney The CHILDES project: tools for analyzing talk , 1992 .

[8]  Terrence J. Sejnowski,et al.  Unsupervised Learning , 2018, Encyclopedia of GIS.

[9]  Janet D. Fodor Unambiguous Triggers , 1998, Linguistic Inquiry.

[10]  Michael Wilson,et al.  MRC psycholinguistic database: Machine-usable dictionary, version 2.00 , 1988 .

[11]  Michael Tomasello,et al.  Acquiring Linguistic Constructions , 2007 .

[12]  Mary R. Newsome,et al.  The Beginnings of Word Segmentation in English-Learning Infants , 1999, Cognitive Psychology.

[13]  Janet Dean Fodor,et al.  Parsing to Learn , 1998 .

[14]  M. Halle,et al.  An essay on stress , 1987 .

[15]  Carla L. Hudson Kam,et al.  Regularizing Unpredictable Variation: The Roles of Adult and Child Learners in Language Formation and Change , 2005 .

[16]  P. Smolensky,et al.  Optimality Theory: Constraint Interaction in Generative Grammar , 2004 .

[17]  Judith G. Hochberg Learning Spanish Stress: Developmental and Theoretical Perspectives , 1988 .

[18]  Linda Wilbanks,et al.  IT Productivity = ?? , 2009, IT Prof..

[19]  Walter Daelemans,et al.  The Acquisition of Stress: A Data-Oriented Approach , 1994, Comput. Linguistics.

[20]  Amy Weinberg,et al.  Input Filtering in Syntactic Acquisition: Answers From Language Change Modeling , 2007 .

[21]  Lisa Pearl Putting the Emphasis on Unambiguous : The Feasibility of Data Filtering for Learning English Metrical Phonology , 2007 .

[22]  William Gregory Sakas,et al.  A Word-Order Database for Testing Computational Models of Language Acquisition , 2003, ACL.

[23]  P. Jusczyk,et al.  Infants′ Detection of the Sound Patterns of Words in Fluent Speech , 1995, Cognitive Psychology.

[24]  Jeffrey Lidz,et al.  When Domain-General Learning Fails and When It Succeeds: Identifying the Contribution of Domain Specificity , 2009 .

[25]  Thomas L. Griffiths,et al.  Distributional cues to word segmentation: Context is important , 2007 .

[26]  Joshua B. Tenenbaum,et al.  Indirect Evidence and the Poverty of the Stimulus: The Case of Anaphoric One , 2009, Cogn. Sci..

[27]  Charles D. Yang,et al.  Knowledge and learning in natural language , 2000 .

[28]  P. Jusczyk,et al.  Do English-Learning Infants use Syllable Weight to Determine Stress? , 1995, Language and speech.

[29]  Bezalel Elan Dresher,et al.  Charting the Learning Path: Cues to Parameter Setting , 1999, Linguistic Inquiry.

[30]  M. Brent,et al.  The role of exposure to isolated words in early vocabulary development , 2001, Cognition.

[31]  J. Tenenbaum,et al.  The learnability of abstract syntactic principles , 2011, Cognition.

[32]  Victor Chew,et al.  Point Estimation of the Parameter of the Binomial Distribution , 1971 .

[33]  P. Niyogi,et al.  A language learning model for finite parameter spaces , 1996, Cognition.

[34]  James L. McClelland,et al.  Unsupervised learning of vowel categories from infant-directed speech , 2007, Proceedings of the National Academy of Sciences.

[35]  Janet Dean Fodor,et al.  Language Acquisition and Learnability: The Structural Triggers Learner , 2001 .

[36]  Rebecca J. Panagos Meaningful Differences in the Everyday Experience of Young American Children , 1998 .

[37]  M. Halle,et al.  English stress : its form, its growth, and its role in verse , 1977 .

[38]  Walter Daelemans,et al.  Are Children ‘Lazy Learners’? A Comparison of Natural and Machine Learning of Stress , 2019, Proceedings of the Sixteenth Annual Conference of the Cognitive Science Society.

[39]  R. Clark The Selection of Syntactic Knowledge , 1992 .

[40]  Stephen Crain,et al.  Why language acquisition is a snap , 2002 .

[41]  Charles Yang,et al.  Word Segmentation: Quick but not Dirty , 2005 .

[42]  Robin Clark,et al.  Kolmogorov Complexity and the Information Content of Parameters , 1994 .

[43]  T. Regier,et al.  Learning the unlearnable: the role of missing evidence , 2004, Cognition.

[44]  N. Ratner Patterns of vowel modification in mother–child speech , 1984, Journal of Child Language.

[45]  D. Lightfoot The development of language , 1999 .

[46]  Nameera Akhtar,et al.  Learning antecedents for anaphoric one , 2004, Cognition.

[47]  B. Dresher,et al.  A computational learning model for metrical phonology , 1990, Cognition.

[48]  Margaret Kehoe,et al.  Stress error patterns in English-speaking children's word productions , 1997 .

[49]  Margaret Kehoe,et al.  Support for metrical stress theory in stress acquisition , 1998 .

[50]  Noam Chomsky,et al.  The Sound Pattern of English , 1968 .

[51]  William Gregory Sakas,et al.  Search, Structure or Statistics? A Comparative Study of Memoryless Heuristics for Syntax Acquisition , 2019, Proceedings of the Twenty-Fourth Annual Conference of the Cognitive Science Society.

[52]  R. Brown,et al.  A First Language , 1973 .

[53]  R. Gómez,et al.  A first step in form-based category abstraction by 12-month-old infants. , 2004, Developmental science.

[54]  R. R. Bush,et al.  A mathematical model for simple learning. , 1951, Psychological review.

[55]  Angela Baumann,et al.  Nine-Month-Olds' Attention to Sound Similarities in Syllables☆☆☆ , 1999 .