How Data Drive Early Word Learning: A Cross-Linguistic Waiting Time Analysis

The extent to which word learning is delayed by maturation as opposed to accumulating data is a longstanding question in language acquisition. Further, the precise way in which data influence learning on a large scale is unknown—experimental results reveal that children can rapidly learn words from single instances as well as by aggregating ambiguous information across multiple situations. We analyze Wordbank, a large cross-linguistic dataset of word acquisition norms, using a statistical waiting time model to quantify the role of data in early language learning, building off Hidaka (2013). We find that the model both fits and accurately predicts the shape of children’s growth curves. Further analyses of model parameters suggest a primarily data-driven account of early word learning. The parameters of the model directly characterize both the amount of data required and the rate at which informative data occurs. With high statistical certainty, words require on the order of ∼ 10 learning instances, which occur on average once every two months. Our method is extremely simple, statistically principled, and broadly applicable to modeling data-driven learning effects in development.

[1]  M. Goldsmith,et al.  Statistical Learning by 8-Month-Old Infants , 1996 .

[2]  Anna L. Theakston,et al.  The ubiquity of frequency effects in first language acquisition , 2015, Journal of Child Language.

[3]  Stefanie Tellex,et al.  The Human Speechome Project , 2006, EELC.

[4]  Justin Halberda,et al.  Rapid fast-mapping abilities in 2-year-olds. , 2011, Journal of experimental child psychology.

[5]  L. Markson,et al.  Evidence against a dedicated system for word learning in children , 1997, Nature.

[6]  Linda B. Smith,et al.  Infants rapidly learn word-referent mappings via cross-situational statistics , 2008, Cognition.

[7]  Anne Fernald,et al.  Talking to Children Matters , 2013, Psychological science.

[8]  George Kingsley Zipf,et al.  Human behavior and the principle of least effort , 1949 .

[9]  B. MacWhinney The CHILDES project: tools for analyzing talk , 1992 .

[10]  Brendan T. Johns,et al.  The role of semantic diversity in lexical organization. , 2012, Canadian journal of experimental psychology = Revue canadienne de psychologie experimentale.

[11]  John K Kruschke,et al.  Bayesian data analysis. , 2010, Wiley interdisciplinary reviews. Cognitive science.

[12]  Susan Carey,et al.  Acquiring a Single New Word , 1978 .

[13]  T. Kushnir,et al.  Rational constructivism in cognitive development , 2012 .

[14]  Richard N Aslin,et al.  The Goldilocks effect in infant auditory attention. , 2014, Child development.

[15]  Richard N. Aslin,et al.  The Goldilocks Effect: Human Infants Allocate Attention to Visual Sequences That Are Neither Too Simple Nor Too Complex , 2012, PloS one.

[16]  Mike Frank,et al.  Large-scale investigations of variability in children's first words , 2015, CogSci.

[17]  K. Stevens,et al.  Linguistic experience alters phonetic perception in infants by 6 months of age. , 1992, Science.

[18]  D. Swingley,et al.  At 6–9 months, human infants know the meanings of many common nouns , 2012, Proceedings of the National Academy of Sciences.

[19]  P. D. Eimas,et al.  Speech Perception in Infants , 1971, Science.

[20]  Tom Lodewyckx,et al.  Bayesian Versus Frequentist Inference , 2008 .

[21]  Roger W. Brown,et al.  A First Language: The Early Stages , 1974 .

[22]  Larissa K. Samuelson,et al.  Fast Mapping but Poor Retention by 24-Month-Old Infants. , 2008, Infancy : the official journal of the International Society on Infant Studies.

[23]  A. Bryk,et al.  Early vocabulary growth: Relation to language input and gender. , 1991 .

[24]  Bob McMurray,et al.  Defusing the Childhood Vocabulary Explosion , 2007, Science.

[25]  E. Markman,et al.  Word learning in children: an examination of fast mapping. , 1987, Child development.

[26]  Michael C. Frank,et al.  Wordbank: an open repository for developmental vocabulary data* , 2016, Journal of Child Language.

[27]  S. Hidaka A Computational Model Associating Learning Process, Word Attributes, and Age of Acquisition , 2013, PloS one.

[28]  S. Levine,et al.  What counts as effective input for word learning?* , 2012, Journal of Child Language.

[29]  K. Wexler,et al.  Semantic and Pragmatic LanguageDevelopment: Children Know 'That' Better , 2007 .

[30]  H. Storkel Learning New Words , 2001 .

[31]  S. Carey The Origin of Concepts , 2000 .

[32]  Virginia A. Marchman,et al.  MacArthur-Bates Communicative Development Inventories , 2006 .

[33]  Ellen M. Markman,et al.  Constraints Children Place on Word Meanings , 1990, Cogn. Sci..

[34]  E. Hoff The specificity of environmental influence: socioeconomic status affects early vocabulary development via maternal speech. , 2003, Child development.

[35]  Hagit Borer,et al.  The Maturation of Syntax , 1987 .

[36]  Elissa L. Newport,et al.  Maturational Constraints on Language Learning , 1990, Cogn. Sci..

[37]  Martyn Plummer,et al.  JAGS: A program for analysis of Bayesian graphical models using Gibbs sampling , 2003 .

[38]  Fei Xu Rational Statistical Inference and Cognitive Development , 2008 .