Words and possible words in early language acquisition

In order to acquire language, infants must extract its building blocks-words-and master the rules governing their legal combinations from speech. These two problems are not independent, however: words also have internal structure. Thus, infants must extract two kinds of information from the same speech input. They must find the actual words of their language. Furthermore, they must identify its possible words, that is, the sequences of sounds that, being morphologically well formed, could be words. Here, we show that infants' sensitivity to possible words appears to be more primitive and fundamental than their ability to find actual words. We expose 12- and 18-month-old infants to an artificial language containing a conflict between statistically coherent and structurally coherent items. We show that 18-month-olds can extract possible words when the familiarization stream contains marks of segmentation, but cannot do so when the stream is continuous. Yet, they can find actual words from a continuous stream by computing statistical relationships among syllables. By contrast, 12-month-olds can find possible words when familiarized with a segmented stream, but seem unable to extract statistically coherent items from a continuous stream that contains minimal conflicts between statistical and structural information. These results suggest that sensitivity to word structure is in place earlier than the ability to analyze distributional information. The ability to compute nontrivial statistical relationships becomes fully effective relatively late in development, when infants have already acquired a considerable amount of linguistic knowledge. Thus, mechanisms for structure extraction that do not rely on extensive sampling of the input are likely to have a much larger role in language acquisition than general-purpose statistical abilities.

[1]  Casey Lew-Williams,et al.  Isolated words enhance statistical language learning in infancy. , 2011, Developmental science.

[2]  P. Jusczyk,et al.  The head-turn preference procedure for testing auditory perception , 1995 .

[3]  Morten H. Christiansen,et al.  Uncovering the Richness of the Stimulus: Structure Dependence and Indirect Statistical Evidence , 2005, Cogn. Sci..

[4]  Satsuki Nakai,et al.  Distinguishing novelty and familiarity effects in infant preference procedures , 2004 .

[5]  Elizabeth K. Johnson,et al.  Testing the limits of statistical learning for word segmentation. , 2010, Developmental science.

[6]  J. Mehler,et al.  Linguistic Constraints on Statistical Computations , 2005, Psychological science.

[7]  Eysenck Keane Cognitive Psychology, 6 , 2012 .

[8]  Antoni Rodríguez Fornells,et al.  Brain dynamics sustaining rapid rule extraction from speech , 2011 .

[9]  Nick Chater,et al.  Phonology impacts segmentation in online speech processing , 2005 .

[10]  R. Gómez,et al.  The Developmental Trajectory of Nonadjacent Dependency Learning. , 2005, Infancy : the official journal of the International Society on Infant Studies.

[11]  Anne Cutler,et al.  Recognition and Representation of Function Words in English-Learning Infants , 2006 .

[12]  Elizabeth K. Johnson,et al.  Gender-marked determiners help Dutch learners' word recognition when gender information itself does not. , 2011, Journal of child language.

[13]  Erik D. Thiessen,et al.  Discovering Words in Fluent Speech: The Contribution of Two Kinds of Statistical Information , 2013, Front. Psychology.

[14]  Jeffrey L. Elman,et al.  The Emergence of Language: A Conspiracy Theory , 2013 .

[15]  Sylvia Yuan,et al.  “Really? She Blicked the Baby?” , 2009, Psychological science.

[16]  Elizabeth K. Johnson,et al.  Statistical learning of tone sequences by human infants and adults , 1999, Cognition.

[17]  Elizabeth K. Johnson,et al.  Bootstrapping language: Are infant statisticians up to the job? , 2012 .

[18]  Maria Teresa Guasti,et al.  Language acquisition : the growth of grammar , 2002 .

[19]  Mark S. Seidenberg,et al.  Language Acquisition and Use: Learning and Applying Probabilistic Constraints , 1997, Science.

[20]  Edward Vul,et al.  Pure Reasoning in 12-Month-Old Infants as Probabilistic Inference , 2011, Science.

[21]  B Tillmann,et al.  Regularity of unit length boosts statistical learning in verbal and nonverbal artificial languages , 2013, Psychonomic bulletin & review.

[22]  P. Bloom How children learn the meanings of words , 2000 .

[23]  Paco Calvo,et al.  How Many Mechanisms Are Needed to Analyze Speech? A Connectionist Simulation of Structural Rule Learning in Artificial Language Acquisition , 2011, Cogn. Sci..

[24]  L. Gleitman,et al.  Propose but verify: Fast mapping meets cross-situational word learning , 2013, Cognitive Psychology.

[25]  J. Werker,et al.  Six-Month-Old Infants' Preference for Lexical Words , 2001, Psychological science.

[26]  R. Gómez,et al.  Twelve-Month-Old Infants Benefit From Prior Experience in Statistical Learning , 2008, Psychological science.

[27]  P. Jusczyk,et al.  Infants′ Detection of the Sound Patterns of Words in Fluent Speech , 1995, Cognitive Psychology.

[28]  E. Newport,et al.  Learning at a distance I. Statistical learning of non-adjacent dependencies , 2004, Cognitive Psychology.

[29]  Jessica F. Hay,et al.  Statistical learning in a natural language by 8-month-old infants. , 2009, Child development.

[30]  S Pinker,et al.  Overregularization in language acquisition. , 1992, Monographs of the Society for Research in Child Development.

[31]  Janet F. Werker,et al.  The basis of preference for lexical words in 6-month-old infants , 2003 .

[32]  Marina Nespor,et al.  An interaction between prosody and statistics in the segmentation of fluent speech , 2007, Cognitive Psychology.

[33]  A. Vouloumanos Fine-grained sensitivity to statistical information in adult word learning , 2008, Cognition.

[34]  S Pinker,et al.  Sensitivity of children's inflection to grammatical structure , 1994, Journal of Child Language.

[35]  E. Newport,et al.  Computation of Conditional Probability Statistics by 8-Month-Old Infants , 1998 .

[36]  J. Elman,et al.  Learning Rediscovered , 1996, Science.

[37]  Silvia Benavides-Varela,et al.  Consonants and vowels: different roles in early language acquisition. , 2011, Developmental science.

[38]  Ferran Pons,et al.  Structural generalizations over consonants and vowels in 11-month-old infants , 2010, Cognition.

[39]  LouAnn Gerken,et al.  Decisions, decisions: infant language learning when multiple generalizations are possible , 2006, Cognition.

[40]  Kathy Hirsh-Pasek,et al.  The Origins of Grammar: Evidence from Early Language Comprehension , 1999 .

[41]  R. Gómez,et al.  Artificial grammar learning by 1-year-olds leads to specific and abstract knowledge , 1999, Cognition.

[42]  Marieke van Heugten,et al.  Linking infants’ distributional learning abilities to natural language acquisition , 2010 .

[43]  S Pinker,et al.  Rules of language. , 1991, Science.

[44]  Elizabeth K. Johnson,et al.  At 11 months, prosody still outranks statistics. , 2009, Developmental science.

[45]  S. Waxman,et al.  What infants know about syntax but couldn't have learned: experimental evidence for syntactic structure at 18 months , 2003, Cognition.

[46]  J. Werker,et al.  Cross-language speech perception: Evidence for perceptual reorganization during the first year of life , 1984 .

[47]  Richard N Aslin,et al.  Statistical learning of new visual feature combinations by infants , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[48]  E. Newport,et al.  Science Current Directions in Psychological Statistical Learning : from Acquiring Specific Items to Forming General Rules on Behalf Of: Association for Psychological Science , 2022 .

[49]  Marina Nespor,et al.  How to hit Scylla without avoiding Charybdis: comment on Perruchet, Tyler, Galland, and Peereman (2004). , 2006, Journal of experimental psychology. General.

[50]  Richard N. Aslin,et al.  The Goldilocks Effect: Human Infants Allocate Attention to Visual Sequences That Are Neither Too Simple Nor Too Complex , 2012, PloS one.

[51]  A. Endress,et al.  Rapid learning of syllable classes from a perceptually continuous speech stream , 2007, Cognition.

[52]  Rachel A Robbins,et al.  A Review and Clarification of the Terms “holistic,” “configural,” and “relational” in the Face Perception Literature , 2012, Front. Psychology.

[53]  Charles D. Yang,et al.  Morphosyntactic Learning and the Development of Tense , 2007 .

[54]  Peter M. Vishton,et al.  Rule learning by seven-month-old infants. , 1999, Science.

[55]  LouAnn Gerken,et al.  Three Exemplars Allow at Least Some Linguistic Generalizations: Implications for Generalization Mechanisms and Constraints , 2008 .

[56]  Sharon Peperkamp,et al.  (Non)words, (non)words, (non)words: evidence for a protolexicon during the first year of life. , 2013, Developmental science.

[57]  Marina Nespor,et al.  Signal-Driven Computations in Speech Processing , 2002, Science.

[58]  Janet Dean Fodor,et al.  Children’s Acquisition of Syntax: Simple Models are Too Simple , 2012 .

[59]  Richard N Aslin,et al.  Prosody guides the rapid mapping of auditory word forms onto visual objects in 6-mo-old infants , 2011, Proceedings of the National Academy of Sciences.

[60]  R. Gómez Variability and Detection of Invariant Structure , 2002, Psychological science.

[61]  L. Gleitman,et al.  How words can and cannot be learned by observation , 2011, Proceedings of the National Academy of Sciences.

[62]  C. Fisher,et al.  Learning Words and Rules , 2006, Psychological science.

[63]  Michael C. Frank,et al.  Abstract Rule Learning for Visual Sequences in 8- and 11-Month-Olds. , 2009, Infancy : the official journal of the International Society on Infant Studies.

[64]  L. Bonatti,et al.  Finding words and word structure in artificial speech: the development of infants' sensitivity to morphosyntactic regularities* , 2014, Journal of Child Language.

[65]  Anne-Catherine Bachoud-Lévi,et al.  Different Neurophysiological Mechanisms Underlying Word and Rule Extraction from Speech , 2007, PloS one.

[66]  G. Altmann Learning and development in neural networks – the importance of prior experience , 2002, Cognition.

[67]  Morten H. Christiansen,et al.  The secret is in the sound: from unsegmented speech to lexical categories. , 2009, Developmental science.

[68]  Charles D. Yang Universal Grammar, statistics or both? , 2004, Trends in Cognitive Sciences.

[69]  Jill Lany,et al.  From Statistics to Meaning , 2010, Psychological science.

[70]  J. Werker,et al.  Developmental changes across childhood in the perception of non-native speech sounds. , 1983, Canadian journal of psychology.

[71]  Jutta L. Mueller,et al.  The Role of Pause Cues in Language Learning: The Emergence of Event-related Potentials Related to Sequence Processing , 2008, Journal of Cognitive Neuroscience.

[72]  Massimo Piattelli-Palmarini,et al.  Rich Languages From Poor Inputs , 2015 .

[73]  R. Peereman,et al.  Learning Nonadjacent Dependencies: No Need for Algebraic-like Computations Is It Possible to Learn the Relation between 2 Nonadjacent Events? , 2004 .

[74]  M. Goldsmith,et al.  Statistical Learning by 8-Month-Old Infants , 1996 .

[75]  D. Norris,et al.  The Possible-Word Constraint in the Segmentation of Continuous Speech , 1997, Cognitive Psychology.

[76]  Justin Halberda,et al.  Rapid fast-mapping abilities in 2-year-olds. , 2011, Journal of experimental child psychology.

[77]  S. Edelman,et al.  Learn locally, act globally: Learning language from variation set cues , 2008, Cognition.

[78]  J. Mehler,et al.  Finding Words and Rules in a Speech Stream , 2008, Psychological science.

[79]  Thierry Dutoit,et al.  The MBROLA project: towards a set of high quality speech synthesizers free of use for non commercial purposes , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[80]  J. Halberda,et al.  The development of a word-learning strategy , 2003, Cognition.

[81]  M. Nespor,et al.  Can prosody be used to discover hierarchical structure in continuous speech , 2012 .