Testing the limits of statistical learning for word segmentation.

Past research has demonstrated that infants can rapidly extract syllable distribution information from an artificial language and use this knowledge to infer likely word boundaries in speech. However, artificial languages are extremely simplified with respect to natural language. In this study, we ask whether infants' ability to track transitional probabilities between syllables in an artificial language can scale up to the challenge of natural language. We do so by testing both 5.5- and 8-month-olds' ability to segment an artificial language containing four words of uniform length (all CVCV) or four words of varying length (two CVCV, two CVCVCV). The transitional probability cues to word boundaries were held equal across the two languages. Both age groups segmented the language containing words of uniform length, demonstrating that even 5.5-month-olds are extremely sensitive to the conditional probabilities in their environment. However, neither age group succeeded in segmenting the language containing words of varying length, despite the fact that the transitional probability cues defining word boundaries were equally strong in the two languages. We conclude that infants' statistical learning abilities may not be as robust as earlier studies have suggested.

[1]  Suzanne Curtin,et al.  PRIMIR: A Developmental Framework of Infant Speech Processing , 2005 .

[2]  J. Werker,et al.  Influences on infant speech processing: toward a new synthesis. , 1999, Annual review of psychology.

[3]  M. Brent,et al.  The role of exposure to isolated words in early vocabulary development , 2001, Cognition.

[4]  Steven Pinker,et al.  Language learnability and language development , 1985 .

[5]  Elizabeth K. Johnson,et al.  Word Segmentation by 8-Month-Olds: When Speech Cues Count More Than Statistics , 2001 .

[6]  L. Shockey,et al.  Phonological Processes in Speech Addressed to Children , 1980 .

[7]  Elizabeth K. Johnson,et al.  Exploring statistical learning by 8-month-olds : The role of complexity and variation , 2003 .

[8]  Amanda Seidl,et al.  Infant word segmentation revisited: edge alignment facilitates target extraction. , 2006, Developmental science.

[9]  Jenny R Saffran,et al.  Words in a sea of sounds: the output of infant statistical learning , 2001, Cognition.

[10]  Elizabeth K. Johnson,et al.  Boundary alignment enables 11-month-olds to segment vowel initial words from speech* , 2008, Journal of Child Language.

[11]  Elizabeth K. Johnson,et al.  Statistical learning of tone sequences by human infants and adults , 1999, Cognition.

[12]  J. Mehler,et al.  Phonological phrase boundaries constrain lexical access II. Infant data , 2004 .

[13]  Morten H. Christiansen,et al.  Stress changes the representational landscape: evidence from word segmentation , 2005, Cognition.

[14]  M. Goldsmith,et al.  Statistical Learning by 8-Month-Old Infants , 1996 .

[15]  Thierry Dutoit,et al.  The MBROLA project: towards a set of high quality speech synthesizers free of use for non commercial purposes , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[16]  J. Saffran,et al.  The Infant's Auditory World: Hearing, Speech, and the Beginnings of Language , 2007 .

[17]  P. Jusczyk,et al.  Six-month-olds' Detection of Clauses Embedded in Continuous Speech: Effects of Prosodic Well-formedness , 2022 .

[18]  Anne Cutler,et al.  Frequency and form as determinants of functor sensitivity in English-acquiring infants. , 2006, The Journal of the Acoustical Society of America.

[19]  Linda Shockey,et al.  Sound Patterns of Spoken English , 2003 .

[20]  Ann M. Peters,et al.  The Units of Language Acquisition , 1983 .

[21]  Erik D. Thiessen,et al.  Learning to Learn: Infants’ Acquisition of Stress-Based Strategies for Word Segmentation , 2007 .

[22]  Julia L. Evans,et al.  Can Infants Map Meaning to Newly Segmented Words? , 2007, Psychological science.

[23]  R. Reber,et al.  The use of Control Groups in Artificial Grammar Learning , 2003, The Quarterly journal of experimental psychology. A, Human experimental psychology.

[24]  Elizabeth K. Johnson,et al.  Infants use prosodically conditioned acoustic-phonetic cues to extract words from speech. , 2008, The Journal of the Acoustical Society of America.

[25]  Erik D. Thiessen,et al.  When cues collide: use of stress and statistical cues to word boundaries by 7- to 9-month-old infants. , 2003, Developmental psychology.

[26]  Charles D. Yang Universal Grammar, statistics or both? , 2004, Trends in Cognitive Sciences.

[27]  Elizabeth K. Johnson,et al.  Clause Segmentation by 6-Month-Old Infants: A Crosslinguistic Perspective , 2008 .

[28]  P. Jusczyk,et al.  Infants′ Detection of the Sound Patterns of Words in Fluent Speech , 1995, Cognitive Psychology.

[29]  Rebecca L. Gómez,et al.  Statistical learning in infant language development , 2007 .

[30]  Mary R. Newsome,et al.  The Beginnings of Word Segmentation in English-Learning Infants , 1999, Cognitive Psychology.

[31]  M. Brent Speech segmentation and word discovery: a computational perspective , 1999, Trends in Cognitive Sciences.

[32]  J. Morgan,et al.  Mommy and Me , 2005, Psychological science.

[33]  E. Newport,et al.  Computation of Conditional Probability Statistics by 8-Month-Old Infants , 1998 .

[34]  B. J. Winer Statistical Principles in Experimental Design , 1992 .

[35]  Erik D. Thiessen,et al.  Infant-Directed Speech Facilitates Word Segmentation. , 2005, Infancy : the official journal of the International Society on Infant Studies.

[36]  Elizabeth K. Johnson,et al.  At 11 months, prosody still outranks statistics. , 2009, Developmental science.

[37]  P. Jusczyk The discovery of spoken language , 1997 .

[38]  Melanie Soderstrom,et al.  Six-month-olds recognize clauses embedded in different passages of fluent speech , 2005 .

[39]  Marina Nespor,et al.  An interaction between prosody and statistics in the segmentation of fluent speech , 2007, Cognitive Psychology.

[40]  Daniel Swingley,et al.  Statistical clustering and the contents of the infant vocabulary , 2005, Cognitive Psychology.

[41]  A. Cutler,et al.  Rhythmic cues to speech segmentation: Evidence from juncture misperception , 1992 .