Bootstrapping language: Are infant statisticians up to the job?

Over the years, spoken language acquisition has attracted the attention of intellects from many disciplines. After much debate, two facts are apparent. On the one hand, it is clear that the ability to learn a language must be at least partially built in to our human psyche. Even chimps, our closest evolutionary cousins, fail to learn spoken language the way human children do. This is true even if they are cared for and spoken to as if they were human children (Terrace, Petitto, Sanders, and Bever 1979; see, however, Savage-Rumbaugh and Fields 2000). On the other hand, it is also clear that human language acquisition crucially depends on experience. Infants exposed to French learn French, infants exposed to Swahili learn Swahili (see also Curtiss 1977). But what must be inherited and what must be learned? And how do infants learn what they need to learn? All current models of language acquisition represent di¤erent answers to these very basic questions. Currently, many of the most popular models of early language acquisition are what could generally be described as distributional models. Most of these models place a heavy burden on the learning capabilities of prelingual infants. Some researchers working within the distributional framework strive to show how much children could accomplish in the absence of innate linguistic knowledge (e.g. Elman 1999), whereas others still emphasize the importance of linguistically-motivated constraints or expectations in statistical learning (e.g. Gervain, Nespor, Mazuka, Horie, and Mehler 2008; Mehler, Pena, Nespor, and Bonatti 2006; Yang 2004). The main focus of this chapter will be to examine distributional learning with the goal of better understanding what exactly we do and do not know about the ability of infants to extract linguistic generalizations from the speech signal. Although the discussion presented in this chapter is meant to apply to many levels of spoken language acquisition (e.g. phonology, morphology, syntax), the examples used to illustrate my points will be drawn primarily from the infant word segmentation literature. This

[1]  Joost van de Weijer,et al.  Word segmentation by 7.5-month-olds: Three words do not equal one. , 2001 .

[2]  Elissa L. Newport,et al.  Statistical Learning of Syntax: The Role of Transitional Probability , 2007 .

[3]  Anne Christophe Teresa Guasti Marina Nespor Reflections on Phonological Bootstrapping: Its Role for Lexical and Syntactic Acquisition , 1997 .

[4]  Jessica F. Hay,et al.  Statistical learning in a natural language by 8-month-old infants. , 2009, Child development.

[5]  T. A. Cartwright,et al.  Distributional regularity and phonotactic constraints are useful for segmentation , 1996, Cognition.

[6]  P. Jusczyk,et al.  English-learning infants’ segmentation of trisyllabic words from fluent speech , 2004 .

[7]  Erik D. Thiessen,et al.  Learning to Learn: Infants’ Acquisition of Stress-Based Strategies for Word Segmentation , 2007 .

[8]  J. Mullennix,et al.  Some consequences of stimulus variability on speech processing by 2-month-old infants , 1992, Cognition.

[9]  Kristine H. Onishi,et al.  Infants learn phonotactic regularities from brief auditory experience , 2003, Cognition.

[10]  J. Nadal,et al.  The acquisition of allophonic rules: Statistical learning with linguistic constraints , 2006, Cognition.

[11]  P. Jusczyk,et al.  Sensitivity to discontinuous dependencies in language learners: evidence for limitations in processing space , 1998, Cognition.

[12]  Mary R. Newsome,et al.  The Beginnings of Word Segmentation in English-Learning Infants , 1999, Cognitive Psychology.

[13]  R. Gómez,et al.  The Developmental Trajectory of Nonadjacent Dependency Learning. , 2005, Infancy : the official journal of the International Society on Infant Studies.

[14]  J R Saffran,et al.  Emerging integration of sequential and suprasegmental information in preverbal speech segmentation. , 1995, Child development.

[15]  Elizabeth K. Johnson English-Learning Infants' Representations of Word Forms With Iambic Stress. , 2005, Infancy.

[16]  Elizabeth K. Johnson,et al.  Infants exposed to fluent natural speech succeed at cross-gender word recognition. , 2012, Journal of speech, language, and hearing research : JSLHR.

[17]  P K Kuhl,et al.  Perceptual strategies in prelingual speech segmentation , 1993, Journal of Child Language.

[18]  P. Kuhl Speech perception in early infancy: perceptual constancy for spectrally dissimilar vowel categories. , 1979, The Journal of the Acoustical Society of America.

[19]  Marina Nespor,et al.  How to hit Scylla without avoiding Charybdis: comment on Perruchet, Tyler, Galland, and Peereman (2004). , 2006, Journal of experimental psychology. General.

[20]  A. Cutler,et al.  Processing resyllabified words in French , 2003 .

[21]  J. Weissenborn,et al.  The Recognition of Discontinuous Verbal Dependencies by German 19-Month-Olds: Evidence for Lexical and Structural Influences on Children's Early Processing Capacities , 2006 .

[22]  Pierre Perruchet,et al.  A role for backward transitional probabilities in word segmentation? , 2008, Memory & cognition.

[23]  Rochelle S Newman,et al.  Infants’ listening in multitalker environments: Effect of the number of background talkers , 2009, Attention, perception & psychophysics.

[24]  Rushen Shi,et al.  French-learning toddlers use gender information on determiners during word recognition. , 2009, Developmental science.

[25]  Elizabeth K. Johnson,et al.  Word Segmentation by 8-Month-Olds: When Speech Cues Count More Than Statistics , 2001 .

[26]  Denis Burnham,et al.  The developmental course of lexical tone perception in the first year of life , 2008, Cognition.

[27]  Chandan R. Narayan,et al.  The interaction between acoustic salience and language experience in developmental speech perception: evidence from nasal place discrimination. , 2010, Developmental science.

[28]  Michael C. Frank,et al.  Modeling Human Performance on Statistical Word Segmentation Tasks , 2007 .

[29]  C. Anton Rytting,et al.  Segmenting words from natural speech: subsegmental variation in segmental cues. , 2010, Journal of child language.

[30]  Elissa L. Newport,et al.  The role of constituent structure in the induction of an artificial language , 1981 .

[31]  Erik D. Thiessen,et al.  Infant-Directed Speech Facilitates Word Segmentation. , 2005, Infancy : the official journal of the International Society on Infant Studies.

[32]  M. Tomasello Do young children have adult syntactic competence? , 2000, Cognition.

[33]  Keren B. Shatzman,et al.  Prosodic Knowledge Affects the Recognition of Newly Acquired Words , 2006, Psychological science.

[34]  D. Gow Does English coronal place assimilation create lexical ambiguity , 2002 .

[35]  Elizabeth K. Johnson,et al.  Infants use prosodically conditioned acoustic-phonetic cues to extract words from speech. , 2008, The Journal of the Acoustical Society of America.

[36]  Dani Byrd,et al.  Discovering Speech, Words, and Mind , 2010 .

[37]  Zellig S. Harris,et al.  From Phoneme to Morpheme , 1955 .

[38]  Elizabeth K. Johnson,et al.  Clause Segmentation by 6-Month-Old Infants: A Crosslinguistic Perspective , 2008 .

[39]  D. Norris,et al.  The Possible-Word Constraint in the Segmentation of Continuous Speech , 1997, Cognitive Psychology.

[40]  Mark S. Seidenberg,et al.  Connecting cues: overlapping regularities support cue discovery in infancy. , 2010, Child development.

[41]  Charles D. Yang Universal Grammar, statistics or both? , 2004, Trends in Cognitive Sciences.

[42]  Jessica F. Hay,et al.  Learning in reverse: Eight-month-old infants track backward transitional probabilities , 2009, Cognition.

[43]  J. Mehler,et al.  Bootstrapping word order in prelexical infants: A Japanese–Italian cross-linguistic study , 2008, Cognitive Psychology.

[44]  Erik D. Thiessen,et al.  Pattern induction by infant language learners. , 2003, Developmental psychology.

[45]  R. Aslin,et al.  Infants are sensitive to within-category variation in speech perception , 2005, Cognition.

[46]  Morten H. Christiansen,et al.  Stress changes the representational landscape: evidence from word segmentation , 2005, Cognition.

[47]  Pierre Perruchet,et al.  "Do we need algebraic-like computations? A reply to Bonatti, Pena, Nespor, and Mehler ": Correction to Perruchet et al. , 2006 .

[48]  Marina Nespor,et al.  The “Soul” of Language does not use Statistics: Reflections on Vowels and Consonants , 2006, Cortex.

[49]  J. Werker,et al.  Cross-language speech perception: Evidence for perceptual reorganization during the first year of life , 1984 .

[50]  Naomi Feldman,et al.  The learner as statistician: three principles of computational success in language acquisition. , 2009, Developmental science.

[51]  Elizabeth K. Johnson,et al.  Statistical learning of tone sequences by human infants and adults , 1999, Cognition.

[52]  E. Newport,et al.  Learning at a distance I. Statistical learning of non-adjacent dependencies , 2004, Cognitive Psychology.

[53]  Linda B. Smith,et al.  Infants rapidly learn word-referent mappings via cross-situational statistics , 2008, Cognition.

[54]  S. Curtiss Genie: A Psycholinguistic Study of a Modern-Day "Wild Child" , 1977 .

[55]  Elizabeth K. Johnson,et al.  Developmental Changes in Infants' Ability to Cope with Dialect Variation in Word Recognition. , 2010, Infancy : the official journal of the International Society on Infant Studies.

[56]  Megha Sundara,et al.  Rhythmic distance between languages affects the development of speech perception in bilingual infants , 2011, J. Phonetics.

[57]  P. Jusczyk The discovery of spoken language , 1997 .

[58]  Jessica Maye,et al.  Infant sensitivity to distributional information can affect phonetic discrimination , 2002, Cognition.

[59]  James L. Morgan,et al.  Is early word-form processing stress-full? How natural variability supports recognition , 2010, Cognitive Psychology.

[60]  Toben H. Mintz Frequent frames as a cue for grammatical categories in child directed speech , 2003, Cognition.

[61]  B. McMurray,et al.  Speaker variability augments phonological processing in early word learning. , 2009, Developmental science.

[62]  J. Saffran,et al.  From Syllables to Syntax: Multilevel Statistical Learning by 12-Month-Old Infants , 2003 .

[63]  R. Gómez,et al.  Infant artificial language learning and language acquisition , 2000, Trends in Cognitive Sciences.

[64]  H. Terrace,et al.  Can an ape create a sentence? , 1979, Science.

[65]  A. Vinter,et al.  PARSER: A Model for Word Segmentation , 1998 .

[66]  Anne Cutler,et al.  Frequency and form as determinants of functor sensitivity in English-acquiring infants. , 2006, The Journal of the Acoustical Society of America.

[67]  J. B. Trobalon,et al.  Statistical computations over a speech stream in a rodent , 2005, Perception & psychophysics.

[68]  Morten H. Christiansen,et al.  The phonological-distributional coherence hypothesis: Cross-linguistic evidence in language acquisition , 2007, Cognitive Psychology.

[69]  J. Saffran The Use of Predictive Dependencies in Language Learning , 2001 .

[70]  Marc D. Hauser,et al.  Word segmentation with universal prosodic cues , 2010, Cognitive Psychology.

[71]  P. Jusczyk,et al.  Do infants segment words or recurring contiguous patterns? , 2001, Journal of experimental psychology. Human perception and performance.

[72]  Pienie Zwitserlood,et al.  Syntactic structure and artificial grammar learning: The learnability of embedded hierarchical structures , 2008, Cognition.

[73]  Erik D. Thiessen,et al.  When cues collide: use of stress and statistical cues to word boundaries by 7- to 9-month-old infants. , 2003, Developmental psychology.

[74]  Daniel Swingley,et al.  Statistical clustering and the contents of the infant vocabulary , 2005, Cognitive Psychology.

[75]  Eleanor Olds Batchelder,et al.  Bootstrapping the lexicon: A computational model of infant speech segmentation , 2002, Cognition.

[76]  Elizabeth K. Johnson,et al.  Testing the limits of statistical learning for word segmentation. , 2010, Developmental science.

[77]  Eugene Buckley,et al.  On the Learning of Arbitrary Phonological Rules , 2005 .

[78]  Virginia Valian,et al.  Anchor points in language learning: The role of marker frequency ☆ , 1988 .

[79]  Keith Johnson Massive reduction in conversational American English , 2004 .

[80]  P. Kuhl,et al.  Perception of native and non-native affricate-fricative contrasts: cross-language tests on adults and infants. , 2006, The Journal of the Acoustical Society of America.

[81]  R. Aslin What's in a look? , 2007, Developmental science.

[82]  Linda Shockey,et al.  Sound Patterns of Spoken English , 2003 .

[83]  Marieke van Heugten,et al.  Linking infants’ distributional learning abilities to natural language acquisition , 2010 .

[84]  Katherine S White,et al.  Preference and processing: The role of speech affect in early spoken word recognition , 2004 .

[85]  J. McQueen,et al.  Possible words and fixed stress in the segmentation of Slovak speech , 2010, Quarterly journal of experimental psychology.

[86]  Jacques Mehler,et al.  Primitive computations in speech processing , 2009, Quarterly journal of experimental psychology.

[87]  N. Chater,et al.  Bootstrapping Word Boundaries: A Bottom-up Corpus-Based Approach to Speech Segmentation , 1997, Cognitive Psychology.

[88]  Elizabeth K. Johnson,et al.  Lexical viability constraints on speech segmentation by infants , 2003, Cognitive Psychology.

[89]  Ted Briscoe,et al.  Models of continuous speech recognition and the contents of the vocabulary , 1995 .

[90]  E. Newport,et al.  Computation of Conditional Probability Statistics by 8-Month-Old Infants , 1998 .