Infants Rapidly Learn Words from Noisy Data via Cross-Situational Statistics Linda Smith (smith4@indiana.edu) Department of Psychological and Brain Sciences, and Cognitive Science Program, Indiana University Bloomington, IN 47405 USA Chen Yu (chenyu@indiana.edu) Department of Psychological and Brain Sciences, and Cognitive Science Program, Indiana University Bloomington, IN 47405 USA uncluttered as the experimental settings in which fast- mapping has been demonstrated. In everyday contexts, there are typically many words, many potential referents, limited cues as to which words go with which referents, and rapid attentional shifts among the many entities in the scene. It is possible that young learners just ignore the information in such highly ambiguous learning contexts and wait for contexts in which the referents of heard words are more certain (Brent & Siskind, 2001). However, a more optimal learner might be expected to make use of all the available data. Second, the evidence indicates that 9-, 10-, and certainly 12-month old infants are accumulating considerable receptive lexical knowledge (Fenson et al, 1994; Swingley & Aslin, 2000). Yet many studies find that children even as old as 18 months have difficulty in making he right inferences about the intended referents of novel words (e.g., Katz, Baker & Macnamara, 1974; Moore, Angepoulos B Hirsh-Pasek, Golinkoff & Hollich, 1999; Pruden, Hirsh-Pasek, Golinkoff & Hennon, 2006). There are studies showing that infants as young as 13 or 14 months (Woodward, Markman & Fitzsimmons, 1994; Woodward & Hoyne, 1999; Schafer & Plunkett, 1998; but perhaps not younger, Werker, Lloyd, Cohen, Cassola & Stager, 1998) can link a name to an object given repeated unambiguous pairings in a single session. Overall, however, these effects are fragile with small experimental variations often leading to no learning (see especially, Woodward & Hoyne, 1999; Werker, et al, 1998; also Oviatt 1980, 1982; and Bloom 2000 for a discussion). This raises the possibility that there might be some other way that young children learn word- referent mappings. The experiment reported here shows for the first time that infants rapidly learn multiple word-referent pairs by accruing statistical evidence across multiple and individually ambiguous word-scene pairings. The indeterminacy problem is solved not in a single trial but across trials, not for a single word and its referent but for a system of many words and referents. This learning is shown to be sufficiently rapid and robust that it could play a significant role in early lexical learning. Figure 1 illustrates how cross-trial statistics might work. The learner hears the unknown words “bat” and “ball” in the context of seeing a BAT and BALL. Without other information, the learner cannot know whether the word form “ball” refers to one or the other visual object. However, if subsequently, while viewing a scene with the potential Abstract First word learning should be difficult because any pairing of a word and scene presents the learner with an infinite number of possible referents. Accordingly, theorists of children’s rapid word learning have sought constraints on word-referent mappings. These constraints are thought to work by enabling learners to resolve the ambiguity inherent in any labeled scene to determine the speaker’s intended referent at that moment. The present study shows that 12- and 14-month old infants can resolve the uncertainty problem in another way, not by unambiguously deciding the referent in a single word-scene pairing, but by rapidly evaluating the statistical evidence across many individually ambiguous words and scenes. Keywords: language acquisition, word learning. Introduction The pairing of a word and a scene is not enough to determine the meaning of the word. To illustrate this point, Quine (1960) famously imagined a stranger who hears a native say “gavagai” and points to a scene. To what does “gavagai” refer -- a rabbit, the grass, a tree, the rabbit’s ears, or perhaps the beauty of the whole? Even if one assumes a perceptual system that segments the scene into separate objects and an attentional system biased towards objects, the intended referent is indeterminate from this one experience. Infants are like strangers who do not know the native language, yet they solve this indeterminacy problem. This paradox -- the uncertainty of the referent in word-scene associations and the fact that infants learn object names nonetheless --is a core theoretical problem in the study of early word learning. For the past 30 years most research on children’s word learning has concentrated on how the learner resolves the ambiguity at the moment the novel word is first encountered. Experimental studies leave no doubt that by the time they are 2 years old children do this at least for object names. That literature points to attentional (Smith, 2000), social (Baldwin 1993, Tomasello, 2000), linguistic (Gleitman, 1990) and representational (Markman, 1990) constraints as crucial to children’s ability to resolve referential ambiguity and fastmap a word to its intended referent. There are two reasons to suspect that this one-encounter solution to referential uncertainty is not the only (or even the most important) mechanism of early word learning. First, not all opportunities for word learning are as
[1]
James F. Knutson,et al.
Unsupervised Concept Learning and Value Systematicity: A Complex Whole Aids Learning the Parts
,
1996
.
[2]
Dare A. Baldwin,et al.
Early referential understanding: Infants' ability to recognize referential acts for what they are.
,
1993
.
[3]
A. Woodward,et al.
Infants' learning about words and sounds in relation to objects.
,
1999,
Child development.
[4]
P. Bloom.
How children learn the meanings of words
,
2000
.
[5]
R. Aslin,et al.
Spoken word recognition and lexical representation in very young children
,
2000,
Cognition.
[6]
Kim Plunkett,et al.
Theories of early language acquisition
,
1997,
Trends in Cognitive Sciences.
[7]
C. Moore,et al.
Word learning in the context of referential and salience cues.
,
1998,
Developmental psychology.
[8]
Kathy Hirsh-Pasek,et al.
The birth of words: ten-month-olds learn words through perceptual salience.
,
2006,
Child development.
[9]
S. Suter.
Meaningful differences in the everyday experience of young American children
,
2005,
European Journal of Pediatrics.
[10]
S L Oviatt,et al.
The emerging ability to comprehend language: an experimental approach.
,
1980,
Child development.
[11]
Linda B. Smith,et al.
Shifting ontological boundaries: how Japanese‐ and English‐speaking children generalize names for animals and artifacts
,
2003
.
[12]
John Macnamara,et al.
What's in a name? A study of how children learn common and proper names.
,
1974
.
[13]
Rebecca J. Panagos.
Meaningful Differences in the Everyday Experience of Young American Children
,
1998
.
[14]
K Plunkett,et al.
Rapid word learning by fifteen-month-olds under tightly controlled conditions.
,
1998,
Child development.
[15]
Chen Yu,et al.
A Statistical Associative Account of Vocabulary Growth in Early Word Learning
,
2008
.
[16]
Ellen M. Markman,et al.
Constraints Children Place on Word Meanings
,
1990,
Cogn. Sci..
[17]
J. Kruschke.
Toward a unified model of attention in associative learning
,
2001
.
[18]
M. Casasola,et al.
Acquisition of word-object associations by 14-month-old infants.
,
1998,
Developmental psychology.
[19]
R. Gómez,et al.
Artificial grammar learning by 1-year-olds leads to specific and abstract knowledge
,
1999,
Cognition.
[20]
J. Kruschke,et al.
Eye gaze and individual differences consistent with learned attention in associative blocking and highlighting.
,
2005,
Journal of experimental psychology. Learning, memory, and cognition.
[21]
K. Hirsh-Pasek,et al.
Trends and Transitions in Language Development: Looking for the Missing Piece
,
1999
.
[22]
K. Hirsh-Pasek,et al.
The eyes have it: lexical and syntactic comprehension in a new paradigm
,
1987,
Journal of Child Language.
[23]
M. Brent,et al.
The role of exposure to isolated words in early vocabulary development
,
2001,
Cognition.
[24]
S. Levinson,et al.
Language Acquisition and Conceptual Development
,
2001
.
[25]
L. Gleitman.
The Structural Sources of Verb Meanings
,
2020,
Sentence First, Arguments Afterward.
[26]
Chen Yu,et al.
The Role of Embodied Intention in Early Lexical Acquisition
,
2005,
Cogn. Sci..
[27]
Scott P. Johnson,et al.
Visual statistical learning in infancy: evidence for a domain general learning mechanism
,
2002,
Cognition.
[28]
D. Gentner,et al.
Making a silk purse out of two sow's ears: young children's use of comparison in category learning.
,
2002,
Journal of experimental psychology. General.
[29]
J. Siskind.
A computational study of cross-situational techniques for learning word-to-meaning mappings
,
1996,
Cognition.
[30]
Sharon L. Oviatt,et al.
Inferring What Words Mean: Early Development in Infants' Comprehension of Common Object Names.
,
1982
.
[31]
R N Aslin,et al.
Statistical Learning by 8-Month-Old Infants
,
1996,
Science.
[32]
Padraic Monaghan,et al.
Proceedings of the 23rd annual conference of the cognitive science society
,
2001
.
[33]
N. Akhtar,et al.
Early lexical acquisition: the role of cross-situational learning
,
1999
.
[34]
M. Tomasello.
Perceiving intentions and learning words in the second year of life
,
2000
.
[35]
Nameera Akhtar.
Relevance and early word learning
,
2002,
Journal of Child Language.
[36]
James F. Knutson,et al.
Unsupervised concept learning and value systematicity: a complex whole aids learning the parts.
,
1996,
Journal of experimental psychology. Learning, memory, and cognition.
[37]
Linda B. Smith,et al.
Rapid Word Learning Under Uncertainty via Cross-Situational Statistics
,
2007,
Psychological science.
[38]
J. Tenenbaum,et al.
Word learning as Bayesian inference.
,
2007,
Psychological review.
[39]
Willard Van Orman Quine,et al.
Word and Object
,
1960
.
[40]
M. Tomasello,et al.
Variability in early communicative development.
,
1994,
Monographs of the Society for Research in Child Development.
[41]
E. Markman,et al.
Rapid Word Learning in 13- and 18-Month-Olds.
,
1994
.
[42]
Linda B. Smith,et al.
Hypothesis Testing and Associative Learning in Cross-Situational Word Learning: Are They One and the Same?
,
2007
.