The Onset of Syntactic Bootstrapping in Word Learning: Evidence from a Computational Study

The Onset of Syntactic Bootstrapping in Word Learning: Evidence from a Computational Study Afra Alishahi Computational Linguistics and Phonetics Saarland University, Germany afra@coli.uni-saarland.de Pirita Pyykk¨onen Computational Linguistics and Phonetics Saarland University, Germany pirita@coli.uni-saarland.de Abstract their language yet. A continuous picture of the develop- mental path of word learning is lacking. In this paper, we propose a novel approach to studying this problem. We use an existing computational model of early verb learning which incrementally learns syntactic constructions of language from usage data. We adapt this model to the task of identifying target words in novel sit- uations given different sets of (perceptual and linguistic) cues. Our results show that having access to linguistic in- formation significantly facilitates identifying verbs in later stages of learning, but no such effect is observed at the earlier stages. For identifying nouns, additional linguistic information does not affect performance at all. The syntactic bootstrapping hypothesis suggests that children’s verb learning is guided by the structural cues that the linguistic context provides. However, the onset of syntactic bootstrapping in word learning is not well studied. To investigate the impact of linguistic infor- mation on word learning during early stages of language acquisition, we use a computational model of learning syntactic constructions from usage data, and adapt it to the task of identifying target words in novel situations. Our results show that having access to linguistic infor- mation significantly improves performance in identifying verbs (but not nouns) in later stages of learning, yet no such effect can be observed in earlier stages. Introduction Learning verbs is a challenging task for young children: their early vocabulary contains many more nouns than verbs, and they learn new nouns easier than new verbs of the same frequency (e.g., Imai et al., 2005; Waxman, 2006). The acquisition of nouns is mainly attributed to cross-situational evidence, or regularities across different situations in which a noun is used (Quine, 1960). In con- trast, learning verbs seems to depend on the syntactic frames that they appear in. It has been suggested that children draw on syntactic cues that the linguistic context provides in verb learning, a hypothesis known as syntactic bootstrapping (Gleitman, 1990). According to this view, verbs are learned with a delay because the linguistic in- formation that supports their acquisition is not available during the early stages of language acquisition. To investigate the impact of linguistic and extralinguis- tic cues in identifying words, Gillette et al. (1999) pro- posed the Human Simulation Paradigm (HSP): adult par- ticipants watch videos of caregivers interacting with their toddlers, and are asked to identify target words marked by a beep. Videos are displayed without sound, and sub- jects are provided with different degrees of information about the linguistic context of the target verbs. Various HSP studies have shown that having access to linguis- tic and structural cues significantly improves the perfor- mance of adults in identifying verbs. Piccin & Waxman (2007) adopted the HSP paradigm for testing school-age children, and showed that children also rely on linguistic information for identifying verbs, but their performance is inferior to adults. These findings hint at a gradual de- velopment of syntactic bootstrapping, but it is uncertain whether the same effect can be observed in much younger children who have not mastered the syntactic structure of Time Course of Syntactic Bootstrapping Several preferential-looking studies have shown that chil- dren are sensitive to the structural regularities of lan- guage from a very young age, and that they use these structural cues to find the referent of a novel word (e.g., Naigles & Hoff-Ginsberg, 1995; Gertner et al., 2006). In a typical setup, children are given more than one interpreta- tion for an utterance (e.g., different activities displayed on parallel screens), and their looking behaviour reveals their preferred interpretation. However, these studies cannot compare the impact of different cues in word learning, since the same type of input is available to subjects in different conditions. In contrast, HSP studies manipulate the number and type of cues that subjects receive for performing a task across conditions, and thus evaluate the impact of each set of cues. In their influential study, Gillette et al. (1999) provided their adult subjects with various combinations of visual cues (videos), a list of co-occurring words, the syntactic pattern of the sentence, and the full transcript of the narration. Their findings and those of later studies have consistently shown that the more linguistic infor- mation adult subjects receive, the more accurately they identify missing verbs. Piccin & Waxman (2007) used HSP for studying seven- year-olds as well as adults. Subjects in each age group were randomly assigned to either ‘no linguistic informa- tion’ (-LI) or ‘full linguistic information’ (+LI) condition. In the -LI condition, participants heard no audio other than beeps indicating the target words. In the +LI con- dition, participants heard all the surrounding speech as well as the beeps. After watching each clip, subjects were asked to guess the target word (a noun or a verb). Their

[1]  A. Alishahi,et al.  Please Scroll down for Article Language and Cognitive Processes a Computational Model of Learning Semantic Roles from Child-directed Language a Computational Model of Learning Semantic Roles from Child-directed Language , 2022 .

[2]  J. Siskind A computational study of cross-situational techniques for learning word-to-meaning mappings , 1996, Cognition.

[3]  Amy Perfors,et al.  Joint acquisition of word order and word reference , 2009 .

[4]  B. MacWhinney The CHILDES project: tools for analyzing talk , 1992 .

[5]  Letitia R. Naigles,et al.  Input to verb learning: Evidence for the plausibility of syntactic bootstrapping , 1995 .

[6]  H. Gleitman,et al.  Human simulations of vocabulary learning , 1999, Cognition.

[7]  C. Fisher,et al.  Learning Words and Rules , 2006, Psychological science.

[8]  Sourabh Niyogi Bayesian Learning at the Syntax-Semantics Interface , 2002 .

[9]  L. Gleitman The Structural Sources of Verb Meanings , 2020, Sentence First, Arguments Afterward.

[10]  Manuela Friedrich,et al.  Early Word Learning , 2010 .

[11]  Sandra R. Waxman,et al.  Why Nouns Trump Verbs in Word Learning: New Evidence from Children and Adults in the Human Simulation Paradigm , 2007 .

[12]  Willard Van Orman Quine,et al.  Word and Object , 1960 .

[13]  I. Sigel,et al.  HANDBOOK OF CHILD PSYCHOLOGY , 2006 .

[14]  Afsaneh Fazly,et al.  Integrating Syntactic Knowledge into a Model of Cross-situational Word Learning , 2010 .

[15]  Chen Yu,et al.  Learning Syntax-Semantics Mappings to Bootstrap Word Learning , 2006 .

[16]  Afsaneh Fazly,et al.  A Probabilistic Computational Model of Cross-Situational Word Learning , 2010, Cogn. Sci..

[17]  Mutsumi Imai,et al.  Mapping novel nouns and verbs onto dynamic action events: are verb meanings easier to learn than noun meanings for Japanese children? , 2005, Child development.

[18]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .