Child Acquisition of Multiword Verbs: A Computational Investigation

Traditional theories of grammar, as well as computational modelling of language acquisition, have focused either on aspects of word learning, or grammar learning. Work on intermediate linguistic constructions (the area between words and combinatory grammar rules) has been very limited. Although recent usage-based theories of language learning emphasize the role of multiword constructions, much remains to be explored concerning the precise computational mechanisms that underlie how children learn to identify and interpret different types of multiword lexemes. The goal of the current study is to bring in ideas from computational linguistics on the topic of identifying multiword lexemes, and to explore whether these ideas can be extended in a natural way to the domain of child language acquisition. We take a first step toward computational modelling of the acquisition of a widely-documented class of multiword verbs, such as take the train and give a kiss, that children must master early in language learning. Specifically, we show that simple statistics based on the linguistic properties of these multiword verbs are informative for identifying them in a corpus of child-directed utterances. We present preliminary experiments demonstrating that such statistics can be used within a word learning model to learn associations between meanings and sequences of words.

[1]  N. Snider,et al.  More than words: Frequency effects for multi-word phrases , 2010 .

[2]  Terry Regier,et al.  The Emergence of Words: Attentional Learning in Form and Meaning , 2005, Cogn. Sci..

[3]  Garrison W. Cottrell,et al.  Lexical Ambiguity Resolution: Perspectives from Psycholinguistics, Neuropsychology, and Artificial Intelligence , 1988 .

[4]  R. Moon Fixed Expressions and Idioms in English: A Corpus-Based Approach , 1998 .

[5]  Tzong-hong Lin,et al.  Light verb syntax and the theory of phrase structure , 2001 .

[6]  藤原 由美,et al.  書評 John Newman, Give: A Cognitive Linguistic Study, New York: Mouton de Gruyter, 1996(ジョン・ニューマン『与えること:認知言語学的研究』) , 2001 .

[7]  Suzanne Stevenson,et al.  Distinguishing Subtypes of Multiword Expressions Using Linguistically-Motivated Statistical Measures , 2007 .

[8]  Anna L. Theakston,et al.  Going, going, gone: the acquisition of the verb ‘go’ , 2002, Journal of Child Language.

[9]  Kate Nation,et al.  Investigating individual differences in children's real-time sentence comprehension using language-mediated eye movements. , 2003, Journal of experimental child psychology.

[10]  Suzanne Stevenson,et al.  Gradual Acquisition of Verb Selectional Preferences in a Bayesian Model , 2013, Cognitive Aspects of Computational Language Acquisition.

[11]  M. Tomasello Constructing a Language: A Usage-Based Theory of Language Acquisition , 2003 .

[12]  Timothy Baldwin,et al.  Multiword Expressions: A Pain in the Neck for NLP , 2002, CICLing.

[13]  A. Sosa,et al.  Evidence for frequency-based constituents in the mental lexicon: collocations involving the word of , 2002, Brain and Language.

[14]  Lynn E. Grant Frequency of ‘core idioms’ in the British National Corpus (BNC) , 2005 .

[15]  Alexander Clark Unsupervised induction of stochastic context-free grammars using distributional clustering , 2001, CoNLL.

[16]  Colin Bannard A Measure of Syntactic Flexibility for Automatically Identifying Multiword Expressions in Corpora , 2007 .

[17]  Alon Lavie,et al.  High-accuracy Annotation and Parsing of CHILDES Transcripts , 2007 .

[18]  Ping Li,et al.  Dynamic Self-Organization and Early Lexical Development in Children , 2007, Cogn. Sci..

[19]  Afsaneh Fazly,et al.  Acquiring Multiword Verbs: The Role of Statistical Evidence , 2009 .

[20]  Tadao Miyamoto The light verb construction in Japanese , 1999 .

[21]  Nancy Chang,et al.  Putting Meaning into Grammar Learning , 2004, Workshop On Psycho-Computational Models Of Human Language Acquisition.

[22]  Cynthia L Fisher,et al.  Structural limits on verb mapping: the role of abstract structure in 2.5‐year‐olds’ interpretations of novel verbs , 2002 .

[23]  Susanne Z. Riehemann,et al.  A constructional approach to idioms and word formation , 2001 .

[24]  Mark Dras,et al.  Automatic Identification of Support Verbs: A Step Towards a Definition of Semantic Weight , 1995, ArXiv.

[25]  J. Tenenbaum,et al.  Variability, negative evidence, and the acquisition of verb argument constructions. , 2010, Journal of child language.

[26]  John Newman,et al.  Give : a cognitive linguistic study , 1996 .

[27]  R. Schreuder,et al.  Idioms : structural and psychological perspectives , 1997 .

[28]  Chen Yu,et al.  Statistical Cross-Situational Learning to Build Word-to-World Mappings , 2006 .

[29]  Afsaneh Fazly,et al.  Integrating Syntactic Knowledge into a Model of Cross-situational Word Learning , 2010 .

[30]  Dedre Gentner,et al.  The Verb Mutability Effect: Studies of the Combinatorial Semantics of Nouns and Verbs , 1990 .

[31]  Roger S. Brown,et al.  Linguistic determinism and the part of speech. , 1957, Journal of abnormal psychology.

[32]  Paul Pauwels,et al.  Put, set, lay and place : a cognitive linguistic approach to verbal meaning , 2000 .

[33]  A. Goldberg Constructions: A Construction Grammar Approach to Argument Structure , 1995 .

[34]  Andrea Krott,et al.  How the parts relate to the whole: Frequency effects on children's interpretations of novel compounds* , 2008, Journal of Child Language.

[35]  Suzanne Stevenson,et al.  A Computational Model of Early Argument Structure Acquisition , 2008, Cogn. Sci..

[36]  Paul Deane,et al.  A Nonparametric Method for Extraction of Candidate Phrasal Terms , 2005, ACL.

[37]  Stefan Evert,et al.  Identifying Morphosyntactic Preferences in Collocations , 2004, LREC.

[38]  Ray Cattell,et al.  ‘Light’ Verbs in English , 1984 .

[39]  Rens Bod,et al.  Children's Grammars Grow More Abstract with Age - Evidence from an Automatic Procedure for Identifying the Productive Units of Language , 2009, Top. Cogn. Sci..

[40]  Uri Zernik,et al.  Lexical acquisition: Exploiting on-line resources to build a lexicon. , 1991 .

[41]  C. Fisher,et al.  Learning Words and Rules , 2006, Psychological science.

[42]  Josep Alba-Salas,et al.  Light verb constructions in Romance : a syntactic analysis , 2002 .

[43]  B. MacWhinney The CHILDES project: tools for analyzing talk , 1992 .

[44]  Frank Smadja,et al.  Retrieving Collocations from Text: Xtract , 1993, CL.

[45]  A. Wierzbicka 5. Why can you have a drink when you can't *have an eat ? , 1982 .

[46]  Janet Dean Fodor,et al.  Language Acquisition and Learnability: The Structural Triggers Learner , 2001 .

[47]  Peter Ford Dominey,et al.  A Developmental Model of Syntax Acquisition in the Construction Grammar Framework with Cross-Linguistic Validation in English and Japanese , 2004 .

[48]  Afsaneh Fazly,et al.  A Probabilistic Computational Model of Cross-Situational Word Learning , 2010, Cogn. Sci..

[49]  Kenneth Ward Church,et al.  Using Statistics in Lexical Analysis , 2003, Lexical Acquisition: Exploiting On-Line Resources to Build a Lexicon.

[50]  A. Cowie The Treatment of Collocations and Idioms in Learners' Dictionaries , 1981 .

[51]  Fintan J. Costello,et al.  Learning to Interpret Novel Noun-Noun Compounds: Evidence from Category Learning Experiments , 2007, Cognitive Aspects of Computational Language Acquisition.

[52]  Mark Dras,et al.  Death and Lightness: Using a Demographic Model to Find Support Verbs , 1996, ArXiv.

[53]  E. Kako,et al.  First contact in verb acquisition: defining a role for syntax. , 1993, Child development.

[54]  Michael C. Frank,et al.  A Bayesian Framework for Cross-Situational Word-Learning , 2007, NIPS.

[55]  Simone Teufel,et al.  Corpus-based Method for Automatic Identification of Support Verbs for Nominalizations , 1995, EACL.

[56]  Afsaneh Fazly,et al.  AUTOMATIC ACQUISITION OF LEXICAL KNOWLEDGE ABOUT , 2007 .

[57]  Sally Rice,et al.  Patterns of usage for English SIT, STAND, and LIE: A cognitively-inspired exploration in corpus linguistics , 2004 .

[58]  Afsaneh Fazly,et al.  Automatically learning semantic knowledge about multiword predicates , 2007, Lang. Resour. Evaluation.

[59]  Suzanne Stevenson,et al.  Learning verb alternations in a usage-based Bayesian model , 2010 .

[60]  Colin Bannard,et al.  Stored Word Sequences in Language Learning , 2008, Psychological science.

[61]  Dekang Lin,et al.  Automatic Identification of Non-compositional Phrases , 1999, ACL.

[62]  N. Chater,et al.  Simplicity: A cure for overgeneralizations in language acquisition? , 2019, Proceedings of the Twenty-Fourth Annual Conference of the Cognitive Science Society.

[63]  A. Goldberg Constructions at Work: The Nature of Generalization in Language , 2006 .

[64]  C. Fisher,et al.  Two-year-olds use distributional cues to interpret transitivity-alternating verbs , 2009, Language and cognitive processes.

[65]  Timothy Baldwin,et al.  A Statistical Approach to the Semantics of Verb-Particles , 2003, ACL 2003.

[66]  Dan Roth,et al.  Starting from Scratch in Semantic Role Labeling: Early Indirect Supervision , 2013, Cognitive Aspects of Computational Language Acquisition.

[67]  Catherine E. Snow,et al.  Children's language , 1990 .

[68]  R. Brown,et al.  A First Language , 1973 .

[69]  Merja Kytö Collocational and Idiomatic Aspects of Verbs in Early Modern English : A Corpus-based Study of MAKE, HAVE, GIVE, TAKE, and DO , 1999 .

[70]  Aravind K. Joshi,et al.  Measuring the Relative Compositionality of Verb-Noun (V-N) Collocations by Integrating Features , 2005, HLT.

[71]  Suzanne Stevenson,et al.  Classifying Particle Semantics in English Verb-Particle Constructions , 2006 .

[72]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[73]  John Carroll,et al.  Detecting a Continuum of Compositionality in Phrasal Verbs , 2003, ACL 2003.