Modelling the Meaning of Argument Constructions with Distributional Semantics

Current computational models of argument constructions typically represent their semantic content with hand-made formal structures. Here we present a distributional model implementing the idea that the meaning of a construction is intimately related to the semantics of its typical verbs. First, we identify the typical verbs occurring with a given syntactic construction and build their distributional vectors. We then calculate the weighted centroid of these vectors in order to derive the distributional signature of a construction. In order to assess the goodness of our approach, we replicated the priming effect described by Johnson and Golberg (2013) as a function of the semantic distance between a construction and its prototypical verbs. Additional support for our view comes from a regression analysis showing that our distributional information can be used to model behavioral data collected with a crowdsourced elicitation experiment. Argument constructions and verb meaning One of the main tenets of Constructionist approaches (Goldberg 2006; Hoffmann and Trousdale 2013) is that grammar contains argument constructions (Cxns) associated with abstract meanings independently of the verbs occurring in them. For instance, the Ditransitive Cxn “Subj V Obj1 Obj2” (She gave him a book) is associated with a semantic content that can be paraphrased as X CAUSES Y TO RECEIVE Z. This represents a major breaking point between Constructionist models and so-called “projectionist” approaches stemming from the formal generative tradition (Chomsky 2000) and arguing for the the pivotal role played by the main verb in the interpretation of the sentence. According to this latter view, grammar includes only formal combinatorial principles, while the semantic properties of a sentence are determined by the syntactic and semantic properties projected from the main verb. For instance, the syntactic configurations in the sentences in (1) are projections of the main verb to slice: (1) a. He sliced the bread. b. Pat sliced the carrots into the salad. c. Pat sliced Chris a piece of pie. d. Pat sliced the box open. (examples from Goldberg (2006)) Copyright c © 2017, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved. This view has been criticized in the Constructionist tradition, following theoretical considerations as well as psycholinguistics arguments (Goldberg 1995; 2006). From the theoretical side, it has been pointed out that, if argument structure is projected solely by the meaning of the verb, than it is necessary to stipulate a different meaning for each occurrence of a given verb in various argument structures. As for the sentences in (1), this view would lead to postulate special senses of this verb roughly meaning (1a) to cut something with a sharp instrument; (1b) to cut something with a sharp instrument so as to move it; (1c) to cut something for someone else with a sharp instrument; (1d) to cut something with a sharp instrument so as to change its state. In a Constructionist perspective, the verb “to slice” is always used with the intuitive meaning of to cut with a sharp instrument, the additional meaning coming from the construction in which it occurs, whose semantics can be paraphrased as (1a) something acting on something else; (1b) something causing something else to move; (1c) someone intending to cause someone to receive something; (1d) someone causing something to change state (Goldberg 1995). Psycholinguistic evidence, on the other side, mostly originates from research on language comprehension and language acquisition. As for the former, studies like Bencini and Goldberg (2000), Kaschak and Glenberg (2000), Kako (2006), Goldwater and Markman (2009) and Johnson and Goldberg (2013) support the idea that the construction of a sentence (rather than the verb only) plays a role in its interpretation. In a sorting experiment, Bencini and Goldberg (2000) showed that, when asked to sort sentences on the basis of their overall meaning, subjects were as likely to rely on the verb as on the construction. Kaschak and Glenberg (2000) and Goldwater and Markman (2009) tapped into the semantic content of different syntactic frames by using novel denominal verbs in a comprehension task. Likewise, Kako (2006) investigated the meaning of six syntactic frames by collecting linguistic judgments over phrases whose content words were replaced by nonsense words (a.k.a. “Jabberwocky”sentences like The grack mecked the zarg). While all these works exploited off-line tasks or explicit judgments, Johnson and Goldberg (2013) demonstrated that the constructional meaning is accessed quickly by asking their participants to perform a speeded lexical decision task on a target verb, after being exposed to JabThe AAAI 2017 Spring Symposium on Computational Construction Grammar and Natural Language Understanding Technical Report SS-17-02

[1]  R. H. Baayen,et al.  Analyzing Linguistic Data: Solutions to the exercises , 2008 .

[2]  Lila R. Gleitman,et al.  The Role of Syntax in Verb Learning , 2019, The Handbook of Child Language.

[3]  Stefan Evert,et al.  A Large Scale Evaluation of Distributional Semantic Models: Parameters, Interactions and Model Selection , 2014, TACL.

[4]  Arthur B. Markman,et al.  Constructional sources of implicit agents in sentence comprehension , 2006 .

[5]  Graeme Trousdale,et al.  The Oxford Handbook of Construction Grammar , 2013 .

[6]  Michael P. Kaschak,et al.  Constructing Meaning: The Role of Affordances and Grammatical Constructions in Sentence Comprehension , 2000 .

[7]  Adele E. Goldberg,et al.  The contribution of argument structure constructions to sentence meaning , 2000 .

[8]  Stephen Clark,et al.  Improving Distributional Semantic Vectors through Context Selection and Normalisation , 2014, EACL.

[9]  John A Bullinaria,et al.  Extracting semantic representations from word co-occurrence statistics: stop-lists, stemming, and SVD , 2012, Behavior Research Methods.

[10]  Marco Baroni,et al.  Composition in Distributional Semantics , 2013, Lang. Linguistics Compass.

[11]  Alessandro Lenci,et al.  Distributional semantics in linguistic and cognitive research , 2008 .

[12]  Suzanne Stevenson,et al.  A Computational Model of Early Argument Structure Acquisition , 2008, Cogn. Sci..

[13]  Michael Tomasello,et al.  Lexical frequency and exemplar-based learning effects in language acquisition: evidence from sentential complements , 2010 .

[14]  Adele E. Goldberg,et al.  Constructions at Work , 2005 .

[15]  Mark Davies The 385+ million word Corpus of Contemporary American English (1990―2008+): Design, architecture, and linguistic insights , 2009 .

[16]  Stefan Evert,et al.  Corpora and collocations , 2007 .

[17]  Mirella Lapata,et al.  Dependency-Based Construction of Semantic Space Models , 2007, CL.

[18]  Alessandro Lenci,et al.  Distributional Memory: A General Framework for Corpus-Based Semantics , 2010, CL.

[19]  Kenneth Ward Church,et al.  Word Association Norms, Mutual Information, and Lexicography , 1989, ACL.

[20]  Zellig S. Harris,et al.  Distributional Structure , 1954 .

[21]  A. Goldberg,et al.  Evidence for automatic accessing of constructional meaning: Jabberwocky sentences prime associated verbs , 2013 .

[22]  Ted Briscoe,et al.  A Large Subcategorization Lexicon for Natural Language Processing Applications , 2006, LREC.

[23]  G. Miller,et al.  Contextual correlates of semantic similarity , 1991 .

[24]  Jacob Cohen,et al.  Applied multiple regression/correlation analysis for the behavioral sciences , 1979 .

[25]  Libby Barak,et al.  Modeling the Emergence of an Exemplar Verb in Construction Learning , 2013, CogSci.

[26]  Noam Chomsky,et al.  New horizons in the study of language , 1997 .

[27]  Allyson Ettinger,et al.  Evaluating vector space models using human semantic priming results , 2016, RepEval@ACL.

[28]  Yoav Goldberg,et al.  A Dataset of Syntactic-Ngrams over Time from a Very Large Corpus of English Books , 2013, *SEMEVAL.

[29]  L. Gleitman,et al.  Language and Experience: Evidence from the Blind Child , 1988 .

[30]  P. Tichý Constructions , 1986, Philosophy of Science.

[31]  Dekang Lin,et al.  Automatic Retrieval and Clustering of Similar Words , 1998, ACL.

[32]  H. Gleitman,et al.  Human simulations of vocabulary learning , 1999, Cognition.

[33]  A. Goldberg The Emergence of the Semantics of Argument Structure Constructions , 2013 .

[34]  Patrick Pantel,et al.  From Frequency to Meaning: Vector Space Models of Semantics , 2010, J. Artif. Intell. Res..

[35]  Gregory Grefenstette,et al.  Explorations in automatic thesaurus discovery , 1994 .

[36]  Edward Thomas Kako,et al.  The event semantics of syntactic structures , 1998 .

[37]  J. Bullinaria,et al.  Extracting semantic representations from word co-occurrence statistics: A computational study , 2007, Behavior research methods.

[38]  Magnus Sahlgren,et al.  The Word-Space Model: using distributional analysis to represent syntagmatic and paradigmatic relations between words in high-dimensional vector spaces , 2006 .