Towards an Inventory of English Verb Argument Constructions

This paper outlines and pilots our approach towards developing an inventory of verb-argument constructions based upon English form, function, and usage. We search a tagged and dependency-parsed BNC (a 100-million word corpus of English) for Verb-Argument Constructions (VACs) including those previously identified in the pattern grammar resulting from the COBUILD project. This generates (1) a list of verb types that occupy each construction. We next tally the frequency profiles of these verbs to produce (2) a frequency ranked type-token distribution for these verbs, and we determine the degree to which this is Zipfian. Since some verbs are faithful to one construction while others are more promiscuous, we next produce (3) a contingency-weighted list reflecting their statistical association. To test whether each of these measures is a step towards increasing the learnability of VACs as categories, following principles of associative learning, we examine 20 verbs from each distribution. Here we explore whether there is an increase in the semantic cohesion of the verbs occupying each construction using semantic similarity measures. From inspection, this seems to be so. We are developing measures of this using network measures of clustering in the verb-space defined by WordNet and Roget's Thesaurus.

[1]  Christopher R. Johnson,et al.  Background to Framenet , 2003 .

[2]  David R. Shanks,et al.  The Psychology of Associative Learning , 1995 .

[3]  Sergey N. Dorogovtsev,et al.  Evolution of Networks: From Biological Nets to the Internet and WWW (Physics) , 2003 .

[4]  Kristin Davidse Gill Francis, Susan Hunston and Elizabeth Manning (eds.) Collins COBUILD Grammar Patterns 1: Verbs , 1998 .

[5]  Susan Hunston,et al.  The automatic recognition of verb patterns: a feasibility study , 2004 .

[6]  Adele E. Goldberg,et al.  Learning argument structure generalizations , 2004 .

[7]  A. Goldberg Constructions at Work: The Nature of Generalization in Language , 2006 .

[8]  Nick C. Ellis,et al.  Constructions and their acquisition: Islands and the distinctiveness of their occupancy , 2009 .

[9]  Ricard V. Solé,et al.  Language networks: Their structure, function, and evolution , 2007, Complex..

[10]  Ronald Carter,et al.  Trust the Text: Language, Corpus and Discourse , 2004 .

[11]  Stefan Th. Gries,et al.  Collostructions: Investigating the interaction of words and constructions , 2003 .

[12]  John Sinclair,et al.  Grammar patterns. 1., Verbs , 1998 .

[13]  A. Ninio Pathbreaking verbs in syntactic development and the question of prototypical transitivity , 1999, Journal of Child Language.

[14]  Ted Briscoe,et al.  The Second Release of the RASP System , 2006, ACL.

[15]  Ted Briscoe,et al.  The BNC Parsed with RASP4UIMA , 2008, LREC.

[16]  Albert-László Barabási,et al.  Evolution of Networks: From Biological Nets to the Internet and WWW , 2004 .

[17]  S. Gries,et al.  Extending collostructional analysis: A corpus-based perspective on `alternations' , 2004 .

[18]  Ted Pedersen,et al.  WordNet::Similarity - Measuring the Relatedness of Concepts , 2004, NAACL.

[19]  Susan Hunston,et al.  Grammar Patterns 1: Verbs , 1996 .

[20]  G. Zipf The Psycho-Biology Of Language: AN INTRODUCTION TO DYNAMIC PHILOLOGY , 1999 .

[21]  Jean Christophe Verstraeh Frequency and the emergence of linguistic structure , 2005 .