Assessing the Potential of Metaphoricity of verbs using corpus data

The paper investigates the relation between metaphoricity and distributional characteristics of verbs, introducing POM, a corpus-derived index that can be used to define the upper bound of metaphoricity of any expression in which a given verb occurs. The work moves from the observation that while some verbs can be used to create highly metaphoric expressions, others can not. We conjecture that this fact is related to the number of contexts in which a verb occurs and to the frequency of each context. This intuition is modelled by introducing a method in which each context of a verb in a corpus is assigned a vector representation, and a clustering algorithm is employed to identify similar contexts. Eventually, the Standard Deviation of the relative frequency values of the clusters is computed and taken as the POM of the target verb. We tested POM in two experimental settings obtaining values of accuracy of 84% and 92%. Since we are convinced, along with (Shutoff, 2015), that metaphor detection systems should be concerned only with the identification of highly metaphoric expressions, we believe that POM could be profitably employed by these systems to a priori exclude expressions that, due to the verb they include, can only have low degrees of metaphoricity

[1]  Alexander Gelbukh,et al.  Computational Linguistics and Intelligent Text Processing , 2015, Lecture Notes in Computer Science.

[2]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[3]  Geoffrey Zweig,et al.  Linguistic Regularities in Continuous Space Word Representations , 2013, NAACL.

[4]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[5]  Eduard Hovy,et al.  Identifying Metaphorical Word Use with Tree Kernels , 2013 .

[6]  Patrick Hanks,et al.  THE SYNTAGMATICS OF METAPHOR AND IDIOM , 2004 .

[7]  Ido Dagan,et al.  Modeling Word Meaning in Context with Substitute Vectors , 2015, NAACL.

[8]  Yair Neuman,et al.  Literal and Metaphorical Sense Identification through Concrete and Abstract Context , 2011, EMNLP.

[9]  Yorick Wilks,et al.  Making Preferences More Active , 1978, Artif. Intell..

[10]  Gerard J. Steen,et al.  A method for linguistic metaphor identification : from MIP to MIPVU , 2010 .

[11]  Stefan Th. Gries,et al.  Metaphoricity is gradable , 2006 .

[12]  Ekaterina Shutova,et al.  Design and Evaluation of Metaphor Processing Systems , 2015, CL.

[13]  Jeremy H. Clear,et al.  The British national corpus , 1993 .

[14]  Tian Zhang,et al.  BIRCH: an efficient data clustering method for very large databases , 1996, SIGMOD '96.

[15]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[16]  Bryan Rink,et al.  Cross-lingual Semantic Generalization for the Detection of Metaphor , 2015, Int. J. Comput. Linguistics Appl..

[17]  Jonathan Dunn Gradient Semantic Intuitions of Metaphoric Expressions , 2010 .

[18]  Geoffrey Nunberg,et al.  Poetic and Prosaic Metaphors , 1987, TINLAP.

[19]  Jonathan Dunn What metaphor identification systems can tell us about metaphor-in-language , 2013 .

[20]  Jonathan Dunn,et al.  Measuring metaphoricity , 2014, ACL.

[21]  Caroline Sporleder,et al.  Using Gaussian Mixture Models to Detect Figurative Language in Context , 2010, NAACL.

[22]  Lin Sun,et al.  Unsupervised Metaphor Identification Using Hierarchical Graph Factorization Clustering , 2013, NAACL.