Different Texts, Same Metaphors: Unigrams and Beyond

Current approaches to supervised learning of metaphor tend to use sophisticated features and restrict their attention to constructions and contexts where these features apply. In this paper, we describe the development of a supervised learning system that classifies every content word in running text as used metaphorically or not. We first examine a simple unigram baseline, which achieves surprisingly good results on some of the datasets. We then show how the recall of the system can be improved over this strong baseline.
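
To make the idea of a unigram baseline concrete, here is a minimal sketch in Python. It is not the system described in the paper: it assumes a simple per-word relative-frequency rule rather than any particular classifier. The function names (`train_unigram_baseline`, `predict`), the `threshold` parameter, and the toy training data are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def train_unigram_baseline(labeled_tokens):
    """Count, for each lowercased word, how often it was labeled metaphorical.

    labeled_tokens: iterable of (word, is_metaphor) pairs.
    Returns a dict mapping word -> [metaphorical uses, total uses].
    """
    counts = defaultdict(lambda: [0, 0])
    for word, is_metaphor in labeled_tokens:
        entry = counts[word.lower()]
        entry[0] += int(is_metaphor)
        entry[1] += 1
    return dict(counts)


def predict(counts, word, threshold=0.5):
    """Label a word metaphorical if its training-set metaphor rate reaches
    the threshold. Words unseen in training default to non-metaphorical.
    (This decision rule is an assumption, not the paper's model.)
    """
    met, total = counts.get(word.lower(), (0, 0))
    return total > 0 and met / total >= threshold


# Toy, made-up training data (hypothetical, for illustration only).
train = [
    ("attack", True), ("attack", True), ("attack", False),
    ("table", False),
    ("grasp", True), ("grasp", False),
]
model = train_unigram_baseline(train)
```

In this sketch the only feature is the word itself, so a word seen mostly in metaphorical use in training is always predicted metaphorical. Lowering `threshold` flags more words and so trades precision for recall on this toy rule.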
