论文信息 - Figure Me Out: A Gold Standard Dataset for Metaphor Interpretation

Figure Me Out: A Gold Standard Dataset for Metaphor Interpretation

Metaphor comprehension and understanding is a complex cognitive task that requires interpreting metaphors by grasping the interaction between the meaning of their target and source concepts. This is very challenging for humans, let alone computers. Thus, automatic metaphor interpretation is understudied in part due to the lack of publicly available datasets. The creation and manual annotation of such datasets is a demanding task which requires huge cognitive effort and time. Moreover, there will always be a question of accuracy and consistency of the annotated data due to the subjective nature of the problem. This work addresses these issues by presenting an annotation scheme to interpret verb-noun metaphoric expressions in text. The proposed approach is designed with the goal of reducing the workload on annotators and maintain consistency. Our methodology employs an automatic retrieval approach which utilises external lexical resources, word embeddings and semantic similarity to generate possible interpretations of identified metaphors in order to enable quick and accurate annotation. We validate our proposed approach by annotating around 1,500 metaphors in tweets which were annotated by six native English speakers. As a result of this work, we publish as linked data the first gold standard dataset for metaphor interpretation which will facilitate research in this area.

Paul Buitelaar | John P. McCrae | Omnia Zayed

[1] Anna Korhonen,et al. Unsupervised Metaphor Paraphrasing using a Vector Space Model , 2012, COLING.

[2] James W. Manns. METAPHOR AND PARAPHRASE , 1975 .

[3] Luis Alfonso Ureña López,et al. Language technologies applied to document simplification for helping autistic people , 2015, Expert Syst. Appl..

[4] Jean Maillard,et al. Black Holes and White Rabbits: Metaphor Identification with Visual Features , 2016, NAACL.

[5] Simone Teufel,et al. Metaphor Corpus Annotated for Source - Target Domain Mappings , 2010, LREC.

[6] G. Lakoff,et al. Metaphors We Live by , 1982 .

[7] L. Cameron. Metaphor in Educational Discourse , 2003 .

[8] Jean Véronis,et al. EXTRACTING KNOWLEDGE BASES FROM MACHINE- READABLE DICTIONARIES : HAVE WE WASTED OUR TIME? , 1999 .

[9] Iryna Gurevych,et al. Wiktionary: a new rival for expert-built lexicons? Exploring the possibilities of collaborative lexicography , 2012 .

[10] J. R. Landis,et al. The measurement of observer agreement for categorical data. , 1977, Biometrics.

[11] Shalom Lappin,et al. Predicting Human Metaphor Paraphrase Judgments with Deep Neural Networks , 2018, Fig-Lang@NAACL-HLT.