Metaphors in Text Simplification: To change or not to change, that is the question

We present an analysis of metaphors in news text simplification. Using features that capture general and metaphor specific characteristics, we test whether we can automatically identify which metaphors will be changed or preserved, and whether there are features that have different predictive power for metaphors or literal words. The experiments show that the Age of Acquisition is the most distinctive feature for both metaphors and literal words. Features that capture Imageability and Concreteness are useful when used alone, but within the full set of features they lose their impact. Frequency of use seems to be the best feature to differentiate metaphors that should be changed and those to be preserved.

[1]  Goran Glavas,et al.  Simplifying Lexical Simplification: Do We Need Simplified Corpora? , 2015, ACL.

[2]  Michael E. Lesk,et al.  Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone , 1986, SIGDOC '86.

[3]  Danushka Bollegala,et al.  Metaphor Interpretation Using Paraphrases Extracted from the Web , 2013, PloS one.

[4]  Son Bao Pham,et al.  Learning to Simplify Children Stories with Limited Data , 2014, ACIIDS.

[5]  Lucia Specia,et al.  SemEval 2016 Task 11: Complex Word Identification , 2016, *SEMEVAL.

[6]  Marie-Francine Moens,et al.  Text simplification for children , 2010, SIGIR 2010.

[7]  Pamela Faber,et al.  Reviewing imagery in resemblance and non-resemblance metaphors , 2010 .

[8]  Michael B. Lewis,et al.  Age of acquisition and the cumulative-frequency hypothesis: a review of the literature and a new multi-task investigation. , 2004, Acta psychologica.

[9]  Petr Sojka,et al.  Software Framework for Topic Modelling with Large Corpora , 2010 .

[10]  Elena Semino,et al.  The Routledge Handbook of Metaphor and Language , 2017 .

[11]  Anne Golden 3. Grasping the point: A study of 15-year-old students’ comprehension of metaphorical expressions in schoolbooks , 2010 .

[12]  Chris Callison-Burch,et al.  Simplification Using Paraphrases and Context-Based Lexical Substitution , 2018, NAACL.

[13]  Dirk De Hertog,et al.  Deep Learning Architecture for Complex Word Identification , 2018, BEA@NAACL-HLT.

[14]  Heiner Stuckenschmidt,et al.  Automatic Assessment of Absolute Sentence Complexity , 2017, IJCAI.

[15]  Tomek Strzalkowski,et al.  Using Imageability and Topic Chaining to Locate Metaphors in Linguistic Corpora , 2013, SBP.

[16]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[17]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[18]  Ekaterina Kochmar,et al.  CAMB at CWI Shared Task 2018: Complex Word Identification with Ensemble-Based Voting , 2018, BEA@NAACL-HLT.

[19]  Marc Brysbaert,et al.  Moving beyond Kučera and Francis: A critical evaluation of current word frequency norms and the introduction of a new and improved word frequency measure for American English , 2009, Behavior research methods.

[20]  A. Paivio,et al.  Concreteness, imagery, and meaningfulness values for 925 nouns. , 1968, Journal of experimental psychology.

[21]  Ekaterina Shutova,et al.  Metaphor Identification as Interpretation , 2013, *SEMEVAL.

[22]  A. Paivio,et al.  Dimensions of metaphor , 1983 .

[23]  Goran Glavas,et al.  Explicit Retrofitting of Distributional Word Vectors , 2018, ACL.

[24]  Chris Callison-Burch,et al.  Optimizing Statistical Machine Translation for Text Simplification , 2016, TACL.

[25]  Michael Wilson,et al.  MRC psycholinguistic database: Machine-usable dictionary, version 2.00 , 1988 .

[26]  M. Brysbaert,et al.  Age-of-acquisition ratings for 30,000 English words , 2012, Behavior research methods.

[27]  Iryna Gurevych,et al.  A Monolingual Tree-based Translation Model for Sentence Simplification , 2010, COLING.

[28]  Walt Detmar Meurers,et al.  Readability assessment for text simplification: From analysing documents to identifying sentential simplifications , 2014 .

[29]  Iryna Gurevych,et al.  Token-Level Metaphor Detection using Neural Networks , 2016 .

[30]  Matt J. Kusner,et al.  From Word Embeddings To Document Distances , 2015, ICML.

[31]  Gülşen Eryiğit,et al.  Simplification of Turkish Sentences , 2016 .

[32]  Chris Callison-Burch,et al.  Problems in Current Text Simplification Research: New Data Can Help , 2015, TACL.

[33]  Lucia Specia,et al.  UOW-SHEF: SimpLex – Lexical Simplicity Ranking based on Contextual and Psycholinguistic Features , 2012, *SEMEVAL.

[34]  David Kauchak,et al.  Learning a Lexical Simplifier Using Wikipedia , 2014, ACL.

[35]  Sampo Pyysalo,et al.  brat: a Web-based Tool for NLP-Assisted Text Annotation , 2012, EACL.

[36]  Magdalena Wolska,et al.  Simplifying metaphorical language for young readers: A corpus study on news text , 2017, BEA@EMNLP.

[37]  Sabine Schulte im Walde,et al.  Improving Verb Metaphor Detection by Propagating Abstractness to Words, Phrases and Individual Senses , 2017 .

[38]  Véronique Hoste,et al.  All Mixed Up? Finding the Optimal Feature Set for General Readability Prediction and Its Application to English and Dutch , 2016, Computational Linguistics.

[39]  Tyler Marghetis,et al.  Literal and Metaphorical Senses in Compositional Distributional Semantic Models , 2016, ACL.

[40]  Sara Tonelli,et al.  ERNESTA: A Sentence Simplification Tool for Children's Stories in Italian , 2013, CICLing.

[41]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[42]  Yair Neuman,et al.  Literal and Metaphorical Sense Identification through Concrete and Abstract Context , 2011, EMNLP.

[43]  Gerard J. Steen,et al.  A method for linguistic metaphor identification : from MIP to MIPVU , 2010 .

[44]  Lucia Specia,et al.  A Report on the Complex Word Identification Shared Task 2018 , 2018, BEA@NAACL-HLT.

[45]  G. Lakoff,et al.  Metaphors We Live by , 1982 .

[46]  Horacio Saggion,et al.  Towards Automatic Lexical Simplification in Spanish: An Empirical Study , 2012, PITR@NAACL-HLT.

[47]  Sergiu Nisioi,et al.  Exploring Neural Text Simplification Models , 2017, ACL.