Simplifying metaphorical language for young readers: A corpus study on news text

The paper presents first results of an ongoing project on text simplification focusing on linguistic metaphors. Based on an analysis of a parallel corpus of news text professionally simplified for different grade levels, we identify six types of simplification choices falling into two broad categories: preserving metaphors or dropping them. An annotation study on almost 300 source sentences with metaphors (grade level 12) and their simplified counterparts (grade 4) is conducted. The results show that most metaphors are preserved and when they are dropped, the semantic content tends to be preserved rather than dropped, however, it is reworded without metaphorical language. In general, some of the expected tendencies in complexity reduction, measured with psycholinguistic variables linked to metaphor comprehension, are observed, suggesting good prospect for machine learning-based metaphor simplification.

[1]  Cornelia Müller,et al.  The Dynamics of Metaphor: Foregrounding and Activating Metaphoricity in Conversational Interaction , 2010 .

[2]  Pamela Faber,et al.  Reviewing imagery in resemblance and non-resemblance metaphors , 2010 .

[3]  Chris Callison-Burch,et al.  Problems in Current Text Simplification Research: New Data Can Help , 2015, TACL.

[4]  R. Gibbs Metaphor Interpretation as Embodied Simulation , 2006 .

[5]  J. Maule,et al.  The Discourse Dynamics Approach to Metaphor and Metaphor-Led Discourse Analysis , 2009 .

[6]  Lucia Specia,et al.  UOW-SHEF: SimpLex – Lexical Simplicity Ranking based on Contextual and Psycholinguistic Features , 2012, *SEMEVAL.

[7]  Michael Wilson,et al.  MRC psycholinguistic database: Machine-usable dictionary, version 2.00 , 1988 .

[8]  Walt Detmar Meurers,et al.  Readability assessment for text simplification: From analysing documents to identifying sentential simplifications , 2014 .

[9]  A. Paivio,et al.  Concreteness, imagery, and meaningfulness values for 925 nouns. , 1968, Journal of experimental psychology.

[10]  Hadley Wickham,et al.  ggplot2 - Elegant Graphics for Data Analysis (2nd Edition) , 2017 .

[11]  Harold Edward Palmer The grading and simplifying of literary material : a memorandum , .

[12]  D. McNamara,et al.  A Linguistic Analysis of Simplified and Authentic Texts , 2007 .

[13]  A. Paivio,et al.  Metaphor and Thought: Psychological processes in metaphor comprehension and memory , 1993 .

[14]  Jonathan Dunn,et al.  Measuring metaphoricity , 2014, ACL.

[15]  Danielle S. McNamara,et al.  Text simplification and comprehensible input: A case for an intuitive approach , 2012 .

[16]  Advaith Siddharthan,et al.  A survey of research on text simplification , 2014 .

[17]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[18]  Gerard J. Steen,et al.  A method for linguistic metaphor identification : from MIP to MIPVU , 2010 .