Investigations of Synonym Replacement for Swedish

We present results from an investigation of automatic synonym replacement for Swedish. Three strategies for choosing alternative synonyms were evaluated: (1) word frequency, (2) word length, and (3) level of synonymy. Each strategy was evaluated using standardized readability metrics for Swedish, average word length, the proportion of long words, and the ratio of erroneous substitutions to replacements. The results show an improvement in readability for most strategies, but also that erroneous substitutions are frequent.
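The three selection strategies can be illustrated with a minimal sketch. It is not the implementation used in the investigation: the synonym table, the synonymy levels, the frequency counts, and all function names below are hypothetical toy values chosen only to show how the strategies can pick different replacements for the same word.

```python
from typing import Callable, Dict, List, Tuple

Candidate = Tuple[str, float]  # (synonym, synonymy level); the scale is assumed

# Hypothetical toy lexicon and corpus frequencies, for illustration only.
SYNONYMS: Dict[str, List[Candidate]] = {
    "erhålla": [("få", 4.2), ("mottaga", 4.5)],
}
FREQ: Dict[str, int] = {"erhålla": 120, "få": 9500, "mottaga": 40}


def by_frequency(word: str, candidates: List[Candidate]) -> str:
    """Pick the most frequent synonym, if it is more frequent than the word."""
    best = max(candidates, key=lambda c: FREQ.get(c[0], 0))[0]
    return best if FREQ.get(best, 0) > FREQ.get(word, 0) else word


def by_length(word: str, candidates: List[Candidate]) -> str:
    """Pick the shortest synonym, if it is shorter than the word."""
    best = min(candidates, key=lambda c: len(c[0]))[0]
    return best if len(best) < len(word) else word


def by_synonymy(word: str, candidates: List[Candidate]) -> str:
    """Pick the synonym with the highest synonymy level."""
    return max(candidates, key=lambda c: c[1])[0]


def replace_synonyms(
    tokens: List[str], choose: Callable[[str, List[Candidate]], str]
) -> List[str]:
    """Apply one selection strategy to every token that has synonyms."""
    result = []
    for token in tokens:
        candidates = SYNONYMS.get(token)
        result.append(choose(token, candidates) if candidates else token)
    return result
```

With this toy data, `replace_synonyms(["erhålla", "brev"], by_frequency)` and the `by_length` variant both select "få", while `by_synonymy` selects "mottaga". The sketch ignores inflection and word sense, which is one plausible source of the erroneous substitutions reported above.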
