论文信息 - ANNLOR: A Naïve Notation-system for Lexical Outputs Ranking

ANNLOR: A Naïve Notation-system for Lexical Outputs Ranking

This paper presents the systems we developed while participating in the first task (English Lexical Simplification) of SemEval 2012. Our first system relies on n-grams frequencies computed from the Simple English Wikipedia version, ranking each substitution term by decreasing frequency of use. We experimented with several other systems, based on term frequencies, or taking into account the context in which each substitution term occurs. On the evaluation corpus, we achieved a 0.465 score with the first system.

Delphine Bernhard | Cyril Grouin | Anne-Laure Ligozat | Anne Garcia-Fernandez

[1] Iryna Gurevych,et al. A Monolingual Tree-based Translation Model for Sentence Simplification , 2010, COLING.

[2] Lucia Specia,et al. SemEval-2012 Task 1: English Lexical Simplification , 2012, *SEMEVAL.

[3] Thorsten Joachims,et al. Training linear SVMs in linear time , 2006, KDD '06.

[4] Helmut Schmidt,et al. Probabilistic part-of-speech tagging using decision trees , 1994 .