ANNLOR: A Naïve Notation-system for Lexical Outputs Ranking
暂无分享,去创建一个
This paper presents the systems we developed while participating in the first task (English Lexical Simplification) of SemEval 2012. Our first system relies on n-grams frequencies computed from the Simple English Wikipedia version, ranking each substitution term by decreasing frequency of use. We experimented with several other systems, based on term frequencies, or taking into account the context in which each substitution term occurs. On the evaluation corpus, we achieved a 0.465 score with the first system.
[1] Iryna Gurevych,et al. A Monolingual Tree-based Translation Model for Sentence Simplification , 2010, COLING.
[2] Lucia Specia,et al. SemEval-2012 Task 1: English Lexical Simplification , 2012, *SEMEVAL.
[3] Thorsten Joachims,et al. Training linear SVMs in linear time , 2006, KDD '06.
[4] Helmut Schmidt,et al. Probabilistic part-of-speech tagging using decision trees , 1994 .