UWB at SemEval-2016 Task 11: Exploring Features for Complex Word Identification
暂无分享,去创建一个
In this paper, we present our system developed for the SemEval 2016 Task 11: Complex Word Identification. Our team achieved the 3rd place among 21 participants. Our systems ranked 4th and 13th among 42 submitted systems. We proposed multiple features suitable for complex word identification, evaluated them, and discussed their properties. According to the results of our experiments, our final system used maximum entropy classifier with a single feature – document frequency.
[1] Dan Klein,et al. Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network , 2003, NAACL.
[2] Michal Konkol. Brainy: A Machine Learning Library , 2014, ICAISC.
[3] Lucia Specia,et al. SemEval 2016 Task 11: Complex Word Identification , 2016, *SEMEVAL.
[4] Miloslav Konopík,et al. Latent semantics in language models , 2015, Comput. Speech Lang..
[5] Mark A. Finlayson. Java Libraries for Accessing the Princeton Wordnet: Comparison and Evaluation , 2014, GWC.