Corpus-Based Diacritic Restoration for South Slavic Languages
暂无分享,去创建一个
[1] Nikola Ljubesic,et al. Discriminating Between Closely Related Languages on Twitter , 2015, Informatica.
[2] Tomaz Erjavec,et al. Predicting the Level of Text Standardness in User-generated Content , 2015, RANLP.
[3] David Yarowsky,et al. A Comparison of Corpus-Based Techniques for Restoring Accents in Spanish and French Text , 1999 .
[4] Rada Mihalcea,et al. Letter Level Learning for Language Independent Diacritics Restoration , 2002, CoNLL.
[5] Dan Tufis,et al. DIAC+: a Professional Diacritics Recovering System , 2008, LREC.
[6] Kenneth Heafield,et al. KenLM: Faster and Smaller Language Model Queries , 2011, WMT@EMNLP.
[7] Borbála Siklósi,et al. Automatic Diacritics Restoration for Hungarian , 2015, EMNLP.
[8] Nikola Ljubesic,et al. {bs,hr,sr}WaC - Web Corpora of Bosnian, Croatian and Serbian , 2014, WaC@EACL.
[9] J. Šnajder,et al. Automatic Diacritics Restoration in Croatian Texts , 2009 .
[10] Tomaž Erjavec,et al. The slWaC 2 . 0 Corpus of the Slovene Web , 2014 .
[11] Tomaz Erjavec,et al. TweetCaT: a tool for building Twitter corpora of smaller languages , 2014, LREC.
[12] A G N,et al. Bibliographical References , 1965 .
[13] Michel Simard. Automatic Insertion of Accents in French Text , 1998, EMNLP.
[14] David Yarowsky,et al. DECISION LISTS FOR LEXICAL AMBIGUITY RESOLUTION: Application to Accent Restoration in Spanish and French , 1994, ACL.