A Statistical Study of the WPT-03 Corpus
暂无分享,去创建一个
[1] W. B. Cavnar,et al. N-gram-based text categorization , 1994 .
[2] Campbell B. Read,et al. Zipf's Law , 2004 .
[3] José João Almeida,et al. jspell.pm: um módulo de análise morfológica para uso em processamento de linguagem natural , 2001 .
[4] Mário J. Silva,et al. Language identification in web pages , 2005, SAC '05.
[5] Daniel Gomes,et al. A Characterization of the Portuguese Web , 2003 .
[6] Daniel Gomes. Tarântula-Sistema de Recolha de Documentos na WWW , 2001 .
[7] 関口 洋一,et al. Web Corpus Construction with Quality Improvement , 2003 .
[8] Diana Santos,et al. Evaluating CETEMPúblico, a Free Resource for Portuguese , 2001, ACL.
[9] João P. Campos. Versus: a Web Data Repository with Time Support , 2003 .
[10] Michael Oakes,et al. Statistics for Corpus Linguistics , 1998 .
[11] Thorsten Brants,et al. TnT – A Statistical Part-of-Speech Tagger , 2000, ANLP.
[12] Mário J. Silva,et al. The Case for a Portuguese Web Search Engine , 2003, ICWI.
[13] Eric Brill,et al. A Simple Rule-Based Part of Speech Tagger , 1992, HLT.
[14] George Kingsley Zipf,et al. Human behavior and the principle of least effort , 1949 .
[15] Luís Sarmento,et al. O projecto AC/DC: acesso a corpora/disponibilização de corpora , 2003 .
[16] Oi Yee Kwong,et al. Natural Language Processing - IJCNLP 2004, First International Joint Conference, Hainan Island, China, March 22-24, 2004, Revised Selected Papers , 2005, IJCNLP.