An exploration of language identification techniques for the Dutch folktale database
[1] Fei Xia, et al. Language ID in the Context of Harvesting Language Data off the Web, 2009, EACL.
[2] Theo Meder. From a Dutch Folktale Database towards an International Folktale Database, 2010.
[3] M. Damashek, et al. Gauging Similarity with n-Grams: Language-Independent Categorization of Text, 1995, Science.
[4] Ted E. Dunning, et al. Statistical Identification of Language, 1994.
[5] Wouter Weerkamp, et al. Microblog language identification: overcoming the limitations of short, unedited and idiomatic text, 2012, Language Resources and Evaluation.
[6] W. B. Cavnar, et al. N-gram-based text categorization, 1994.
[7] Timothy Baldwin, et al. Language Identification: The Long and the Short of the Matter, 2010, NAACL.
[8] E. Mark Gold, et al. Language Identification in the Limit, 1967, Inf. Control.
[9] H. Isahara, et al. Language identification based on string kernels, 2005, IEEE International Symposium on Communications and Information Technology (ISCIT 2005).
[10] Paul McNamee, et al. Language identification: a solved problem suitable for undergraduate instruction, 2005.
[11] Timothy Baldwin, et al. Reconsidering Language Identification for Written Language Resources, 2006, LREC.
[12] Rafael Dueire Lins, et al. Automatic language identification of written texts, 2004, SAC '04.
[13] Mário J. Silva, et al. Language identification in web pages, 2005, SAC '05.