Natural Language Identification using Corpus-Based Models
暂无分享,去创建一个
This paper describes three approaches to the task of automatically identifying the language a text is written in. We conducted experiments to compare the success of each approach in identifying languages from a set of texts in Dutch/Friesian, English, French, Gaelic (Irish), German, Italian, Portuguese, Serbo-Croat and Spanish.....
[1] George Yule,et al. The study of language , 1998 .
[2] Vladimir Batagelj,et al. Automatic clustering of languages , 1992 .