Automatic language identification of written texts
The language of a document is one of the most widely used search keys on the Internet. This article describes efficient and easily extensible solutions to the problem of identifying the language of written texts, based on closed grammatical classes, that is, small and stable word classes such as articles, prepositions, conjunctions and pronouns. An identification tool was developed that recognizes texts written in Portuguese, Spanish, French and English.
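To make the idea concrete, here is a minimal sketch of identification by closed-class words, assuming a simple scheme in which each language is scored by how many tokens of the text appear in its word list. The word lists, the scoring rule and the name `identify_language` are illustrative assumptions, not the lists or the algorithm used in the article.

```python
import re

# Hypothetical, deliberately tiny closed-class word lists (articles,
# prepositions, conjunctions, pronouns, negation). These are NOT the
# lists used in the article; a real tool would use far larger ones.
CLOSED_CLASS_WORDS = {
    "portuguese": {"o", "os", "as", "um", "uma", "do", "da", "em",
                   "não", "e", "para", "com", "ele", "ela"},
    "spanish": {"el", "la", "los", "las", "un", "una", "del", "en",
                "no", "y", "para", "con", "él", "ella"},
    "french": {"le", "la", "les", "un", "une", "du", "des", "en",
               "pas", "et", "pour", "avec", "il", "elle"},
    "english": {"the", "an", "of", "in", "not", "that", "and",
                "for", "with", "he", "she", "is"},
}


def identify_language(text):
    """Return the language whose closed-class words occur most often
    in `text`, or None if no listed word occurs at all."""
    # Split into lowercase runs of letters (accented letters included).
    tokens = re.findall(r"[^\W\d_]+", text.lower())
    # Score each language by the number of tokens found in its list.
    scores = {
        language: sum(1 for token in tokens if token in words)
        for language, words in CLOSED_CLASS_WORDS.items()
    }
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


print(identify_language("Le chat est sur la table et le chien n'est pas dans le jardin."))
```

Under this assumed design, supporting a further language only requires adding one more entry to `CLOSED_CLASS_WORDS`, which is one way to read the "easily extensible" property claimed above. Ties are resolved in favor of the language listed first, and very short texts may contain no listed word at all, in which case the sketch returns `None`.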