Automatically acquiring a language model for POS tagging using decision trees
暂无分享,去创建一个
We present an algorithm that automatically acquires a statistically{based language model for POS tagging, using statistical decision trees. The learning algorithm deals with more complex contextual information than simple collections of n{grams and it is able to use information of diierent nature. The acquired models are independent enough to be easily incorporated , as a statistical core of constraints/rules, in any exible tagger. They are also complete enough to be directly used as sets of POS disam-biguation rules. We have implemented a simple and fast tagger that has been tested and evaluated on the WSJ corpus with a remarkable accuracy. Comparative results are reported.
[1] Donald Hindle,et al. Acquiring Disambiguation Rules from Text , 1989, ACL.
[2] Wendy G. Lehnert,et al. Using Decision Trees for Coreference Resolution , 1995, IJCAI.
[3] Penelope Sibun,et al. A Practical Part-of-Speech Tagger , 1992, ANLP.