Paper Information - A Maximum Entropy Model for Part-Of-Speech Tagging

A Maximum Entropy Model for Part-Of-Speech Tagging

This paper presents a statistical model which trains from a corpus annotated with Part-Of-Speech tags and assigns them to previously unseen text with state-of-the-art accuracy. The model can be classified as a Maximum Entropy model and simultaneously uses many contextual features to predict the POS tag. Furthermore, this paper demonstrates the use of specialized features to model difficult tagging decisions, discusses the corpus consistency problems discovered during the implementation of these features, and proposes a training strategy that mitigates these problems.
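To make the idea of "many contextual features" concrete, the following is a minimal sketch of a conditional log-linear (maximum entropy) tag distribution. It is an illustration under stated assumptions, not the paper's exact parameterization or feature set: the tag set, the feature templates in `context_features`, and every value in `WEIGHTS` are hypothetical and hand-picked, whereas a real tagger would estimate the weights from an annotated corpus.

```python
import math

# Hypothetical tag set and feature weights, invented purely for illustration.
# A trained model would estimate these weights from an annotated corpus.
TAGS = ["DT", "NN", "VB"]
WEIGHTS = {
    ("word=the", "DT"): 2.0,
    ("prev_tag=DT", "NN"): 1.5,
    ("suffix=ing", "VB"): 1.0,
    ("word=run", "VB"): 0.8,
    ("word=run", "NN"): 0.5,
}


def context_features(words, i, prev_tag):
    """Return the names of the binary contextual predicates active at position i."""
    word = words[i].lower()
    feats = [f"word={word}", f"prev_tag={prev_tag}"]
    if word.endswith("ing"):
        feats.append("suffix=ing")
    return feats


def tag_distribution(words, i, prev_tag):
    """Compute p(tag | context) as a normalized exponential of summed feature weights."""
    feats = context_features(words, i, prev_tag)
    scores = {
        tag: math.exp(sum(WEIGHTS.get((feat, tag), 0.0) for feat in feats))
        for tag in TAGS
    }
    z = sum(scores.values())  # normalization constant over all candidate tags
    return {tag: score / z for tag, score in scores.items()}


# Usage: tag the word "run" when the previous tag was a determiner.
dist = tag_distribution(["the", "run"], 1, "DT")
best = max(dist, key=dist.get)
```

Because every active feature contributes to the score of each candidate tag at once, evidence from the word itself and from the preceding tag is combined in a single distribution rather than being consulted one source at a time.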

Adwait Ratnaparkhi

[1] J. Darroch, et al. Generalized Iterative Scaling for Log-Linear Models, 1972.

[2] Robert L. Mercer, et al. Class-Based n-gram Models of Natural Language, 1992, CL.

[3] Beatrice Santorini, et al. Building a Large Annotated Corpus of English: The Penn Treebank, 1993, CL.

[4] Ronald Rosenfeld, et al. Adaptive Language Modeling Using the Maximum Entropy Principle, 1993, HLT.

[5] Richard M. Schwartz, et al. Coping with Ambiguity and Unknown Words through Probabilistic Models, 1993, CL.

[6] Adwait Ratnaparkhi, et al. A Maximum Entropy Model for Prepositional Phrase Attachment, 1994, HLT.

[7] Bernard Mérialdo, et al. Tagging English Text with a Probabilistic Model, 1994, CL.

[8] John D. Lafferty, et al. Decision Tree Parsing using a Hidden Derivation Model, 1994, HLT.

[9] Eric Brill, et al. Some Advances in Transformation-Based Part of Speech Tagging, 1994, AAAI.

[10] David M. Magerman. Statistical Decision-Tree Models for Parsing, 1995, ACL.

[11] Adam L. Berger, et al. A Maximum Entropy Approach to Natural Language Processing, 1996, CL.

[12] John D. Lafferty, et al. Inducing Features of Random Fields, 1995, IEEE Trans. Pattern Anal. Mach. Intell.