论文信息 - Early results for Named Entity Recognition with Conditional Random Fields, Feature Induction and Web-Enhanced Lexicons

Early results for Named Entity Recognition with Conditional Random Fields, Feature Induction and Web-Enhanced Lexicons

Models for many natural language tasks benefit from the flexibility to use overlapping, non-independent features. For example, the need for labeled data can be drastically reduced by taking advantage of domain knowledge in the form of word lists, part-of-speech tags, character n-grams, and capitalization patterns. While it is difficult to capture such inter-dependent features with a generative probabilistic model, conditionally-trained models, such as conditional maximum entropy models, handle them well. There has been significant work with such models for greedy sequence modeling in NLP (Ratnaparkhi, 1996; Borthwick et al., 1998).

Wei Li | Andrew McCallum | A. McCallum | Wei Li

[1] Adwait Ratnaparkhi,et al. A Maximum Entropy Model for Part-Of-Speech Tagging , 1996, EMNLP.

[2] John D. Lafferty,et al. Inducing Features of Random Fields , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[3] Ralph Grishman,et al. Exploiting Diverse Knowledge Sources via Maximum Entropy in Named Entity Recognition , 1998, VLC@COLING/ACL.

[4] Joe F. Zhou,et al. Proceedings of the 1999 Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, : 21-22 June 1999, University of Maryland, College Park, MD, USA , 1999 .

[5] Yoram Singer,et al. Unsupervised Models for Named Entity Classification , 1999, EMNLP.

[6] Ellen Riloff. Bootstrapping for text learning tasks , 1999 .

[7] Andrew McCallum,et al. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[8] Rob Malouf,et al. A Comparison of Algorithms for Maximum Entropy Parameter Estimation , 2002, CoNLL.

[9] Andrew Mccallum,et al. Chinese Word Segmentation with Conditional Random Fields and Integrated Domain Knowledge , 2003 .

[10] Andrew McCallum,et al. Efficiently Inducing Features of Conditional Random Fields , 2002, UAI.

[11] Fernando Pereira,et al. Shallow Parsing with Conditional Random Fields , 2003, NAACL.