论文信息 - Name Tagging with Word Clusters and Discriminative Training

Name Tagging with Word Clusters and Discriminative Training

We present a technique for augmenting annotated training data with hierarchical word clusters that are automatically derived from a large unannotated corpus. Cluster membership is encoded in features that are incorporated in a discriminatively trained tagging model. Active learning is used to select training examples. We evaluate the technique for named-entity tagging. Compared with a state-of-the-art HMM-based name finder, the presented technique requires only 13% as much annotated data to achieve the same level of performance. Given a large annotated training set of 1,000,000 words, the technique achieves a 25% reduction in error over the state-of-the-art HMM trained on the same material.

[1] Robert L. Mercer,et al. Class-Based n-gram Models of Natural Language , 1992, CL.

[2] Naftali Tishby,et al. Distributional Clustering of English Words , 1993, ACL.

[3] David Yarowsky,et al. Unsupervised Word Sense Disambiguation Rivaling Supervised Methods , 1995, ACL.

[4] David M. Magerman. Statistical Decision-Tree Models for Parsing , 1995, ACL.

[5] Michael Collins,et al. Three Generative, Lexicalised Models for Statistical Parsing , 1997, ACL.

[6] Avrim Blum,et al. The Bottleneck , 2021, Monopsony Capitalism.

[7] Andrew McCallum,et al. Employing EM and Pool-Based Active Learning for Text Classification , 1998, ICML.

[8] Kamal Nigamyknigam,et al. Employing Em in Pool-based Active Learning for Text Classiication , 1998 .

[9] Yoram Singer,et al. Unsupervised Models for Named Entity Classification , 1999, EMNLP.

[10] Ellen Riloff,et al. Learning Dictionaries for Information Extraction by Multi-Level Bootstrapping , 1999, AAAI/IAAI.

[11] Andrew McCallum,et al. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[12] Ralph Weischedel,et al. Rapid annotation through human-machine collaboration , 2002 .

[13] Michael Collins,et al. Discriminative Training Methods for Hidden Markov Models: Theory and Experiments with Perceptron Algorithms , 2002, EMNLP.

[14] Ralph Grishman,et al. Bootstrapped Learning of Semantic Classes from Positive and Negative Examples , 2003 .

[15] Fernando Pereira,et al. Shallow Parsing with Conditional Random Fields , 2003, NAACL.