A preliminary evaluation of word representations for named-entity recognition

We use different word representations as word features for a named-entity recognition (NER) system with a linear model. This work is part of a larger empirical survey evaluating word representations across NLP tasks. We evaluate Brown clusters, Collobert and Weston (2008) embeddings, and HLBL (Mnih & Hinton, 2009) embeddings of words. All three representations improve NER accuracy: Brown clusters give a larger improvement than either embedding, and the HLBL embeddings a larger improvement than the Collobert and Weston (2008) embeddings. We also discuss practical issues in using embeddings as features; Brown clusters are simpler to use than embeddings because they require less hyperparameter tuning.
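To make the feature setup concrete, below is a minimal, hypothetical sketch (not this paper's implementation) of how dense embedding dimensions and Brown cluster bit-string prefixes can be injected as per-token features into a linear classifier. The EMBEDDINGS and BROWN_PATHS lookups, the EMB_SCALE constant, the feature names, and the toy sentence are illustrative assumptions; a real system would use induced representations, richer context features, and a sequence model trained on annotated NER data.

# Hypothetical sketch (not this paper's code): word representations as features
# for a linear NER tagger. EMBEDDINGS, BROWN_PATHS, EMB_SCALE, and the toy
# sentence are illustrative assumptions, not values from the paper.
from sklearn.feature_extraction import DictVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Toy lookups standing in for induced representations:
# real C&W/HLBL embeddings would be ~50-dimensional; Brown clusters are
# hierarchical bit strings whose prefixes give coarser-to-finer clusters.
EMBEDDINGS = {"john": [0.21, -0.40, 0.05, 0.33], "london": [0.10, 0.55, -0.20, 0.01]}
BROWN_PATHS = {"john": "0010110", "london": "1101001"}
EMB_SCALE = 0.5  # scaling of embedding features: a hyperparameter embeddings introduce

def token_features(tokens, i):
    """Baseline lexical features plus word-representation features for token i."""
    word = tokens[i]
    w = word.lower()
    feats = {"w=" + w: 1.0, "istitle": float(word.istitle())}
    # Dense embedding dimensions as real-valued features (scaled).
    for d, v in enumerate(EMBEDDINGS.get(w, [])):
        feats["emb_%d" % d] = EMB_SCALE * v
    # Brown cluster bit-string prefixes at several depths, as binary features.
    path = BROWN_PATHS.get(w)
    if path:
        for p in (4, 6, len(path)):
            feats["brown_%d=%s" % (p, path[:p])] = 1.0
    return feats

# One toy sentence with IOB-style labels; a real setup would train on CoNLL-style data.
sentences = [(["John", "visited", "London", "."], ["B-PER", "O", "B-LOC", "O"])]
X = [token_features(toks, i) for toks, _ in sentences for i in range(len(toks))]
y = [label for _, labels in sentences for label in labels]

# A linear model over the sparse feature dicts (the paper's setting is also a
# linear model, though not necessarily this particular classifier).
model = make_pipeline(DictVectorizer(), LogisticRegression(max_iter=1000))
model.fit(X, y)
print(model.predict([token_features(["John", "visited", "London", "."], 2)]))

The embedding-scaling constant is one practical knob of the kind alluded to above: real-valued embedding features must be scaled relative to the binary features, whereas the Brown cluster prefix features are binary and need no such tuning.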

[1] Robert L. Mercer, et al. Class-Based n-gram Models of Natural Language, 1992, CL.

[2] Hermann Ney, et al. Algorithms for bigram and trigram word clustering, 1995, Speech Commun.

[3] Akira Ushioda, et al. Hierarchical Clustering of Words, 1996, COLING.

[4] Yoshua Bengio, et al. A Neural Probabilistic Language Model, 2003, J. Mach. Learn. Res.

[5] Jean-Luc Gauvain, et al. Connectionist language modeling for large vocabulary continuous speech recognition, 2002, IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6] Tong Zhang, et al. A Robust Risk Minimization based Named Entity Recognition System, 2003, CoNLL.

[7] Yoshua Bengio, et al. Quick Training of Probabilistic Neural Nets by Importance Sampling, 2003.

[8] Scott Miller, et al. Name Tagging with Word Clusters and Discriminative Training, 2004, NAACL.

[9] Percy Liang, et al. Semi-Supervised Learning for Natural Language, 2005.

[10] Wei Li, et al. Semi-Supervised Sequence Modeling with Syntactic Topic Models, 2005, AAAI.

[11] Yoshua Bengio, et al. Hierarchical Probabilistic Neural Network Language Model, 2005, AISTATS.

[12] Geoffrey E. Hinton, et al. Three new graphical models for statistical language modelling, 2007, ICML '07.

[13] Jason Weston, et al. A unified architecture for natural language processing: deep neural networks with multitask learning, 2008, ICML '08.

[14] Yoshua Bengio, et al. Adaptive Importance Sampling to Accelerate Training of a Neural Probabilistic Language Model, 2008, IEEE Transactions on Neural Networks.

[15] Xavier Carreras, et al. Simple Semi-supervised Dependency Parsing, 2008, ACL.

[16] Yoshua Bengio, et al. Neural net language models, 2008, Scholarpedia.

[17] Geoffrey E. Hinton, et al. A Scalable Hierarchical Distributed Language Model, 2008, NIPS.

[18] Jason Weston, et al. Curriculum learning, 2009, ICML '09.

[19] Marie Candito, et al. Improving generative statistical parsing with semi-supervised word clustering, 2009, IWPT.

[20] Hai Zhao, et al. Multilingual Dependency Learning: A Huge Feature Engineering Method to Semantic Dependency Parsing, 2009, CoNLL Shared Task.

[21] Dan Roth, et al. Design Challenges and Misconceptions in Named Entity Recognition, 2009, CoNLL.

[22] Dekang Lin, et al. Phrase Clustering for Discriminative Learning, 2009, ACL.

[23] Alexander Yates, et al. Distributional Representations for Handling Sparsity in Supervised Sequence-Labeling, 2009, ACL.

[24] Reut Tsarfaty, et al. Enhancing Unlexicalized Parsing Performance Using a Wide Coverage Lexicon, Fuzzy Tag-Set Mapping, and EM-HMM-Based Lexical Probabilities, 2009, EACL.