论文信息 - On the Strength of Character Language Models for Multilingual Named Entity Recognition

On the Strength of Character Language Models for Multilingual Named Entity Recognition

Character-level patterns have been widely used as features in English Named Entity Recognition (NER) systems. However, to date there has been no direct investigation of the inherent differences between name and nonname tokens in text, nor whether this property holds across multiple languages. This paper analyzes the capabilities of corpus-agnostic Character-level Language Models (CLMs) in the binary task of distinguishing name tokens from non-name tokens. We demonstrate that CLMs provide a simple and powerful model for capturing these differences, identifying named entity tokens in a diverse set of languages at close to the performance of full NER systems. Moreover, by adding very simple CLM-based features we can significantly improve the performance of an off-the-shelf NER system for multiple languages.

[1] Andreas Stolcke,et al. SRILM - an extensible language modeling toolkit , 2002, INTERSPEECH.

[2] Erik F. Tjong Kim Sang,et al. Introduction to the CoNLL-2003 Shared Task: Language-Independent Named Entity Recognition , 2003, CoNLL.

[3] Dan Roth,et al. Two Discourse Driven Language Models for Semantics , 2016, ACL.

[4] Daniel S. Weld,et al. Fine-Grained Entity Recognition , 2012, AAAI.

[5] Dan Klein,et al. Named Entity Recognition with Character-Level Models , 2003, CoNLL.

[6] Stephen D. Mayhew,et al. CogCompNLP: Your Swiss Army Knife for NLP , 2018, LREC.

[7] Guillaume Lample,et al. Neural Architectures for Named Entity Recognition , 2016, NAACL.

[8] Stephanie Strassel,et al. LORELEI Language Packs: Data, Tools, and Resources for Technology Development in Low Resource Languages , 2016, LREC.

[9] Christopher D. Manning,et al. Classifying Unknown Proper Noun Phrases Without Context , 2002 .

[10] Petr Sojka,et al. Software Framework for Topic Modelling with Large Corpora , 2010 .

[11] Baltescu Paul,et al. OxLM: A Neural Language Modelling Framework for Machine Translation , 2014, Prague Bull. Math. Linguistics.

[12] David Yarowsky,et al. Language Independent Named Entity Recognition Combining Morphological and Contextual Evidence , 1999, EMNLP.

[13] Dan Roth,et al. Design Challenges and Misconceptions in Named Entity Recognition , 2009, CoNLL.