Token Gazetteer and Character Gazetteer for Named Entity Recognition
Named entity recognition (NER) in information extraction (IE) systems is usually based on large gazetteers: datasets of well-known, classified entities. The lookup itself is often performed by an independent piece of code, which is considered a bottleneck in many NER systems. In this paper, we present two approaches to building tree gazetteers for NER: lookup by token and lookup by character.
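The abstract does not describe the data structures in detail, so the following is only a minimal sketch of what a tree gazetteer could look like, assuming a plain prefix tree (trie) with greedy longest-match lookup. The names `TrieNode`, `TreeGazetteer`, `add`, and `find_all`, the whitespace tokenization, and the entity labels are all hypothetical and chosen for illustration; they are not taken from the paper.

```python
from typing import Dict, List, Optional, Tuple


class TrieNode:
    """One node of the prefix tree; `label` is set where an entity ends."""

    def __init__(self) -> None:
        self.children: Dict[str, "TrieNode"] = {}
        self.label: Optional[str] = None


class TreeGazetteer:
    """Illustrative trie gazetteer keyed by tokens or by characters."""

    def __init__(self, by_token: bool) -> None:
        self.by_token = by_token
        self.root = TrieNode()

    def _units(self, text: str) -> List[str]:
        # Assumption: tokens are whitespace-separated; a real system
        # would likely use a proper tokenizer.
        return text.split() if self.by_token else list(text)

    def add(self, entity: str, label: str) -> None:
        """Insert an entity, creating one tree level per token/character."""
        node = self.root
        for unit in self._units(entity):
            node = node.children.setdefault(unit, TrieNode())
        node.label = label

    def find_all(self, text: str) -> List[Tuple[str, str]]:
        """Scan left to right, keeping the longest match at each position."""
        units = self._units(text)
        joiner = " " if self.by_token else ""
        matches: List[Tuple[str, str]] = []
        i = 0
        while i < len(units):
            node = self.root
            best_end, best_label = None, None
            j = i
            while j < len(units) and units[j] in node.children:
                node = node.children[units[j]]
                j += 1
                if node.label is not None:
                    best_end, best_label = j, node.label
            if best_end is None:
                i += 1  # no entity starts here
            else:
                matches.append((joiner.join(units[i:best_end]), best_label))
                i = best_end  # continue after the matched entity
        return matches


entities = [("New York", "LOC"), ("New York Times", "ORG"), ("York", "LOC")]
token_gaz = TreeGazetteer(by_token=True)
char_gaz = TreeGazetteer(by_token=False)
for name, label in entities:
    token_gaz.add(name, label)
    char_gaz.add(name, label)

sentence = "She reads the New York Times in New York daily"
print(token_gaz.find_all(sentence))
print(char_gaz.find_all(sentence))
print(token_gaz.find_all("Yorkshire"), char_gaz.find_all("Yorkshire"))
```

In this sketch the two variants differ only in the unit used as the tree edge. One visible consequence is that the character variant has no notion of word boundaries, so it finds "York" inside "Yorkshire", while the token variant does not; whether the paper's character gazetteer handles boundaries differently is not stated in the abstract.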