Multilayer ToI Detection Approach for Nested NER

Nested entities are common in news articles and biomedical corpora, and recognizing them remains a major challenge in named entity recognition (NER). Unlike the structured prediction models of previous work, this paper approaches nested NER as text-of-interest (ToI) detection and proposes a novel ToI-CNN with dual transformer encoders (ToI-CNN + DTE) model. We design a directional self-attention mechanism that encodes contextual representations over the whole sentence in both the forward and backward directions. Entity features are then extracted from the contextual token representations by a convolutional neural network. A HAT pooling operation converts variable-length ToIs into fixed-length vectors, which are passed to a fully connected network for classification. Joint multi-task training with a layer-classification objective further allows the model to predict the layer at which each nested entity is located. Experimental results show that our model achieves excellent F1 scores, low training cost, and accurate layer prediction on nested NER datasets.
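The directional self-attention described above restricts each token to attend in a single direction; a minimal single-head sketch using triangular masks is shown below. This is an illustrative reconstruction, not the paper's exact formulation: learned projection matrices, multiple heads, and the dual-encoder arrangement are omitted.

```python
import numpy as np

def softmax(x):
    """Numerically stable softmax over the last axis."""
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def directional_self_attention(x, forward=True):
    """Scaled dot-product self-attention over token vectors x (n, d),
    masked so each position attends only to earlier positions
    (forward=True) or only to later positions (forward=False).
    Illustrative sketch: query/key/value projections are omitted."""
    n, d = x.shape
    scores = x @ x.T / np.sqrt(d)
    mask = np.tril(np.ones((n, n))) if forward else np.triu(np.ones((n, n)))
    scores = np.where(mask > 0, scores, -1e9)  # block the disallowed direction
    return softmax(scores) @ x
```

Running the sentence through the function twice, once per direction, yields the forward and backward contextual representations that the abstract's two encoders would produce.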

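The abstract does not spell out the HAT pooling operation, but its role, mapping a variable-length ToI to a fixed-length vector, is analogous to RoI pooling in Fast R-CNN adapted to 1-D text spans. The sketch below shows that generic idea only; the segment count `out_len` and the max-pooling choice are illustrative assumptions, not details from the paper.

```python
import numpy as np

def toi_to_fixed_length(toi, out_len=4):
    """Pool a variable-length ToI (span of token vectors, shape (n, d))
    into out_len segments by max pooling, then flatten to a fixed-length
    vector of size out_len * d. Generic 1-D RoI-style pooling sketch;
    not the paper's HAT pooling definition."""
    n, d = toi.shape
    # Split token positions into out_len roughly equal segments.
    bounds = np.round(np.linspace(0, n, out_len + 1)).astype(int)
    pooled = np.empty((out_len, d))
    for i in range(out_len):
        lo = min(bounds[i], n - 1)
        hi = min(max(bounds[i + 1], bounds[i] + 1), n)  # keep segments non-empty
        pooled[i] = toi[lo:hi].max(axis=0)
    return pooled.reshape(-1)
```

Because the output size depends only on `out_len` and the token dimension, spans of any length can feed the same fully connected classifier.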