论文信息 - Corpus-Aware Graph Aggregation Network for Sequence Labeling - 字舞流文

Corpus-Aware Graph Aggregation Network for Sequence Labeling

Current state-of-the-art sequence labeling models are typically based on sequential architecture such as Bi-directional LSTM (BiLSTM). However, the structure of processing a word at a time based on the sequential order restricts the full utilization of non-sequential features, including syntactic relationships, word co-occurrence relations, and document topics. They can be regarded as the corpus-level features and critical for sequence labeling. In this paper, we propose a Corpus-Aware Graph Aggregation Network. Specifically, we build three types of graphs, i.e., a word-topic graph, a word co-occurrence graph, and a word syntactic dependency graph, to express different kinds of corpus-level non-sequential features. After that, a graph convolutional network (GCN) is adapted to model the relations between words and non-sequential features. Finally, we employ a label-aware attention mechanism to aggregate corpus-aware non-sequential features and sequential ones for sequence labeling. The experimental results on four sequence labeling tasks (named entity recognition, chunking, multilingual sequence labeling, and target-based sentiment analysis) show that our model achieves state-of-the-art performance.

Jiangyue Yan | Liuhong Yu | Zhenxi Lin | Haibin Chen | Qianli Ma

[1] Eric Nichols,et al. Named Entity Recognition with Bidirectional LSTM-CNNs , 2015, TACL.

[2] Yuan Luo,et al. Graph Convolutional Networks for Text Classification , 2018, AAAI.

[3] Guoyin Wang,et al. Joint Embedding of Words and Labels for Text Classification , 2018, ACL.

[4] Xin Li,et al. A Unified Model for Opinion Target Extraction and Target Sentiment Prediction , 2018, AAAI.

[5] Zhicheng Dou,et al. Leveraging Multi-Token Entities in Document-Level Named Entity Recognition , 2020, AAAI.

[6] Wei Xu,et al. Bidirectional LSTM-CRF Models for Sequence Tagging , 2015, ArXiv.

[7] Jean-David Ruvini,et al. Learning Better Internal Structure of Words for Sequence Labeling , 2018, EMNLP.

[8] Jungo Kasai,et al. Robust Multilingual Part-of-Speech Tagging via Adversarial Training , 2017, NAACL.

[9] Suresh Manandhar,et al. SemEval-2014 Task 4: Aspect Based Sentiment Analysis , 2014, *SEMEVAL.

[10] Graham Neubig,et al. Cross-Lingual Word Embeddings for Low-Resource Language Modeling , 2017, EACL.

[11] Anima Anandkumar,et al. Deep Active Learning for Named Entity Recognition , 2017, Rep4NLP@ACL.

[12] Kewei Tu,et al. Automated Concatenation of Embeddings for Structured Prediction , 2020, ACL.

[13] Hamid R. Arabnia,et al. ES-LDA: Entity Summarization using Knowledge-based Topic Modeling , 2017, IJCNLP.

[14] Sabine Buchholz,et al. Introduction to the CoNLL-2000 Shared Task Chunking , 2000, CoNLL/LLL.

[15] Chandra Bhagavatula,et al. Semi-supervised sequence tagging with bidirectional language models , 2017, ACL.

[16] Xipeng Qiu,et al. Sequence Labeling With Deep Gated Dual Path CNN , 2019, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[17] Hui Chen,et al. GRN: Gated Relation Network to Enhance Convolutional Neural Network for Named Entity Recognition , 2019, AAAI.

[18] Samuel R. Bowman,et al. A Gold Standard Dependency Corpus for English , 2014, LREC.

[19] Andrew McCallum,et al. Fast and Accurate Entity Recognition with Iterated Dilated Convolutions , 2017, EMNLP.

[20] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[21] Sang-goo Lee,et al. Learning Context Using Segment-Level LSTM for Neural Sequence Labeling , 2020, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[22] Yue Zhang,et al. Sentence-State LSTM for Text Representation , 2018, ACL.

[23] Philippe Langlais,et al. Robust Lexical Features for Improved Neural Network Named-Entity Recognition , 2018, COLING.

[24] Anders Søgaard,et al. Deep multi-task learning with low level tasks supervised at lower layers , 2016, ACL.

[25] Benjamin Van Durme,et al. Open Domain Targeted Sentiment , 2013, EMNLP.

[26] Yue Zhang,et al. Hierarchically-Refined Label Attention Network for Sequence Labeling , 2019, EMNLP.

[27] Luke S. Zettlemoyer,et al. Deep Contextualized Word Representations , 2018, NAACL.

[28] Lidong Bing,et al. Exploiting BERT for End-to-End Aspect-based Sentiment Analysis , 2019, EMNLP.

[29] Diego Marcheggiani,et al. Encoding Sentences with Graph Convolutional Networks for Semantic Role Labeling , 2017, EMNLP.

[30] Fandong Meng,et al. GCDT: A Global Context Enhanced Deep Transition Architecture for Sequence Labeling , 2019, ACL.

[31] Christopher D. Manning,et al. An Effective Two-Stage Model for Exploiting Non-Local Dependencies in Named Entity Recognition , 2006, ACL.

[32] Hwee Tou Ng,et al. Towards Robust Linguistic Analysis using OntoNotes , 2013, CoNLL.

[33] Xiang Wan,et al. Improving Named Entity Recognition with Attentive Ensemble of Syntactic Information , 2020, FINDINGS.

[34] Quoc V. Le,et al. Semi-Supervised Sequence Modeling with Cross-View Training , 2018, EMNLP.

[35] Roland Vollgraf,et al. Pooled Contextualized Embeddings for Named Entity Recognition , 2019, NAACL.

[36] Xiang Ren,et al. Empower Sequence Labeling with Task-Aware Neural Language Model , 2017, AAAI.

[37] Regina Barzilay,et al. GraphIE: A Graph-Based Framework for Information Extraction , 2018, NAACL.

[38] Eduard H. Hovy,et al. End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF , 2016, ACL.

[39] Michael I. Jordan,et al. Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[40] LeeSang-goo,et al. Learning Context Using Segment-Level LSTM for Neural Sequence Labeling , 2020 .

[41] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[42] Erik F. Tjong Kim Sang,et al. Introduction to the CoNLL-2003 Shared Task: Language-Independent Named Entity Recognition , 2003, CoNLL.

[43] Bowen Zhou,et al. Neural Models for Sequence Chunking , 2017, AAAI.

[44] Masanori Hattori,et al. Character-Based LSTM-CRF with Radical-Level Features for Chinese Named Entity Recognition , 2016, NLPCC/ICCPOL.

[45] Haris Papageorgiou,et al. SemEval-2016 Task 5: Aspect Based Sentiment Analysis , 2016, *SEMEVAL.

[46] Hai Zhao,et al. Hierarchical Contextualized Representation for Named Entity Recognition , 2019, AAAI.

[47] Xin Li,et al. Aspect Term Extraction with History Attention and Selective Transformation , 2018, IJCAI.

[48] Wei Lu,et al. Dependency-Guided LSTM-CRF for Named Entity Recognition , 2019, EMNLP.