论文信息 - Modeling Dense Cross-Modal Interactions for Joint Entity-Relation Extraction - 字舞流文

Modeling Dense Cross-Modal Interactions for Joint Entity-Relation Extraction

Joint extraction of entities and their relations benefits from the close interaction between named entities and their relation information. Therefore, how to effectively model such cross-modal interactions is critical for the final performance. Previous works have used simple methods such as label-feature concatenation to perform coarse-grained semantic fusion among cross-modal instances, but fail to capture fine-grained correlations over token and label spaces, resulting in insufficient interactions. In this paper, we propose a deep Cross-Modal Attention Network (CMAN) for joint entity and relation extraction. The network is carefully constructed by stacking multiple attention units in depth to fully model dense interactions over token-label spaces, in which two basic attention units are proposed to explicitly capture fine-grained correlations across different modalities (e.g., token-to-token and labelto-token). Experiment results on CoNLL04 dataset show that our model obtains state-of-the-art results by achieving 90.62% F1 on entity recognition and 72.97% F1 on relation classification. In ADE dataset, our model surpasses existing approaches by more than 1.9% F1 on relation classification. Extensive analyses further confirm the effectiveness of our approach.

Fang Liu | Shan Zhao | Zhiping Cai | Minghao Hu | Minghao Hu | Zhiping Cai | Shan Zhao | Fang Liu

[1] Mingxin Zhou,et al. Entity-Relation Extraction as Multi-Turn Question Answering , 2019, ACL.

[2] L. Ohno-Machado. Journal of Biomedical Informatics , 2001 .

[3] Jun'ichi Tsujii,et al. A Rich Feature Vector for Protein-Protein Interaction Extraction from Multiple Corpora , 2009, EMNLP.

[4] Thomas Demeester,et al. Adversarial training for multi-context joint entity and relation extraction , 2018, EMNLP.

[5] Michael I. Jordan,et al. Advances in Neural Information Processing Systems 30 , 1995 .

[6] Luke S. Zettlemoyer,et al. Deep Contextualized Word Representations , 2018, NAACL.

[7] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8] Guoyin Wang,et al. Joint Embedding of Words and Labels for Text Classification , 2018, ACL.

[9] Heike Adel,et al. Global Normalization of Convolutional Neural Networks for Joint Entity and Relation Classification , 2017, EMNLP.

[10] Dan Roth,et al. A Linear Programming Formulation for Global Inference in Natural Language Tasks , 2004, CoNLL.

[11] Tung Tran,et al. Neural Metric Learning for Fast End-to-End Relation Extraction , 2019, ArXiv.

[12] Geoffrey E. Hinton,et al. Layer Normalization , 2016, ArXiv.

[13] Margaret Mitchell,et al. VQA: Visual Question Answering , 2015, International Journal of Computer Vision.

[14] Juliane Fluck,et al. Development of a benchmark corpus to support the automatic extraction of drug-related adverse effects from medical case reports , 2012, J. Biomed. Informatics.

[15] Makoto Miwa,et al. End-to-End Relation Extraction using LSTMs on Sequences and Tree Structures , 2016, ACL.

[16] Claire Cardie,et al. Going out on a limb: Joint Extraction of Entity Mentions and Relations without Dependency Trees , 2017, ACL.

[17] Zhiyuan Liu,et al. Relation Classification via Multi-Level Attention CNNs , 2016, ACL.

[18] Peng Zhou,et al. Joint Extraction of Entities and Relations Based on a Novel Tagging Scheme , 2017, ACL.

[19] BMC Bioinformatics , 2005 .

[20] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[21] Fei Li,et al. A neural joint model for entity and relation extraction from biomedical text , 2017, BMC Bioinformatics.

[22] Adrian Ulges,et al. Span-based Joint Entity and Relation Extraction with Transformer Pre-training , 2020, ECAI.

[23] Noah A. Smith,et al. Proceedings of EMNLP , 2007 .

[24] Zhou Yu,et al. Deep Modular Co-Attention Networks for Visual Question Answering , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[25] Yue Zhang,et al. Hierarchically-Refined Label Attention Network for Sequence Labeling , 2019, EMNLP.

[26] Wei Lu,et al. Attention Guided Graph Convolutional Networks for Relation Extraction , 2019, ACL.