Evaluating the Impact of Knowledge Graph Context on Entity Disambiguation Models

Pretrained Transformer models have emerged as state-of-the-art approaches that learn contextual information from text to improve performance on several NLP tasks. These models, albeit powerful, still require specialized knowledge in specific scenarios. In this paper, we argue that context derived from a knowledge graph (KG), in our case Wikidata, provides enough signal to inform pretrained Transformer models and improve their performance on named entity disambiguation (NED) over the Wikidata KG. We further hypothesize that our proposed KG context can be standardized for Wikipedia, and we evaluate its impact on the state-of-the-art NED model for the Wikipedia knowledge base. Our empirical results validate that the proposed KG context generalizes to Wikipedia, and that providing KG context to Transformer architectures considerably outperforms the existing baselines, including the vanilla Transformer models.
