Collaborative Knowledge Graph Fusion by Exploiting the Open Corpus

To ease the process of building Knowledge Graphs (KGs) from scratch, a cost-effective method is required to enrich a KG using the triples extracted from a corpus. However, it is challenging to enrich a KG with newly extracted triples since they contain noisy information. This paper proposes to refine a KG by leveraging information extracted from a corpus. In particular, we first formulate the task of building KGs as two coupled sub-tasks, namely join event extraction and knowledge graph fusion. We then propose a collaborative knowledge graph fusion framework, which is composed of an explorer and a supervisor, to allow the involved two sub-tasks to mutually assist each other in an alternative manner. More concretely, an explorer extracts triples from a corpus supervised by both the ground-truth annotation and the KG provided by the supervisor. Furthermore, a supervisor then evaluates the extracted triples and enriches the KG with those that are highly ranked. To implement this evaluation, we further propose a translated relation alignment scoring mechanism to align and translate the extracted triples to the KG. Experimental results verify that this collaboration can improve both the performance of our sub-tasks, and contribute to high-quality enriched knowledge graphs.

[1]  Xiaojie Yuan,et al.  Toward Tweet Entity Linking With Heterogeneous Information Networks , 2022, IEEE Transactions on Knowledge and Data Engineering.

[2]  Bing Xiang,et al.  REKnow: Enhanced Knowledge for Joint Entity and Relation Extraction , 2022, ArXiv.

[3]  Zijian Wang,et al.  Contrastive Learning for Representation Degeneration Problem in Sequential Recommendation , 2021, WSDM.

[4]  Zhaoli Zhang,et al.  Learning Knowledge Graph Embedding With Heterogeneous Relation Attention Networks , 2021, IEEE Transactions on Neural Networks and Learning Systems.

[5]  Lynda Tamine,et al.  Knowledge Base Embedding By Cooperative Knowledge Distillation , 2020, COLING.

[6]  Weidong Xiao,et al.  Joint Event Extraction with Hierarchical Policy Network , 2020, COLING.

[7]  Philip S. Yu,et al.  Cross-Supervised Joint-Event-Extraction with Heterogeneous Information Networks , 2020, 2020 25th International Conference on Pattern Recognition (ICPR).

[8]  Hoang Long Nguyen,et al.  Knowledge graph fusion for smart systems: A Survey , 2020, Inf. Fusion.

[9]  Ying Lin,et al.  A Joint Neural Model for Information Extraction with Global Features , 2020, ACL.

[10]  Zhifei Li,et al.  Multi-Scale Dynamic Convolutional Network for Knowledge Graph Embedding , 2020, IEEE Transactions on Knowledge and Data Engineering.

[11]  Yan Jia,et al.  Multi-source knowledge fusion: a survey , 2020, World Wide Web.

[12]  Philip S. Yu,et al.  A Survey on Knowledge Graphs: Representation, Acquisition, and Applications , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[13]  Elise van der Pol,et al.  Contrastive Learning of Structured World Models , 2019, ICLR.

[14]  Xing Xie,et al.  News Graph: An Enhanced Knowledge Graph for News Recommendation , 2019, KaRS@CIKM.

[15]  Rui Zhang,et al.  Entity Alignment between Knowledge Graphs Using Attribute Embeddings , 2019, AAAI.

[16]  Gerhard Weikum,et al.  Neural Relation Extraction for Knowledge Base Enrichment , 2019, ACL.

[17]  Yixin Cao,et al.  KGAT: Knowledge Graph Attention Network for Recommendation , 2019, KDD.

[18]  Chenliang Li,et al.  A Survey on Deep Learning for Named Entity Recognition , 2018, IEEE Transactions on Knowledge and Data Engineering.

[19]  Thien Huu Nguyen,et al.  One for All: Neural Joint Modeling of Entities and Events , 2018, AAAI.

[20]  Yuanzhuo Wang,et al.  Shared Embedding Based Neural Networks for Knowledge Graph Completion , 2018, CIKM.

[21]  Ming Zhou,et al.  Neural Open Information Extraction , 2018, ACL.

[22]  Lei Zou,et al.  Answering Natural Language Questions by Subgraph Matching over Knowledge Graphs , 2018, IEEE Transactions on Knowledge and Data Engineering.

[23]  Jonathan Berant,et al.  The Web as a Knowledge-Base for Answering Complex Questions , 2018, NAACL.

[24]  Ambedkar Dukkipati,et al.  Learning beyond Datasets: Knowledge Graph Augmented Neural Networks for Natural Language Processing , 2018, NAACL.

[25]  Dai Quoc Nguyen,et al.  A Novel Embedding Model for Knowledge Base Completion Based on Convolutional Neural Network , 2017, NAACL.

[26]  Zhendong Mao,et al.  Knowledge Graph Embedding: A Survey of Approaches and Applications , 2017, IEEE Transactions on Knowledge and Data Engineering.

[27]  Achim Rettinger,et al.  Linked data quality of DBpedia, Freebase, OpenCyc, Wikidata, and YAGO , 2017, Semantic Web.

[28]  Jian Pei,et al.  A Survey on Network Embedding , 2017, IEEE Transactions on Knowledge and Data Engineering.

[29]  Shashi Narayan,et al.  Creating Training Corpora for NLG Micro-Planners , 2017, ACL.

[30]  Pasquale Minervini,et al.  Convolutional 2D Knowledge Graph Embeddings , 2017, AAAI.

[31]  Nicoleta Preda,et al.  Online Relation Alignment for Linked Datasets , 2017, ESWC.

[32]  Sanjeev Arora,et al.  A Simple but Tough-to-Beat Baseline for Sentence Embeddings , 2017, ICLR.

[33]  Nigel Collier,et al.  Bidirectional LSTM for Named Entity Recognition in Twitter Messages , 2016, NUT@COLING.

[34]  Zhiyuan Liu,et al.  Neural Relation Extraction with Selective Attention over Instances , 2016, ACL.

[35]  Mausam,et al.  Open Information Extraction Systems and Downstream Applications , 2016, IJCAI.

[36]  Tom M. Mitchell,et al.  Joint Extraction of Events and Entities within a Document Context , 2016, NAACL.

[37]  Ido Dagan,et al.  Getting More Out Of Syntax with PropS , 2016, ArXiv.

[38]  Chris Dyer,et al.  Neural Architectures for Named Entity Recognition , 2016, NAACL.

[39]  Jun Zhao,et al.  Distant Supervision for Relation Extraction via Piecewise Convolutional Neural Networks , 2015, EMNLP.

[40]  Li Guo,et al.  Knowledge Base Completion Using Embeddings and Rules , 2015, IJCAI.

[41]  Christopher D. Manning,et al.  Leveraging Linguistic Structure For Open Domain Information Extraction , 2015, ACL.

[42]  Divesh Srivastava,et al.  Knowledge Curation and Knowledge Fusion: Challenges, Models and Applications , 2015, SIGMOD Conference.

[43]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[44]  Hang Li,et al.  Convolutional Neural Network Architectures for Matching Natural Language Sentences , 2014, NIPS.

[45]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[46]  Markus Krötzsch,et al.  Wikidata , 2014, Commun. ACM.

[47]  Wei Zhang,et al.  Knowledge vault: a web-scale approach to probabilistic knowledge fusion , 2014, KDD.

[48]  Wei Zhang,et al.  From Data Fusion to Knowledge Fusion , 2014, Proc. VLDB Endow..

[49]  Jason Weston,et al.  Translating Embeddings for Modeling Multi-relational Data , 2013, NIPS.

[50]  Danqi Chen,et al.  Reasoning With Neural Tensor Networks for Knowledge Base Completion , 2013, NIPS.

[51]  Heng Ji,et al.  Joint Event Extraction via Structured Prediction with Global Features , 2013, ACL.

[52]  Matthew D. Zeiler ADADELTA: An Adaptive Learning Rate Method , 2012, ArXiv.

[53]  Lars Schmidt-Thieme,et al.  BPR: Bayesian Personalized Ranking from Implicit Feedback , 2009, UAI.

[54]  Praveen Paritosh,et al.  Freebase: a collaboratively created graph database for structuring human knowledge , 2008, SIGMOD Conference.

[55]  Erik F. Tjong Kim Sang,et al.  Introduction to the CoNLL-2002 Shared Task: Language-Independent Named Entity Recognition , 2002, CoNLL.

[56]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[57]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[58]  Oscar Ricardo Vergara data fusion , 2022 .