论文信息 - Large-Scale Knowledge Graph Identification Using PSL

Large-Scale Knowledge Graph Identification Using PSL

Building a web-scale knowledge graph, which captures information about entities and the relationships between them, represents a formidable challenge. While many largescale information extraction systems operate on web corpora, the candidate facts they produce are noisy and incomplete. To remove noise and infer missing information in the knowledge graph, we propose knowledge graph identification: a process of jointly reasoning about the structure of the knowledge graph, utilizing extraction confidences and leveraging ontological information. Scalability is often a challenge when building models in domains with rich structure, but we use probabilistic soft logic (PSL), a recentlyintroduced probabilistic modeling framework which easily scales to millions of facts. In practice, our method performs joint inference on a real-world dataset containing over 1M facts and 80K ontological constraints in 12 hours and produces a high-precision set of facts for inclusion into a knowledge graph.

Lise Getoor | William W. Cohen | Jay Pujara | Hui Miao

[1] Jeffrey P. Bigham,et al. Organizing and Searching the World Wide Web of Facts - Step One: The One-Million Fact Extraction Challenge , 2006, AAAI.

[2] Gerhard Weikum,et al. WWW 2007 / Track: Semantic Web Session: Ontologies ABSTRACT YAGO: A Core of Semantic Knowledge , 2022 .

[3] Lise Getoor,et al. Probabilistic Similarity Logic , 2010, UAI.

[4] Estevam R. Hruschka,et al. Toward an Architecture for Never-Ending Language Learning , 2010, AAAI.

[5] Oren Etzioni,et al. Open Information Extraction from the Web , 2007, CACM.

[6] Dejing Dou,et al. Learning to Refine an Automatically Extracted Knowledge Base Using Markov Logic , 2012, 2012 IEEE 12th International Conference on Data Mining.

[7] Dianne P. O'Leary,et al. Scaling MPE Inference for Constrained Continuous Markov Random Fields with Consensus Optimization , 2012, NIPS.

[8] Matthew Richardson,et al. Markov logic networks , 2006, Machine Learning.

[9] Lise Getoor,et al. A short introduction to probabilistic soft logic , 2012, NIPS 2012.