Trust from the past: Bayesian Personalized Ranking based Link Prediction in Knowledge Graphs

Estimating the confidence for a link is a critical task for Knowledge Graph construction. Link prediction, or predicting the likelihood of a link in a knowledge graph based on prior state is a key research direction within this area. We propose a Latent Feature Embedding based link recommendation model for prediction task and utilize Bayesian Personalized Ranking based optimization technique for learning models for each predicate. Experimental results on large-scale knowledge bases such as YAGO2 show that our approach achieves substantially higher performance than several state-of-art approaches. Furthermore, we also study the performance of the link prediction algorithm in terms of topological properties of the Knowledge Graph and present a linear regression model to reason about its expected level of accuracy.

[1]  Chun How Tan,et al.  Trust, but verify: predicting contribution quality for knowledge base construction and curation , 2014, WSDM.

[2]  Gerhard Weikum,et al.  YAGO2: exploring and querying world knowledge in time, space, context, and many languages , 2011, WWW.

[3]  Nitesh V. Chawla,et al.  Link Prediction and Recommendation across Heterogeneous Social Networks , 2012, 2012 IEEE 12th International Conference on Data Mining.

[4]  Charu C. Aggarwal,et al.  When will it happen?: relationship prediction in heterogeneous information networks , 2012, WSDM '12.

[5]  Evgeniy Gabrilovich,et al.  A Review of Relational Machine Learning for Knowledge Graphs , 2015, Proceedings of the IEEE.

[6]  Jason Weston,et al.  Natural Language Processing (Almost) from Scratch , 2011, J. Mach. Learn. Res..

[7]  Lars Schmidt-Thieme,et al.  Predicting RDF triples in incomplete knowledge bases with tensor factorization , 2012, SAC '12.

[8]  Matthew Richardson,et al.  Markov logic networks , 2006, Machine Learning.

[9]  Xueyan Jiang,et al.  Reducing the Rank in Relational Factorization Models by Including Observable Patterns , 2014, NIPS.

[10]  Oren Etzioni,et al.  Open Information Extraction from the Web , 2007, CACM.

[11]  R. Bro PARAFAC. Tutorial and applications , 1997 .

[12]  Evgeniy Gabrilovich,et al.  Constructing and Mining Web-scale Knowledge Graphs , 2016, SIGIR.

[13]  Nicolas Le Roux,et al.  A latent factor model for highly multi-relational data , 2012, NIPS.

[14]  Lars Schmidt-Thieme,et al.  BPR: Bayesian Personalized Ranking from Implicit Feedback , 2009, UAI.

[15]  Patrick Seemann,et al.  Matrix Factorization Techniques for Recommender Systems , 2014 .

[16]  Danqi Chen,et al.  Reasoning With Neural Tensor Networks for Knowledge Base Completion , 2013, NIPS.

[17]  Mohammad Al Hasan,et al.  Name disambiguation from link data in a collaboration graph using temporal and topological features , 2014, Social Network Analysis and Mining.

[18]  Hans-Peter Kriegel,et al.  A Three-Way Model for Collective Learning on Multi-Relational Data , 2011, ICML.

[19]  David Liben-Nowell,et al.  The link-prediction problem for social networks , 2007 .

[20]  Lise Getoor,et al.  Learning Probabilistic Relational Models , 1999, IJCAI.

[21]  Mohammad Al Hasan,et al.  Name disambiguation from link data in a collaboration graph , 2014, ASONAM.

[22]  Dejing Dou,et al.  Learning to Refine an Automatically Extracted Knowledge Base Using Markov Logic , 2012, 2012 IEEE 12th International Conference on Data Mining.

[23]  Alfred O. Hero,et al.  Deep Community Detection , 2014, IEEE Transactions on Signal Processing.

[24]  Andrew McCallum,et al.  Relation Extraction with Matrix Factorization and Universal Schemas , 2013, NAACL.

[25]  Jennifer Chu-Carroll,et al.  Building Watson: An Overview of the DeepQA Project , 2010, AI Mag..

[26]  Wei Zhang,et al.  Knowledge vault: a web-scale approach to probabilistic knowledge fusion , 2014, KDD.

[27]  Ni Lao,et al.  Relational retrieval using a combination of path-constrained random walks , 2010, Machine Learning.

[28]  Ben Taskar,et al.  Discriminative Probabilistic Models for Relational Data , 2002, UAI.

[29]  Mohammad Al Hasan,et al.  Link prediction using supervised learning , 2006 .

[30]  Kai-Wei Chang,et al.  Typed Tensor Decomposition of Knowledge Bases for Relation Extraction , 2014, EMNLP.

[31]  Xueyan Jiang,et al.  Link Prediction in Multi-relational Graphs using Additive Models , 2012, SeRSy.

[32]  Tom M. Mitchell,et al.  Random Walk Inference and Learning in A Large Scale Knowledge Base , 2011, EMNLP.