Epitaph or Breaking News? Analyzing and Predicting the Stability of Knowledge Base Properties

Knowledge bases (KBs) contain huge amounts of facts about entities, their properties, and relations between them. They are thus the key asset in any intelligent system for tasks such as structured search and question answering. However, due to dynamics in the real world, properties and relations change over time, and stored knowledge may become outdated. While KB information evolves steadily, there is no information whether or not a KB property might be subject to change with high probability or whether it is likely to be stable. Systems exploiting KB information, however, could benefit a lot if they had access to this kind of information. In this paper, we analyze and predict the stability of KB entries, which allows to accompany entries with stability scores. Our predictive model exploits entity-based features and learns through historic data. A particular challenge to determine stability scores is that KB entries are not only added or modified due to real-world changes but also to reduce the incompleteness of KBs in general. Nevertheless, our evaluation of sample properties demonstrates the effectiveness of our method for predicting the one-year stability of KB properties.

[1]  Clive Loughlin Researching the future , 2009 .

[2]  Tom M. Mitchell,et al.  "A Spousal Relation Begins with a Deletion of engage and Ends with an Addition of divorce": Learning State Changing Verbs from Wikipedia Revision History , 2015, EMNLP.

[3]  Jiawei Han,et al.  Entity Linking with a Knowledge Base: Issues, Techniques, and Solutions , 2015, IEEE Transactions on Knowledge and Data Engineering.

[4]  Thomas Pellissier Tanon,et al.  Property Label Stability in Wikidata: Evolution and Convergence of Schemas in Collaborative Knowledge Bases , 2018, WWW.

[5]  Yanghua Xiao,et al.  How to Keep a Knowledge Base Synchronized with Its Encyclopedia Source , 2017, IJCAI.

[6]  Wolfgang Nejdl,et al.  Extracting Event-Related Information from Article Updates in Wikipedia , 2013, ECIR.

[7]  R. Baeza-Yates Searching the Future , 2022 .

[8]  M. de Rijke,et al.  Query modeling for entity search based on terms, categories, and examples , 2011, TOIS.

[9]  Gerhard Weikum,et al.  TEQUILA: Temporal Question Answering over Knowledge Bases , 2018, CIKM.

[10]  Joemon M. Jose,et al.  Wikipedia as a time machine , 2014, WWW.

[11]  Oren Etzioni,et al.  Open question answering over curated and extracted knowledge bases , 2014, KDD.

[12]  Wenfei Fan,et al.  Inferring data currency and consistency for conflict resolution , 2013, 2013 IEEE 29th International Conference on Data Engineering (ICDE).

[13]  Simon Razniewski,et al.  "Knowledge base recall: detecting and resolving the unknown unknowns" by Simon Razniewski and Gerhard Weikum with Martin Vesely as coordinator , 2018, SIGWEB Newsl..

[14]  Gerhard Weikum,et al.  diaNED: Time-Aware Named Entity Disambiguation for Diachronic Corpora , 2018, ACL.

[15]  Leon Derczynski,et al.  Time and information retrieval: Introduction to the special issue , 2015, Inf. Process. Manag..

[16]  Zhen Wang,et al.  Knowledge Graph Embedding by Translating on Hyperplanes , 2014, AAAI.

[17]  Simon Razniewski,et al.  Optimizing Update Frequencies for Decaying Information , 2016, CIKM.

[18]  Simon Razniewski,et al.  Predicting Completeness in Knowledge Bases , 2016, WSDM.