Expanding Wikidata's Parenthood Information by 178%, or How To Mine Relation Cardinalities
While automated knowledge base construction has so far largely focused on fully qualified facts, e.g. 〈Obama, hasChild, Malia〉, the Web also contains extensive amounts of cardinality information, such as statements that someone has two children without giving their names. In this paper we argue that extracting such information could substantially increase the scope of knowledge bases. Taking the hasChild relation in Wikidata as an example, we show that simple regular-expression-based extraction from Wikipedia can increase the size of the relation by 178%. We also show how such cardinality information can be used to estimate the recall of knowledge bases.
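As a rough illustration of the idea, the following minimal Python sketch extracts a child count from a sentence with a single regular expression and compares it with the number of hasChild facts already known for that person. The pattern, the number-word table, and the function names `extract_child_count` and `estimate_recall` are assumptions made for this example; they are not the patterns or the recall estimator used in the paper.

```python
import re

# Hypothetical number-word lookup for this sketch (not from the paper).
NUMBER_WORDS = {
    "one": 1, "two": 2, "three": 3, "four": 4, "five": 5,
    "six": 6, "seven": 7, "eight": 8, "nine": 9, "ten": 10,
}

# Illustrative pattern: "has/had/have <number> child/children".
CHILD_PATTERN = re.compile(
    r"\b(?:has|had|have)\s+(\d+|" + "|".join(NUMBER_WORDS) + r")\s+child(?:ren)?\b",
    re.IGNORECASE,
)


def extract_child_count(text):
    """Return the child count stated in `text`, or None if no match."""
    match = CHILD_PATTERN.search(text)
    if match is None:
        return None
    token = match.group(1).lower()
    return int(token) if token.isdigit() else NUMBER_WORDS[token]


def estimate_recall(known_children, stated_count):
    """Fraction of the stated children that the knowledge base already lists.

    Returns None when no usable cardinality is available. The value is
    capped at 1.0 in case the knowledge base lists more children than
    the text states.
    """
    if stated_count is None or stated_count <= 0:
        return None
    return min(1.0, known_children / stated_count)
```

For instance, if a sentence states that a person has two children and the knowledge base lists one of them, `estimate_recall(1, extract_child_count("Obama has two children."))` gives a recall estimate of one half for that person's hasChild facts.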