Expanding Wikidata's Parenthood Information by 178%, or How To Mine Relation Cardinalities

While automated knowledge base construction has so far largely focused on fully qualified facts, e.g. 〈Obama, hasChild, Malia〉, the Web also contains extensive amounts of cardinality information, such as a statement that someone has two children that does not give their names. In this paper, we argue that extracting such information could substantially increase the scope of knowledge bases. Taking the hasChild relation in Wikidata as an example, we show that simple regular-expression-based extraction from Wikipedia can increase the size of the relation by 178%. We also show how such cardinality information can be used to estimate the recall of knowledge bases.
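To make the two ideas above concrete, the following is a minimal Python sketch of regular-expression-based cardinality extraction and a recall estimate derived from it. It is an illustration under stated assumptions, not the patterns or estimator used in this paper: the pattern `CHILD_PATTERN`, the lookup `NUMBER_WORDS`, and the helpers `extract_child_count` and `estimate_recall` are all hypothetical names, and the pattern covers only phrases of the form "has/have/had N child(ren)".

```python
import re

# Hypothetical lookup for spelled-out numbers (assumption: one to ten is enough
# for an illustration).
NUMBER_WORDS = {
    "one": 1, "two": 2, "three": 3, "four": 4, "five": 5,
    "six": 6, "seven": 7, "eight": 8, "nine": 9, "ten": 10,
}

# Illustrative pattern only: matches e.g. "has two children" or "had 3 children".
CHILD_PATTERN = re.compile(
    r"\b(?:has|have|had)\s+(?P<num>\d+|" + "|".join(NUMBER_WORDS) + r")"
    r"\s+(?:children|child)\b",
    re.IGNORECASE,
)


def extract_child_count(text):
    """Return the first child count stated in `text`, or None if nothing matches."""
    match = CHILD_PATTERN.search(text)
    if match is None:
        return None
    token = match.group("num").lower()
    return int(token) if token.isdigit() else NUMBER_WORDS[token]


def estimate_recall(known_children, stated_count):
    """Fraction of the stated children that the knowledge base already names.

    Returns None when the stated count is not positive, since the ratio is
    undefined in that case. The result is capped at 1.0.
    """
    if stated_count <= 0:
        return None
    return min(known_children, stated_count) / stated_count
```

For instance, if a sentence yields a count of two and the knowledge base names one child for that person, `estimate_recall(1, 2)` gives 0.5 for that entity. A real extractor would need many more patterns (for example "father of three") and handling of conflicting counts, which this sketch leaves out.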