Applying a Multi-Level Modeling Theory to Assess Taxonomic Hierarchies in Wikidata

Wikidata captures structured data on a number of subject domains, managing, among others, the information underlying Wikipedia and other Wikimedia projects. Wikidata serves as a repository of structured data, whose purpose is to support the consistent sharing and linking of data on the Web. To support these purposes, it is key that Wikidata is built on consistent data models and representation schemas, which are constructed and managed in a collaborative platform. In this paper, we address the quality of taxonomic hierarchies in Wikidata. We focus on taxonomic hierarchies with entities at different classification levels (particular individuals, types of individuals, types of types of individuals, etc.). We use an axiomatic theory for multi-level modeling to analyze current Wikidata content, and identify a significant number of problematic classification and taxonomic statements. The problems seem to arise from an inadequate use of instantiation and subclassing in certain Wikidata hierarchies.

[1]  E. Mayr The Growth of Biological Thought: Diversity, Evolution, and Inheritance , 1983 .

[2]  Bernd Neumayr,et al.  Multi-Level Domain Modeling with M-Objects and M-Relationships , 2009, APCCM.

[3]  James Odell,et al.  Power Types , 1994, J. Object Oriented Program..

[4]  Colin Atkinson,et al.  Meta-level Independent Modelling , 2000 .

[5]  Giancarlo Guizzardi,et al.  Ontological foundations for structural conceptual models , 2005 .

[6]  Brian Henderson-Sellers,et al.  A powertype-based metamodelling framework , 2006, Software & Systems Modeling.

[7]  João Paulo A. Almeida,et al.  Towards a Well-Founded Theory for Multi-Level Conceptual Modelling , 2015 .

[8]  Thomas Kühne Contrasting Classification with Generalisation , 2009, APCCM.

[9]  Markus Krötzsch,et al.  Wikidata , 2014, Commun. ACM.

[10]  Colin Atkinson,et al.  The Essence of Multilevel Metamodeling , 2001, UML.

[11]  Luca Cardelli,et al.  Structural subtyping and the notion of power type , 1988, POPL '88.

[12]  Mark Needleman,et al.  The W3C Semantic Web Activity , 2003 .

[13]  João Paulo A. Almeida,et al.  Toward a well-founded theory for multi-level conceptual modeling , 2018, Software & Systems Modeling.

[14]  Giancarlo Guizzardi,et al.  Extending the Foundations of Ontology-Based Conceptual Modeling with a Multi-level Theory , 2015, ER.

[15]  Giancarlo Guizzardi,et al.  Using a Well-Founded Multi-level Theory to Support the Analysis and Representation of the Powertype Pattern in Conceptual Modeling , 2016, CAiSE.