Statistical Knowledge Patterns for Characterising Linked Data

Knowledge Patterns (KPs), and even more specifically Ontology Design Patterns (ODPs), are no longer only generated in a top-down fashion, rather patterns are being extracted in a bottom-up fashion from online ontologies and data sources, such as Linked Data. These KPs can assist in tasks such as making sense of datasets and formulating queries over data, including performing query expansion to manage the diversity of properties used in datasets. This paper presents an extraction method for generating what we call Statistical Knowledge Patterns (SKPs) from Linked Data. SKPs describe and characterise classes from any reference ontology, by presenting their most frequent properties and property characteristics, all based on analysis of the underlying data. SKPs are stored as small OWL ontologies but can be continuously updated in a completely automated fashion. In the paper we exemplify this method by applying it to the classes of the DBpedia ontology, and in particular we evaluate our method for extracting range axioms from data. Results show that by setting appropriate thresholds, SKPs can be generated that cover (i.e. allow us to query, using the properties of the SKP) over 94% of the triples about individuals of that class, while only needing to care about 27% of the total number of distinct properties that are used in the data.

[1]  Jens Lehmann,et al.  LODStats - An Extensible Framework for High-Performance Dataset Analytics , 2012, EKAW.

[2]  Aldo Gangemi,et al.  Ontology Design Patterns , 2005 .

[3]  Eva Blomqvist OntoCase-Automatic Ontology Enrichment Based on Ontology Design Patterns , 2009, International Semantic Web Conference.

[4]  Aldo Gangemi,et al.  Pattern-Based Ontology Design , 2012, Ontology Engineering in a Networked World.

[5]  Andrea Giovanni Nuzzolese Knowledge Pattern Extraction and Their Usage in Exploratory Search , 2012, International Semantic Web Conference.

[6]  Jérôme David,et al.  Keys and Pseudo-Keys Detection for Web Datasets Cleansing and Interlinking , 2012, EKAW.

[7]  Isabelle Mirbel,et al.  DFS-based frequent graph pattern extraction to characterize the content of RDF Triple Stores , 2010 .

[8]  Kurt Sandkuhl,et al.  Patterns in Ontology Engineering: Classification of Ontology Patterns , 2005, ICEIS.

[9]  Aldo Gangemi,et al.  Towards a pattern science for the Semantic Web , 2010, Semantic Web.

[10]  Aldo Gangemi,et al.  Ontology Design Patterns for Semantic Web Content , 2005, SEMWEB.

[11]  Isabelle Augenstein,et al.  Mining Equivalent Relations from Linked Data , 2013, ACL.

[12]  Andrea Giovanni Nuzzolese,et al.  Encyclopedic Knowledge Patterns from Wikipedia Links , 2011, SEMWEB.

[13]  Lora Aroyo,et al.  Extracting Core Knowledge from Linked Data , 2011, COLD.

[14]  Isabelle Augenstein,et al.  Statistical Knowledge Patterns: Identifying Synonymous Relations in Large Linked Datasets , 2013, International Semantic Web Conference.

[15]  Aldo Gangemi,et al.  Aemoo: Exploratory Search based on KnowledgePatterns over the Semantic Web , 2011 .