Mining Definitions from RDF Annotations Using Formal Concept Analysis

The popularization and quick growth of Linked Open Data (LOD) has led to challenging aspects regarding quality assessment and data exploration of the RDF triples that shape the LOD cloud. Particularly, we are interested in the completeness of data and its potential to provide concept definitions in terms of necessary and sufficient conditions. In this work we propose a novel technique based on Formal Concept Analysis which organizes RDF data into a concept lattice. This allows data exploration as well as the discovery of implications, which are used to automatically detect missing information and then to complete RDF data. Moreover, this is a way of reconciling syntax and semantics in the LOD cloud. Finally, experiments on the DBpedia knowledge base show that the approach is wellfounded and effective.

[1]  Amedeo Napoli,et al.  Mining gene expression data with pattern structures in formal concept analysis , 2011, Inf. Sci..

[2]  Heiko Paulheim,et al.  Detecting Incorrect Numerical Data in DBpedia , 2014, ESWC.

[3]  Dominik Benz,et al.  Semantics made by you and me: Self-emerging ontologies can capture the diversity of shared knowledge , 2010 .

[4]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[5]  Heiko Paulheim,et al.  Type Inference on Noisy RDF Data , 2013, SEMWEB.

[6]  Martin Hepp,et al.  Swiqa - a semantic web information quality assessment framework , 2011, ECIS.

[7]  Bernhard Ganter,et al.  Pattern Structures and Their Projections , 2001, ICCS.

[8]  Jens Lehmann,et al.  User-driven quality evaluation of DBpedia , 2013, I-SEMANTICS '13.

[9]  Tim Berners-Lee,et al.  Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..

[10]  Bernhard Ganter,et al.  Formal Concept Analysis: Mathematical Foundations , 1998 .

[11]  Jeff Heflin,et al.  Detecting abnormal data for ontology based information integration , 2011, 2011 International Conference on Collaboration Technologies and Systems (CTS).

[12]  Claudio Carpineto,et al.  Concept data analysis - theory and applications , 2004 .

[13]  Víctor Codocedo,et al.  A Proposition for Combining Pattern Structures and Relational Concept Analysis , 2014, ICFCA.

[14]  Jeff Heflin,et al.  Extending Functional Dependency to Detect Abnormal Data in RDF Graphs , 2011, SEMWEB.

[15]  B. C. Brookes,et al.  Information Sciences , 2020, Cognitive Skills You Need for the 21st Century.