SWARM: An Approach for Mining Semantic Association Rules from Semantic Web Data

The ever growing amount of Semantic Web data has made it increasingly difficult to analyse the information required by the users. Association rule mining is one of the most useful techniques for discovering frequent patterns among RDF triples. In this context, some statistical methods strongly rely on the user intervention that is time-consuming and error-prone due to a large amount of data. In these studies, the rule quality factors (e.g. Support and Confidence measures) consider only knowledge in the instance-level data. However, Semantic Web data contains knowledge in both instance-level and schema-level. In this paper, we introduce an approach called SWARM (Semantic Web Association Rule Mining) to automatically mine Semantic Association Rules from RDF data. We discuss how to utilize knowledge encode in the schema-level to enrich the semantics of rules. We also show that our approach is able to reveal common behavioral patterns associated with knowledge in the instance-level and schema-level. The proposed rule quality factors (Support and Confidence) consider knowledge not only in the instance-level but also schema-level. Experiments performed on the DBpedia Dataset (3.8) demonstrate the usefulness of the proposed approach.

[1]  Fabian M. Suchanek,et al.  Fast rule mining in ontological knowledge bases with AMIE+\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$+$$\end{docu , 2015, The VLDB Journal.

[2]  Jian Pei,et al.  Mining frequent patterns without candidate generation , 2000, SIGMOD 2000.

[3]  Tim Berners-Lee,et al.  Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..

[4]  Bart Goethals,et al.  Relational Association Rules: Getting WARMeR , 2002, Pattern Detection and Discovery.

[5]  Felix Naumann,et al.  Amending RDF Entities with New Facts , 2014, KNOW@LOD.

[6]  Felix Naumann,et al.  Improving RDF Data Through Association Rule Mining , 2013, Datenbank-Spektrum.

[7]  Jianfeng Gao,et al.  Embedding Entities and Relations for Learning and Inference in Knowledge Bases , 2014, ICLR.

[8]  Stephen Muggleton,et al.  Inverse entailment and progol , 1995, New Generation Computing.

[9]  Fabian M. Suchanek,et al.  AMIE: association rule mining under incomplete evidence in ontological knowledge bases , 2013, WWW.

[10]  Shamim Ripon,et al.  Knowledge-based Data Mining Using Semantic Web☆ , 2014 .

[11]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[12]  Rafael Berlanga Llavori,et al.  Finding association rules in semantic web data , 2012, Knowl. Based Syst..

[13]  Jiawei Han,et al.  Frequent pattern mining: current status and future directions , 2007, Data Mining and Knowledge Discovery.