SWApriori : a new approach to mining association rules from semantic web data

With the introduction and standardization of the semantic web as the third generation of the Web, this technology has attracted and received more human attention than ever and thus the amount of semantic web data is constantly growing. These semantic web data are a rich source of useful knowledge for feeding data mining techniques. Semantic web data have some complexities, such as the heterogeneous structure of the data, the lack of exactly defined transactions, the existence of typed relationships between entities etc. One of the data mining techniques is association rule mining, the goal of which is to find interesting rules based on frequent item-sets. In this paper we propose a novel method that considers the complex nature of semantic web data and, without end-user involvement and any data conversion to traditional forms, mines association rules directly from semantic web datasetsat the instance level. This method assumes that data have been stored in triple format (Subject, Predicate, and Object) in a single dataset. For evaluation purposes the proposed method has been applied to a drugs dataset that experiments results show the ability of the proposed algorithm in mining ARs from semantic web data without end-user involvement.

[1]  Witold Pedrycz,et al.  An improved association rules mining method , 2012, Expert Syst. Appl..

[2]  Dimitris Kontokostas,et al.  Internationalization of Linked Data: The case of the Greek DBpedia edition , 2012, J. Web Semant..

[3]  Nada Lavrac,et al.  Using Ontologies in Semantic Data Mining with SEGS and g-SEGS , 2011, Discovery Science.

[4]  Ryutaro Ichise,et al.  LiDDM: A Data Mining System for Linked Data , 2011, LDOW.

[5]  Felix Naumann,et al.  Context and target configurations for mining RDF data , 2011, SMER '11.

[6]  Yun Chi,et al.  Frequent Subtree Mining - An Overview , 2004, Fundam. Informaticae.

[7]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[8]  Tim Berners-Lee,et al.  Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..

[9]  Ulrich Güntzer,et al.  Algorithms for association rule mining — a general survey and comparison , 2000, SKDD.

[10]  Nicola Fanizzi,et al.  Metric-based stochastic conceptual clustering for ontologies , 2009, Inf. Syst..

[11]  Tao Jiang,et al.  Mining RDF metadata for generalized association rules: knowledge discovery in the semantic web era , 2006, WWW '06.

[12]  E. Prud hommeaux,et al.  SPARQL query language for RDF , 2011 .

[13]  Mohammad Saraee,et al.  Finding association rules in linked data, a centralization approach , 2013, 2013 21st Iranian Conference on Electrical Engineering (ICEE).

[14]  Benjamin C. M. Fung,et al.  Hierarchical Document Clustering using Frequent Itemsets , 2003, SDM.

[15]  Krys J. Kochut,et al.  SPARQLeR: Extended Sparql for Semantic Association Discovery , 2007, ESWC.

[16]  Francisco Herrera,et al.  Special issue on "New Trends in Data Mining" NTDM , 2012, Knowl. Based Syst..

[17]  Jens Lehmann,et al.  DBpedia - A crystallization point for the Web of Data , 2009, J. Web Semant..

[18]  R GruberThomas Toward principles for the design of ontologies used for knowledge sharing , 1995 .

[19]  Huilin Wang,et al.  Discovering Associations among Semantic Links , 2009, 2009 International Conference on Web Information Systems and Mining.

[20]  Thomas R. Gruber,et al.  Toward principles for the design of ontologies used for knowledge sharing? , 1995, Int. J. Hum. Comput. Stud..

[21]  Rafael Berlanga Llavori,et al.  Finding association rules in semantic web data , 2012, Knowl. Based Syst..

[22]  Ian Horrocks,et al.  OIL: An Ontology Infrastructure for the Semantic Web , 2001, IEEE Intell. Syst..

[23]  Shichao Zhang,et al.  Association Rule Mining: Models and Algorithms , 2002 .

[24]  Tae-Sun Chung,et al.  An Improved Frequent Pattern Tree Based Association Rule Mining Technique , 2011, 2011 International Conference on Information Science and Applications.

[25]  Jacinto Mata Vázquez,et al.  An evolutionary algorithm to discover quantitative association rules from huge databases without the need for an a priori discretization , 2012, Expert Syst. Appl..

[26]  Das Amrita,et al.  Mining Association Rules between Sets of Items in Large Databases , 2013 .

[27]  Shingo Mabu,et al.  A novel evolutionary method to search interesting association rules by keywords , 2011, Expert Syst. Appl..

[28]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[29]  William Frawley,et al.  Knowledge Discovery in Databases , 1991 .

[30]  Max Bramer,et al.  Principles of Data Mining , 2013, Undergraduate Topics in Computer Science.

[31]  Stephan Bloehdorn,et al.  Kernel Methods for Mining Instance Data in Ontologies , 2007, ISWC/ASWC.

[32]  Abraham Bernstein,et al.  Adding Data Mining Support to SPARQL Via Statistical Relational Learning Methods , 2008, ESWC.

[33]  Bernardo Cuenca Grau,et al.  OWL 2 Web Ontology Language: Profiles , 2009 .

[34]  Vivek Tiwari,et al.  Association rule mining: A graph based approach for mining frequent itemsets , 2010, 2010 International Conference on Networking and Information Technology.

[35]  Mustafa Mat Deris,et al.  A soft set approach for association rules mining , 2011, Knowl. Based Syst..

[36]  Rakesh Agarwal,et al.  Fast Algorithms for Mining Association Rules , 1994, VLDB 1994.

[37]  Christian Hidber,et al.  Association Rule Mining , 2017 .

[38]  Luc De Raedt,et al.  Inductive Logic Programming: Theory and Methods , 1994, J. Log. Program..

[39]  Christian Böhm,et al.  Frequent subgraph discovery in dynamic networks , 2010, MLG '10.

[40]  Andreas Hotho,et al.  Semantic Web Mining: State of the art and future directions , 2006, J. Web Semant..

[41]  Lise Getoor,et al.  Link mining: a new data mining challenge , 2003, SKDD.

[42]  Bart Baesens,et al.  New Trends in Data Mining , 2005 .

[43]  E. Rambabu,et al.  Association rule mining using FPTree as directed acyclic graph , 2012, IEEE-International Conference On Advances In Engineering, Science And Management (ICAESM -2012).