Walking Linked Data: a Graph Traversal Approach to Explain Clusters

Link traversal is one of the main advantages of Linked Data: because data from different sources are naturally connected, following links allows the serendipitous discovery of new knowledge. Our general problem is to understand how this property can benefit the Knowledge Discovery process. In particular, we aim to use Linked Data to explain patterns extracted by a typical data mining process such as clustering. The strategy we propose is Linked Data traversal: we explore and build on the fly an initially unknown Linked Data graph by simply dereferencing entities' URIs and following the links between entities until we find a valid explanation for our clusters. The experiments section gives an insight into the time performance and scalability of this approach, and shows how following links easily gathers knowledge from different data sources.
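The traversal idea can be illustrated with a minimal sketch. It is not the authors' algorithm, and it makes several assumptions: dereferencing is replaced by a lookup in a hypothetical in-memory dictionary (TOY_WEB) rather than real HTTP requests, the graph is expanded breadth-first with no heuristic, and an "explanation" is simplified to a property path that leads every cluster member to the same value. All URIs, the function names dereference and explain_cluster, and the max_depth parameter are invented for illustration.

```python
from typing import Callable, Dict, List, Optional, Sequence, Set, Tuple

Triple = Tuple[str, str, str]          # (subject, predicate, object)
Path = Tuple[str, ...]                 # chain of predicates followed so far

# Hypothetical stand-in for the Web: URI -> triples returned when it is dereferenced.
TOY_WEB: Dict[str, List[Triple]] = {
    "ex:paper1": [("ex:paper1", "ex:author", "ex:alice")],
    "ex:paper2": [("ex:paper2", "ex:author", "ex:bob")],
    "ex:paper3": [("ex:paper3", "ex:author", "ex:carol")],
    "ex:alice": [("ex:alice", "ex:affiliation", "ex:kmi")],
    "ex:bob": [("ex:bob", "ex:affiliation", "ex:kmi")],
    "ex:carol": [("ex:carol", "ex:affiliation", "ex:other")],
}


def dereference(uri: str) -> List[Triple]:
    """Assumed dereferencing step: a real system would issue an HTTP GET here."""
    return TOY_WEB.get(uri, [])


def explain_cluster(
    cluster: Sequence[str],
    deref: Callable[[str], List[Triple]],
    max_depth: int = 3,
) -> Optional[Tuple[Path, str]]:
    """Expand the graph one hop at a time from every cluster member and return
    the first (property path, value) pair reached by all members, or None."""
    members = set(cluster)
    # For each member, the (path, node) pairs reached at the current depth.
    frontier: Dict[str, List[Tuple[Path, str]]] = {e: [((), e)] for e in cluster}
    for _ in range(max_depth):
        next_frontier: Dict[str, List[Tuple[Path, str]]] = {e: [] for e in cluster}
        reached: Dict[Tuple[Path, str], Set[str]] = {}
        for entity, states in frontier.items():
            for path, node in states:
                # The graph is only discovered here, by dereferencing the node.
                for _subj, pred, obj in deref(node):
                    new_path = path + (pred,)
                    next_frontier[entity].append((new_path, obj))
                    reached.setdefault((new_path, obj), set()).add(entity)
        # Check candidates in sorted order so the result is deterministic.
        for candidate in sorted(reached):
            if reached[candidate] == members:
                return candidate
        frontier = next_frontier
    return None


if __name__ == "__main__":
    print(explain_cluster(["ex:paper1", "ex:paper2"], dereference))
```

In this toy setting, the two papers share no value after one hop, so the search continues through their authors and stops at a common affiliation. A realistic version would also need to score how well a candidate separates the cluster from items outside it, and to bound the number of dereferencing requests.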