Hybrid Index Structure based on MBB Approximation for Linked Data

Although Linked Data has given the pragmatic approach to the Semantic Web some traction, many problems in the area remain open. Because Linked Data are modeled as RDF graphs, existing solutions from database systems or Web technologies cannot be adopted directly. This paper presents a hybrid query-processing method that combines the centralized and distributed approaches to improve join query performance. Auxiliary indexes identify the distributed data resources that participate in a query result, which sharply reduces the amount of data that must be accessed on demand. The performance of the proposed index structure is compared with that of some existing methods on a real RDF dataset. Our method outperforms the existing methods because it filters out a large number of irrelevant resources.
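To illustrate how a bounding-box summary can filter out irrelevant sources before any remote access, the following is a minimal sketch and not the paper's actual index structure. It assumes that MBB stands for minimum bounding box, that each RDF term is hashed to an integer coordinate, and that each source is summarized by a single box in a three-dimensional (subject, predicate, object) space. The names `coord`, `MBB`, `query_box`, `candidate_sources`, and the constant `SPACE` are all hypothetical.

```python
import zlib
from dataclasses import dataclass
from typing import Dict, Iterable, List, Optional, Tuple

SPACE = 2 ** 16  # assumed size of each coordinate axis

Triple = Tuple[str, str, str]
Pattern = Tuple[Optional[str], Optional[str], Optional[str]]  # None = variable


def coord(term: str) -> int:
    """Map an RDF term to an integer coordinate (illustrative hash, an assumption)."""
    return zlib.crc32(term.encode("utf-8")) % SPACE


@dataclass
class MBB:
    """Axis-aligned box over (subject, predicate, object) coordinates."""
    lo: Tuple[int, int, int]
    hi: Tuple[int, int, int]

    @staticmethod
    def of(triples: Iterable[Triple]) -> "MBB":
        # Smallest box enclosing all triples of one source (needs >= 1 triple).
        points = [tuple(coord(t) for t in triple) for triple in triples]
        lo = tuple(min(p[i] for p in points) for i in range(3))
        hi = tuple(max(p[i] for p in points) for i in range(3))
        return MBB(lo, hi)

    def intersects(self, other: "MBB") -> bool:
        # Boxes overlap only if their intervals overlap on every axis.
        return all(
            self.lo[i] <= other.hi[i] and other.lo[i] <= self.hi[i]
            for i in range(3)
        )


def query_box(pattern: Pattern) -> MBB:
    # A bound term becomes a single point on its axis; a variable spans the axis.
    lo = tuple(0 if t is None else coord(t) for t in pattern)
    hi = tuple(SPACE - 1 if t is None else coord(t) for t in pattern)
    return MBB(lo, hi)


def candidate_sources(index: Dict[str, MBB], pattern: Pattern) -> List[str]:
    # Keep only sources whose box overlaps the query box; the rest are skipped.
    box = query_box(pattern)
    return sorted(src for src, mbb in index.items() if mbb.intersects(box))
```

Under these assumptions, a source that contains a matching triple is never discarded, because its box always covers the query point. A source whose box happens to overlap the query box without containing a match is a false positive and still has to be fetched, which is why a single box per source is only a coarse approximation.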
