Scalable RDF triple store using summary of hashed information and Bit comparison

In this paper, we proposed a scalable RDF triple store for massive-scale RDF data that processes the SPARQL query with many join operations in efficient manner. Graph characteristic of RDF data model hinders scalable and efficient indexing and querying over RDF triples. To address the problem, our query processing uses the pruning algorithm based on Bit-structure and summarized information to minimize data-reading. Our approach guarantees scalability and flexibility even for massive-scale RDF data by storing RDF triples in distributed fashion, providing the modifiable structure, and optimizing memory footprint of usage. The experiments shows that our system is better performing for queries with many join operations while uses less memory footprints.

[1]  Deborah L. McGuinness,et al.  Tracking RDF Graph Provenance using RDF Molecules , 2005 .

[2]  Frank van Harmelen,et al.  Sesame: A Generic Architecture for Storing and Querying RDF and RDF Schema , 2002, SEMWEB.

[3]  Olivier Curé,et al.  RDF Database Systems: Triples Storage and SPARQL Query Processing , 2014 .

[4]  Jane Hunter,et al.  A scale-out RDF molecule store for distributed processing of biomedical data , 2008, WWW 2008.

[5]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[6]  Gerhard Weikum,et al.  RDF-3X: a RISC-style engine for RDF , 2008, Proc. VLDB Endow..

[7]  Sherif Sakr,et al.  D-SPARQ: Distributed, Scalable and Efficient RDF Query Engine , 2013, International Semantic Web Conference.

[8]  Timothy W. Finin,et al.  Policy-Based Access Control for an RDF Store , 2005, IJCAI 2007.

[9]  James A. Hendler,et al.  BitMat: A Main-memory Bit Matrix of RDF Triples for Conjunctive Triple Pattern Queries , 2008, SEMWEB.

[10]  Abraham Bernstein,et al.  Hexastore: sextuple indexing for semantic web data management , 2008, Proc. VLDB Endow..

[11]  Nina Amenta,et al.  Efficient hash tables on the gpu , 2011 .

[12]  Kevin Skadron,et al.  Accelerating Braided B+ Tree Searches on a GPU with CUDA , 2011 .

[13]  James A. Hendler,et al.  Proceedings of the Policy Management for the Web workshop , 2005 .

[14]  Andreas Harth,et al.  Optimized index structures for querying RDF from the Web , 2005, Third Latin American Web Congress (LA-WEB'2005).

[15]  Guan Le,et al.  Survey on NoSQL database , 2011, 2011 6th International Conference on Pervasive Computing and Applications.