A Survey on RDF Data Store Based on NoSQL Systems for the Semantic Web Applications

Today the Resource Description Framework (RDF) that allows computers to understand and exploit Web data becomes very much in a progressive way, as well as the amount of web data that becomes very large. The storage and efficient management of this large RDF data is a real challenge in front of the classic RDF databases called triplestore. Recently, several researches focus on storing RDF data in triplestores based on NoSQL data management systems like HBase, Cassandra, Accumulo, and Couchbase. The majority of these researches are based on HBase. This NoSQL technology that is intended to handle this phenomenon of data explosion called Big Data, provided benefits like scalability and high availability compared to traditional triplestores. In this paper, we review existing works and systems that use NoSQL databases to store massive RDF data.

[1]  Mouad Banane,et al.  Storing RDF Data into Big Data NoSQL Databases , 2017 .

[2]  Adina Crainiceanu,et al.  Rya: a scalable RDF triple store for the clouds , 2012, Cloud-I '12.

[3]  J. Chris Anderson,et al.  CouchDB: The Definitive Guide , 2010 .

[4]  Mouad Banane,et al.  RDF Data Management Systems Based on NoSQL Databases: A Comparative Study , 2018 .

[5]  Yon Dohn Chung,et al.  SPIDER: a system for scalable, parallel / distributed evaluation of large-scale RDF data , 2009, CIKM.

[6]  Andreas Harth,et al.  CumulusRDF: Linked Data Management on Nested Key-Value Stores , 2011 .

[7]  Abdessamad Belangour,et al.  A Comparative Study of Hadoop-based Big Data Architectures , 2017, Int. J. Web Appl..

[8]  Bhavani M. Thuraisingham,et al.  Jena-HBase: A Distributed, Scalable and Effcient RDF Triple Store , 2012, SEMWEB.

[9]  Ioannis Konstantinou,et al.  H2RDF: adaptive query processing on RDF data in the cloud. , 2012, WWW.

[10]  Paul T. Groth,et al.  NoSQL Databases for RDF: An Empirical Evaluation , 2013, International Semantic Web Conference.

[11]  Abdessamad Belangour,et al.  A Big Data Hadoop building blocks comparative study , 2017 .

[12]  Mc Brown Getting Started with Couchbase Server , 2012 .

[13]  Georg Lausen,et al.  PigSPARQL: A SPARQL Query Processing Baseline for Big Data , 2013, International Semantic Web Conference.

[14]  Rusty Klophaus,et al.  Riak Core: building distributed applications without shared state , 2010, CUFP '10.

[15]  Wilson C. Hsieh,et al.  Bigtable: A Distributed Storage System for Structured Data , 2006, TOCS.

[16]  Jianling Sun,et al.  Scalable RDF store based on HBase and MapReduce , 2010, 2010 3rd International Conference on Advanced Computer Theory and Engineering(ICACTE).

[17]  Rong Gu,et al.  Rainbow: A distributed and hierarchical RDF triple store with dynamic scalability , 2014, 2014 IEEE International Conference on Big Data (Big Data).

[18]  Gordon S. Blair,et al.  A generic component model for building systems software , 2008, TOCS.

[19]  Jeff Heflin,et al.  LUBM: A benchmark for OWL knowledge base systems , 2005, J. Web Semant..

[20]  Borislav Iordanov,et al.  HyperGraphDB: A Generalized Graph Database , 2010, WAIM Workshops.