RDF Database Systems: Triples Storage and SPARQL Query Processing

RDF Database Systems is a cutting-edge guide that distills everything you need to know to effectively use or design an RDF database. This book starts with the basics of linked open data and covers the most recent research, practice, and technologies to help you leverage semantic technology. With an approach that combines technical detail with theoretical background, this book shows how to design and develop semantic web applications, data models, indexing and query processing solutions.

Understand the Semantic Web, RDF, RDFS, SPARQL, and OWL within the context of relational database management and NoSQL systems.

Learn about the prevailing RDF triples solutions for both relational and non-relational databases, including column family, document, graph, and NoSQL.

Implement systems using RDF data with helpful guidelines and various storage solutions for RDF.

Process SPARQL queries with detailed explanations of query optimization, query plans, caching, and more.

Evaluate which approaches and systems to use when developing Semantic Web applications with a helpful description of commercial and open-source systems.
